Converting large set of word documents automatically into xml, modify them and than convert them into latex, pdf, html -
A word about quality that is part of the quality management system word, is very sad for me because a ) It corrupts images in large doctors. B) Layouts are sometimes dissolved c) It is cumbersome to configure documentation for different clients.
I convert them to XML / html or text and latex, but it is not possible for 400 documents. I know that I can print word documents directly on pdf such as primo pdf, but it is not flexible enough because i need to modify the content.
Does the structure of the document change into plain text, titles, tables, images and XML? Later, I would like to convert XML to HTML, Latex and PDF in accordance with the preferences of our customers, and can I modify the content too? Is there a way to go to Xslt to convert xml into other formats?
Thanks for any advice.
You can convert your documents to Word 2007. Office 2007 documents are XML documents: just change the file extension to .zip
and upzip. Additionally, Microsoft Office 2007 publishes an API to work with documents that have higher levels than working with XML tags.
Comments
Post a Comment