Is there any tool/API to convert formatted file types to html?

Posted By:   Saran_Babu
Posted On:   Tuesday, November 14, 2006 08:19 AM

Is there a Java API or Java tool that converts formatted files (.doc, .ppt, .pdf, etc.) to HTML?


I mainly need these two: .doc to HTML and .ppt to HTML.



Please help me, it's very urgent.




Thanks in advance,

Saran

Re: Is there any tool/API to convert formatted file types to html?

Posted By:   Robert_Lybarger  
Posted On:   Tuesday, November 14, 2006 08:37 AM

Agreed with WarnerJan. Use "Save As" to get Word's or PowerPoint's best interpretation of the source material as HTML pages. Expect that you might have to clean up afterwards, but for any practical purpose it's the only way.

Now, had your source material been an OpenOffice document, you could just unzip the *.odt file, take your information directly out of the content.xml file itself, and do whatever you wanted with it. This is why people who know are pushing for the open document formats.
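
To illustrate the *.odt point, here is a minimal sketch using only the JDK's java.util.zip classes. The class name OdtContentReader and the method readContentXml are made up for this example, and it assumes a Java 9+ runtime (for InputStream.readAllBytes) and that content.xml is UTF-8 encoded. It only pulls out the raw XML; turning that XML into HTML is a separate step not shown here.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

// Hypothetical helper, not part of any library.
public class OdtContentReader {

    /**
     * Opens the .odt file as a ZIP archive and returns the text of its
     * content.xml entry. Assumes the entry is UTF-8 encoded.
     */
    public static String readContentXml(String odtPath) throws IOException {
        try (ZipFile zip = new ZipFile(odtPath)) {
            ZipEntry entry = zip.getEntry("content.xml");
            if (entry == null) {
                throw new IOException("No content.xml found in " + odtPath);
            }
            try (InputStream in = zip.getInputStream(entry)) {
                return new String(in.readAllBytes(), StandardCharsets.UTF_8);
            }
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java OdtContentReader <file.odt>");
            return;
        }
        // Print the raw XML; parse it with any XML parser from here.
        System.out.println(readContentXml(args[0]));
    }
}
```

From there the XML can be fed to a regular XML parser or an XSLT stylesheet, which is the part that has no equivalent for the binary .doc and .ppt formats.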

Re: Is there any tool/API to convert formatted file types to html?

Posted By:   WarnerJan_Veldhuis  
Posted On:   Tuesday, November 14, 2006 08:23 AM

Both MS Word and MS PowerPoint have a "Save As..." function that lets you save the document as HTML. Since our friends in Redmond keep their file formats very closed, doing the conversion yourself is nearly impossible. Perhaps POI (Apache) can help out a bit...