Searching PPT

Posted By:   Divya_Jesuraj
Posted On:   Monday, July 26, 2004 02:53 PM

Hi,

I know it isn't straightforward or easy to search PowerPoint files, but does anyone have finished parser code (in Java) they are willing to share?

Or tips on how to go about doing it...

~Divya

Re: Searching PPT

Posted By:   Benoit_Quintin  
Posted On:   Tuesday, July 27, 2004 12:34 PM

Well, PPT is a binary format, while Lucene is really equipped to deal with plain text. Maybe you could parse the PPT with Apache's POI project, get the raw text into an arbitrary XML format, then feed that into Lucene as separate fields (filename, text content, file size, etc.). Then you could use those indexed fields as search parameters. But searching THROUGH the PPT files themselves, I don't know how you can do that...
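
To make the middle step concrete, here is a rough sketch of what the intermediate XML could look like. It uses only the JDK and assumes the slide text has already been pulled out by some extractor (POI or otherwise), so the extraction and the Lucene indexing are not shown. The class name PptFieldDocument, the element names and the field names are made up for illustration and are not part of POI or Lucene.

```java
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Hypothetical helper: wraps already-extracted slide text in a simple
// field-oriented XML document that an indexer could read later.
class PptFieldDocument {

    // Builds <document><field name="...">value</field>...</document>.
    // The text argument is assumed to come from a separate extraction step.
    static String toXml(String filename, long sizeBytes, String text) {
        try {
            Document dom = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            Element root = dom.createElement("document");
            dom.appendChild(root);

            addField(dom, root, "filename", filename);
            addField(dom, root, "filesize", Long.toString(sizeBytes));
            addField(dom, root, "content", text);

            // Serialize the DOM tree to a string without the XML declaration.
            Transformer t = TransformerFactory.newInstance().newTransformer();
            t.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            t.transform(new DOMSource(dom), new StreamResult(out));
            return out.toString();
        } catch (Exception e) {
            throw new IllegalStateException("Could not build field document", e);
        }
    }

    // Appends one <field name="..."> element; the DOM API handles escaping
    // of characters such as & and < when the tree is serialized.
    private static void addField(Document dom, Element parent,
                                 String name, String value) {
        Element field = dom.createElement("field");
        field.setAttribute("name", name);
        field.setTextContent(value);
        parent.appendChild(field);
    }

    public static void main(String[] args) {
        // Placeholder values standing in for a real file and its extracted text.
        System.out.println(toXml("slides.ppt", 48128L, "Quarterly results & outlook"));
    }
}
```

Each field element would then map to one indexed field on the Lucene side, so a query could target the content field alone or be combined with the filename or size.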