How can I index Word documents?

Otis Gospodnetic

To index Word documents, you first need to parse them and extract the text you want to index. Here are some Word parsers that can help with that:

Jakarta Apache POI includes an early-stage Microsoft Word parser for the versions of Word shipped with Office 97, 2000, and XP.

Simple Text Extractor Library for use with PDF documents. Relies on POI.
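The parse-then-index flow described above can be sketched roughly as follows. This is a minimal illustration using only the Java standard library, not any parser's or indexer's actual API: `TextExtractor`, `InMemoryIndex`, and `indexDocument` are hypothetical names invented for this sketch. In a real setup, a Word parser such as POI would sit behind the extractor interface, and the extracted text would be handed to your indexer rather than to the toy index shown here.

```java
import java.nio.charset.StandardCharsets;
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

public class WordIndexSketch {

    /** Hypothetical abstraction: turns raw document bytes into plain text. */
    interface TextExtractor {
        String extract(byte[] rawDocument);
    }

    /** Toy inverted index (an assumption for illustration): term -> ids of documents containing it. */
    static class InMemoryIndex {
        private final Map<String, Set<String>> postings = new HashMap<>();

        void add(String docId, String text) {
            // Lowercase and split on anything that is not a letter or digit.
            for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
                if (!token.isEmpty()) {
                    postings.computeIfAbsent(token, k -> new TreeSet<>()).add(docId);
                }
            }
        }

        Set<String> search(String term) {
            return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptySet());
        }
    }

    /** Step 1: extract text from the document. Step 2: index the extracted text. */
    static void indexDocument(InMemoryIndex index, TextExtractor extractor,
                              String docId, byte[] raw) {
        String text = extractor.extract(raw);
        index.add(docId, text);
    }

    public static void main(String[] args) {
        // Stand-in extractor that treats the bytes as UTF-8 text; a real Word
        // parser would replace this lambda.
        TextExtractor fake = raw -> new String(raw, StandardCharsets.UTF_8);

        InMemoryIndex index = new InMemoryIndex();
        indexDocument(index, fake, "report.doc",
                "Quarterly sales report".getBytes(StandardCharsets.UTF_8));

        System.out.println(index.search("sales"));
    }
}
```

Keeping extraction behind a small interface means the indexing code does not need to change if you swap one Word parser for another.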
