Lucene Section Index
How can I index Powerpoint documents?
In order to index Powerpoint documents you need to first parse them to extract text that you want to index from them. You can use the Jakarta Apache POI, as it contains a parser for Powerpoint do...more
Can Lucene do a "search within search", so that the second search is constrained by the results of the first query?
Yes. There are two primary options:
Use QueryFilter with the previous query as the filter.
(you can search the mailing list archives for QueryFilter and Doug Cutting's recommendations against ...more
Can I use Lucene to index text in Chinese, Japanese, Korean, and other multi-byte character sets?
Yes, you can. Lucene is not limited to English, nor any other language. To index text properly, you need to use an Analyzer appropriate for the language of the text you are indexing. Lucene's d...more
Where does the name Lucene come from?
Lucene is Doug Cutting's wife's middle name, and her maternal grandmother's first name.
Where does the name Lucene come from?
Lucene is Doug Cutting's wife's middle name, and her maternal grandmother's first name.
Can I cache search results with Lucene?
Lucene does come with a simple cache mechanism, if you use Lucene Filters.
The classes to look at are CachingWrapperFilter and QueryFilter.
more
Why can't I use Lucene with IBM JDK 1.3.1?
Apparently there is a bug in IBM's JIT code in JDK 1.3.1.
To work around it, disable JIT for the org.apache.lucene.store.OutputStream.writeInt method by setting the following environment variable:...more
What is the difference between IndexWriter.addIndexes(IndexReader[]) and IndexWriter.addIndexes(Directory[]), besides them taking different arguments?
What is the difference between IndexWriter.addIndexes(IndexReader[]) and
IndexWriter.addIndexes(Directory[]), besides them taking different arguments?
If I have two indexes and use the MultiSearcher will it be faster than only one index with all my documents?
That depends on the environment where MultiSearcher is used.
If you have a single computer with a single CPU, then it may actually be a bit slower. However, it could be faster if you're either r...more
If I use a compound file-style index, do I still need to optimize my index?
Yes. Each .cfs file created in the compound file-style index represents a single segment, which means you can still merge multiple segments into a single segment by optimizing the index.
more
What version of the JDK is required to use Lucene?
Lucene requires at least version 1.2 of the JDK. Versions 1.3 and 1.4 of the JDK are also supported.
Which version of the JDK does Lucene require?
Lucene requires at least version 1.2 of the JDK. Versions 1.3 and 1.4 of the JDK are also supported.
Can I search an index while it is being optimized?
Yes, an index can be searched and optimized simultaneously.
Can I search an index while it is being optimized?
Yes, an index can be searched and optimized simultaneously.
How can I index JSP files?
To index the content of JSPs that a user would see using a Web browser, you would need to write an application that acts as a Web client, in order to mimic the Web browser behaviour. Once you hav...more