Lucene Section Index | Page 2
How can I index JSP files?
To index the content of JSPs that a user would see using a Web browser, you would need to write an application that acts as a Web client, in order to mimic the Web browser behaviour. Once you hav...more
How can I index Excel documents?
In order to index Excel documents you need to first parse them to extract text that you want to index from them. Here are some Excel parsers that can help you with that:
Jakarta Apache POI has a...more
How can I index HTML documents?
In order to index HTML documents you need to first parse them to extract text that you want to index from them. Here are some HTML parsers that can help you with that:
An example that uses JavaC...more
How can I index PDF documents?
In order to index PDF documents you need to first parse them to extract text that you want to index from them. Here are some PDF parsers that can help you with that:
PDFBox is a Java API from Be...more
How can I index RTF documents?
In order to index RTF documents you need to first parse them to extract text that you want to index from them. Here are some RTF parsers that can help you with that:
MajiX is a translation utili...more
How can I index Word documents?
In order to index Word documents you need to first parse them to extract text that you want to index from them. Here are some Word parsers that can help you with that:
Jakarta Apache POI has an ...more
How can I index XML documents?
In order to index XML documents you need to first parse them to extract text that you want to index from them. Here are some XML parsers that can help you with that:
See XML
Demo. This contrib...more
Is it safe an index while it is being optimized?
Yes, it is safe.
How do I retrieve all the values of a particular field that exists within an index, across all documents?
The trick is to enumerate terms with that field. Terms are sorted first
by field, then by text, so all terms with a given field are adjacent in
enumerations. Term enumeration is also efficient...more
Is there a way to retrieve the original term positions during the search?
The term positions are not stored in the index, so this cant be done without modifying Lucene.
The primary rationale for not including this in the index is that one typically displays ten or so d...more
Is there a way to retrieve the original term positions during the search? - Javadoc
Is there a way to retrieve the original term positions during the search?
How do I index non Latin characters?
The solution is to ensure that the query string is encoded the same way that strings in the index are.
For instance, something along the lines of this will work if your index is also using UTF-8 e...more
How do I write my own Analyzer?
Here is an example:
public class MyAnalyzer extends Analyzer
{
private static final Analyzer STANDARD = new StandardAnalyzer();
public TokenStream tokenStream(String field, final Reader...more
How do I update a document or a set of documents that are already indexed?
To update an index incrementally you must first delete the documents that were updated, and then re-add them to the index.
There is no direct update procedure in Lucene. There is no update() met...more
What are all possible concurrent Lucene requests?
query
read doc
write
delete
optimize
merge
query
Y
Y
Y
Y
Y
Y
read doc
Y
Y
Y
Y
Y
Y
write
Y
...more