dcsimg
How to index only meta tags in HTML files ?
1 posts in topic
Flat View  Flat View
TOPIC ACTIONS:
 

Posted By:   colle_stephane
Posted On:   Wednesday, March 13, 2002 11:51 PM

I'm a beginner with Lucene.

I was previously working with Swish-e. This soft enables you to index meta-tags (authors, keywords...) in HTML files.

How can I do the same with Lucene ? I heard from 'Fields' but is it the solution ?

Re: How to index only meta tags in HTML files ?

Posted By:   Otis_Gospodnetic  
Posted On:   Thursday, March 14, 2002 10:09 PM

First you have to be able to parse those META tags out of HTML documents. This is really not a part of Lucene and you could use something like JTidy (you can get it from sf.net) to do that. Once you have the data you can add it to the index as Fields, yes.
About | Sitemap | Contact