How can I parse an HTML page to capture a URL returned from, e.g., a search engine?

Davanum Srinivas

Sun has a good article (with source code) at:

Writing a Web Crawler in the Java Programming Language
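
As a rough illustration of the basic idea, here is a minimal sketch that pulls the href values out of anchor tags using only java.util.regex. It is not taken from the article; the class name LinkExtractor, the method extractLinks, and the example.com URLs are all made up for this example. A regular expression is only a crude approximation of HTML parsing, so a real HTML parser is likely a better choice for messy pages.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not from the Sun article: collects href values from <a> tags.
public class LinkExtractor {

    // Matches an opening <a ...> tag and captures the href value, which may be
    // double-quoted (group 1), single-quoted (group 2), or unquoted (group 3).
    private static final Pattern HREF = Pattern.compile(
            "<a\\b[^>]*?\\bhref\\s*=\\s*(?:\"([^\"]*)\"|'([^']*)'|([^\\s>]+))",
            Pattern.CASE_INSENSITIVE);

    // Returns the href values in the order they appear in the given HTML text.
    public static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            // Exactly one of the three groups is non-null for each match.
            String url = m.group(1);
            if (url == null) {
                url = m.group(2);
            }
            if (url == null) {
                url = m.group(3);
            }
            links.add(url);
        }
        return links;
    }

    public static void main(String[] args) {
        // Assumed sample input; in practice this would be the fetched page text.
        String page = "<a href=\"http://example.com/result1\">First</a>"
                + "<a href='http://example.com/result2'>Second</a>";
        for (String link : extractLinks(page)) {
            System.out.println(link);
        }
    }
}
```

To run this against a live search results page, you would first read the page into a String (for example with java.net.URL and an InputStream) and then pass it to extractLinks. Relative links would still need to be resolved against the page's base URL.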