An interesting Java based open source tool Web harvest. Lets you extract data from the existing web pages for the future processing. The core idea is HTML to XML transformation with the future processing of XML.
P.S. On the web components level you can use the following taglib from Coldtags suite: Request taglib. It lets you easily extract data from web pages without the regular expressions and other extra complexity.
No comments:
Post a Comment