A new release of the Open Source java GUI-Less browser is available,
which allows high-level manipulation of web pages, such as filling
forms, clicking links, accessing attributes and values of specific
elements within the pages, you do not have to create lower-level
requests of TCP/IP or HTTP, but just getPage(url), find a hyperlink,
click() and you have all the HTML, JavaScript, and Ajax are
automatically processed.
The most common use of HtmlUnit is test automation of web pages (even
with complex JavaScript libraries, like Google Web Toolkit and
jQuery), but sometimes it can be used for web scraping, or downloading
website content.
The main enhancements of this release:
- Improved JavaScript support, particularly full support for
MooTools, adding to already supported Google Web Toolkit 1.5/1.6,
jQuery, Mochikit and Sarissa
- Repackaged Rhino classes to allow the use of HtmlUnit and a
regular Rhino version in the same project
- Support all HTML elements
- Experimental WebClient.waitForBackgroundJavaScript() and
WebClient.waitForBackgroundJavaScriptStartingBefore() for simple, fast
and deterministic AJAX testing
- Reworked handling of background JavaScript tasks using Java 5
executors
- And as usual, various bug fixes
You can find more information in the official website (http://
htmlunit.sourceforge.net/), the development team is looking forward to
getting your feedback.