The html library allows Processing to take an html document and change it from a pur string to a tree structure. This treestructure contains different elements from a simple HTML Element eg <TABLE> up to a single letter.

ProHtml gives you the possibility to iterate over this tree structure, and get all of its elements and attributes. It makes it possible to directly use all colors specified in the page.

To work with the library just put the prohtml.jar file in the code-folder of your sketch.
The intention of that library was to build a tool to be able to visualise the structure of a html document and simplify the process of acessing content and attributes. The htmlDocument is parsed while it is loading. This makes the parsing process quiet fast. So every incoming char of the file is parsed one after the other and put in the chosen Structure.

ProHtml also allows you to parse all elements of the a document into a list. Of course you then loose the tree structure. Further you can scan a html document for certain elements with the HtmlElementFinder.

Using the HtmlImageFinder you get the possibilty to extract the images of a page. You get the complete link of each found picture, so you are able to load and process it.

Download: prohtml.zip

Contains:
library folder (with sourcecode and jarfile)
example folder (all examples as processing sketches)
doc folder (complete documentation)
javadoc folder (for further information)

Install:
put the prohtml folder in the librarie folder of processing

Further steps:
Take a look at the proHTML docs and the examples in the prohtml_examples folder. Place the examples in the processing/examples folder or in your sketch folder for fast access from your sketchbook.