The HTML library allows Processing to
take an HTML document and turn it from a plain string into a tree structure. This
tree structure contains elements ranging from a simple HTML element,
e.g. <TABLE>, down to a single letter.
ProHtml gives you the possibility to iterate over this tree structure, and
get all of its elements and attributes. It makes it possible to directly use
all colors specified in the page.
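To make the idea of walking such a tree concrete, here is a minimal sketch in plain Java. It does not use the proHTML classes: Node, collectColors, countNodes and sampleTree are hypothetical names invented for this illustration, and the tree is built by hand rather than parsed. It only shows how a recursive walk can visit every element, read its attributes and turn "#rrggbb" values into color numbers.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a parsed document tree; this is NOT the proHTML API.
class TreeWalkSketch {

    // One element of the tree with its attributes and child elements.
    static class Node {
        final String name;
        final Map<String, String> attributes = new LinkedHashMap<>();
        final List<Node> children = new ArrayList<>();

        Node(String name) { this.name = name; }

        Node attr(String key, String value) { attributes.put(key, value); return this; }
        Node add(Node child) { children.add(child); return this; }
    }

    // Depth-first walk: collects every attribute value of the form "#rrggbb" as an int.
    static void collectColors(Node node, List<Integer> out) {
        for (String value : node.attributes.values()) {
            if (value.matches("#[0-9a-fA-F]{6}")) {
                out.add(Integer.parseInt(value.substring(1), 16));
            }
        }
        for (Node child : node.children) {
            collectColors(child, out);
        }
    }

    // Counts the node itself plus all of its descendants.
    static int countNodes(Node node) {
        int total = 1;
        for (Node child : node.children) {
            total += countNodes(child);
        }
        return total;
    }

    // Hand-built example tree: body > table > td.
    static Node sampleTree() {
        return new Node("body").attr("bgcolor", "#ffffff")
            .add(new Node("table").attr("border", "1")
                .add(new Node("td").attr("bgcolor", "#ff8800")));
    }

    public static void main(String[] args) {
        List<Integer> colors = new ArrayList<>();
        collectColors(sampleTree(), colors);
        System.out.println(countNodes(sampleTree()) + " nodes, colors: " + colors);
    }
}
```

In a Processing sketch the collected numbers could be passed to fill() or stroke() after adding an alpha value. For the real class and method names, see the proHTML documentation.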
To work with the library just put the prohtml.jar file in the code-folder
of your sketch.
The library was intended as a tool for visualising
the structure of an HTML document and for simplifying access to
its content and attributes.
The HTML document is parsed while it is loading, which makes the parsing
process quite fast: every incoming char of the file is parsed one
after the other and put into the chosen structure.
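The following plain-Java sketch illustrates what parsing while loading can look like. It is an assumption about the general technique, not proHTML's actual parser: StreamingParseSketch, Node, parse and flushText are hypothetical names, and the parser is deliberately simplified (no attributes, no comments, no self-closing tags, no check that a closing tag matches the open one). Each char is read from a Reader exactly once and placed into the tree as soon as a tag or text run is complete.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical illustration of parsing while reading; NOT proHTML's parser.
class StreamingParseSketch {

    static class Node {
        final String name;
        final List<Node> children = new ArrayList<>();
        Node(String name) { this.name = name; }
    }

    // Reads one char at a time and attaches each finished tag or text run to the tree.
    static Node parse(Reader in) throws IOException {
        Node root = new Node("#document");
        Deque<Node> open = new ArrayDeque<>();   // stack of currently open elements
        open.push(root);
        StringBuilder buffer = new StringBuilder();
        boolean inTag = false;
        int c;
        while ((c = in.read()) != -1) {
            char ch = (char) c;
            if (ch == '<') {
                flushText(buffer, open.peek());  // text before the tag is complete
                inTag = true;
            } else if (ch == '>' && inTag) {
                String tag = buffer.toString().trim();
                buffer.setLength(0);
                inTag = false;
                if (tag.startsWith("/")) {
                    if (open.size() > 1) open.pop();   // closing tag: step back up
                } else {
                    Node element = new Node(tag.toLowerCase());
                    open.peek().children.add(element);
                    open.push(element);                // opening tag: descend
                }
            } else {
                buffer.append(ch);
            }
        }
        flushText(buffer, open.peek());
        return root;
    }

    // Stores pending non-blank text as a child node named "#text:<content>".
    static void flushText(StringBuilder buffer, Node parent) {
        String text = buffer.toString().trim();
        if (!text.isEmpty()) {
            parent.children.add(new Node("#text:" + text));
        }
        buffer.setLength(0);
    }

    public static void main(String[] args) throws IOException {
        Node root = parse(new StringReader("<table><tr><td>Hi</td></tr></table>"));
        System.out.println(root.children.get(0).name);
    }
}
```

Because the tree grows as the chars arrive, no second pass over the complete document string is needed.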
ProHtml also allows you to parse all elements of a document into
a list. Of course you then lose the tree structure. You can also
scan an HTML document for certain elements with the HtmlElementFinder.
The HtmlImageFinder lets you extract the images
of a page. You get the complete link of each picture found, so you are
able to load and process it.
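As a rough illustration of what a "complete link" means here, the sketch below resolves the src value of each <img> tag against the address of the page. It is not the HtmlImageFinder: ImageLinkSketch and findImageLinks are hypothetical names, the example.com addresses are placeholders, and a simple regular expression stands in for real HTML parsing, so it only handles quoted src values without spaces.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration of resolving image links; NOT the HtmlImageFinder.
class ImageLinkSketch {

    // Matches the quoted src value of an <img> tag, ignoring case.
    private static final Pattern IMG_SRC = Pattern.compile(
        "<img\\b[^>]*?\\bsrc\\s*=\\s*[\"']([^\"']+)[\"']",
        Pattern.CASE_INSENSITIVE);

    // Returns the absolute address of every image referenced in the given HTML.
    static List<String> findImageLinks(String pageUrl, String html) {
        URI base = URI.create(pageUrl);
        List<String> links = new ArrayList<>();
        Matcher m = IMG_SRC.matcher(html);
        while (m.find()) {
            // Relative paths are resolved against the page address.
            links.add(base.resolve(m.group(1)).toString());
        }
        return links;
    }

    public static void main(String[] args) {
        String html = "<p><img src=\"pics/a.png\"><IMG alt='x' SRC='/b.jpg'></p>";
        for (String link : findImageLinks("http://example.com/gallery/index.html", html)) {
            System.out.println(link);
        }
    }
}
```

In a Processing sketch, each resulting address could then be handed to loadImage().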
Download:
prohtml.zip
Contains:
library folder (with source code and jar file)
example folder (all examples as processing sketches)
doc folder (complete documentation)
javadoc folder (for further information)
Install:
Put the prohtml folder in the libraries folder of Processing.
Further steps:
Take a look at the proHTML docs and the examples in the prohtml_examples folder. Place the examples in the processing/examples folder or in your sketch folder for fast access from your sketchbook.