[html_parser][parser][html][java_lib][java][lib]Jericho HTML Parser : 「Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML. It also provides high-level HTML form manipulation functions.」
[xpath][java][lib][java_lib][xml]jaxen: universal Java XPath engine - jaxen : 「Jaxen is an open source XPath library written in Java. It is adaptable to many different object models, including DOM, XOM, dom4j, and JDOM. It is also possible to write adapters that treat non-XML trees such as compiled Java byte code or Java beans as