|
|||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectcom.gargoylesoftware.htmlunit.html.HTMLParser
public final class HTMLParser
SAX parser implementation that uses the neko HTMLConfiguration
to parse HTML into a HtmlUnit-specific DOM (HU-DOM) tree.
Note that the parser currently does not handle CDATA or comment sections, i.e. these do not appear in the resulting DOM tree
Method Summary | |
---|---|
static IElementFactory |
getFactory(java.lang.String tagName)
|
static boolean |
getIgnoreOutsideContent()
Get the state of the flag to ignore content outside the BODY and HTML tags |
static HtmlPage |
parse(WebResponse webResponse,
WebWindow webWindow)
parse the HTML content from the given WebResponse into an object tree representation |
static void |
parseFragment(DomNode parent,
java.lang.String source)
Parses the HTML content from the given string into an object tree representation. |
static void |
setIgnoreOutsideContent(boolean ignoreOutsideContent)
Set the flag to control validation of the HTML content that is outside of the BODY and HTML tags. |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Method Detail |
---|
public static IElementFactory getFactory(java.lang.String tagName)
tagName
- an HTML element tag name
public static boolean getIgnoreOutsideContent()
public static HtmlPage parse(WebResponse webResponse, WebWindow webWindow) throws java.io.IOException
webResponse
- the response datawebWindow
- the web window into which the page is to be loaded
null
if the <HTML>
tag is missing
java.io.IOException
- io errorpublic static void parseFragment(DomNode parent, java.lang.String source) throws org.xml.sax.SAXException, java.io.IOException
parent
- the parent for the new nodessource
- the (X)HTML to be parsed
org.xml.sax.SAXException
- if a SAX error occurs
java.io.IOException
- if an IO error occurspublic static void setIgnoreOutsideContent(boolean ignoreOutsideContent)
ignoreOutsideContent
- - boolean flag to set
|
|||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |