Package org.jsoup.parser

Contains the HTML parser, tag specifications, and HTML tokeniser.
  • Class Summary 
    Class Description
    CharacterReader
    CharacterReader consumes tokens off a string.
    HtmlTreeBuilder
    HTML Tree Builder; creates a DOM from Tokens.
    ParseError
    A Parse Error records an error in the input HTML that occurs in either the tokenisation or the tree building phase.
    ParseErrorList
    A container for ParseErrors.
    Parser
    Parses HTML into a Document.
    ParseSettings
    Controls parser settings, to optionally preserve tag and/or attribute name case.
    Tag
    HTML Tag capabilities.
    TokenQueue
    A character queue with parsing helpers.
    XmlTreeBuilder
    Use the XmlTreeBuilder when you want to parse XML without any of the HTML DOM rules being applied to the document.