HTML and XML Parsers.
Tidy's HTML parser corrects many conditions and enforces certain user preferences during the parsing process. The XML parser produces a tree of nodes useful to Tidy but also suitable for use in other XML processing applications.
tidy.h
for the complete license.Go to the source code of this file.
Functions | |
Bool | TY_❪CheckNodeIntegrity❫ (Node *node) |
Is used to perform a node integrity check recursively after parsing an HTML or XML document. More... | |
void | TY_❪CoerceNode❫ (TidyDocImpl *doc, Node *node, TidyTagId tid, Bool obsolete, Bool expected) |
Transforms a given node to another element, for example, from a p to a br . More... | |
Node * | TY_❪DiscardElement❫ (TidyDocImpl *doc, Node *element) |
Remove node from markup tree and discard it. More... | |
Node * | TY_❪DropEmptyElements❫ (TidyDocImpl *doc, Node *node) |
Trims a tree of empty elements recursively, returning the next node. More... | |
void | TY_❪InsertNodeAfterElement❫ (Node *element, Node *node) |
Insert node into markup tree after element. More... | |
void | TY_❪InsertNodeAtEnd❫ (Node *element, Node *node) |
Insert node into markup tree as the last element of content of element. More... | |
void | TY_❪InsertNodeAtStart❫ (Node *element, Node *node) |
Insert node into markup tree as the firt element of content of element. More... | |
void | TY_❪InsertNodeBeforeElement❫ (Node *element, Node *node) |
Insert node into markup tree before element. More... | |
Bool | TY_❪IsBlank❫ (Lexer *lexer, Node *node) |
Indicates whether or not a text node is blank, meaning that it consists of nothing, or a single space. More... | |
Bool | TY_❪IsJavaScript❫ (Node *node) |
Indicates whether or not a node is declared as containing javascript code. More... | |
Bool | TY_❪IsNewNode❫ (Node *node) |
Used to check if a node uses CM_NEW, which determines how attributes without values should be printed. More... | |
void | TY_❪ParseDocument❫ (TidyDocImpl *doc) |
Parses a document after lexing using the HTML parser. More... | |
void | TY_❪ParseXMLDocument❫ (TidyDocImpl *doc) |
Parses a document after lexing using the XML parser. More... | |
Node * | TY_❪RemoveNode❫ (Node *node) |
Extract a node and its children from a markup tree. More... | |
Bool | TY_❪TextNodeEndWithSpace❫ (Lexer *lexer, Node *node) |
Indicates whether or not a text node ends with a space or newline. More... | |
Node * | TY_❪TrimEmptyElement❫ (TidyDocImpl *doc, Node *element) |
Trims a single, empty element, returning the next node. More... | |
Bool | TY_❪XMLPreserveWhiteSpace❫ (TidyDocImpl *doc, Node *element) |
Indicates whether or not whitespace is to be preserved in XHTML/XML documents. More... | |