Overview Units Class Hierarchy Classes, Interfaces, Objects and Records Types Variables Constants Functions and Procedures Identifiers |
Unit simplehtmltreeparser
Description
This unit contains a html/xml -> tree converter
Overview
Classes, Interfaces, Objects and Records
Functions and Procedures
Types
Constants
Variables
Description
Functions and Procedures
function xmlStrEscape(s: string; attrib: boolean = false):string; |
|
function xmlStrWhitespaceCollapse(const s: string):string; |
|
function equalNamespaces(const ans, bns: INamespace): boolean; inline; |
|
function equalNamespaces(const ans, bns: string): boolean; inline; |
|
function namespaceGetURL(const n: INamespace): string; inline; |
|
function strEncodingFromContentType(const contenttype: string): TSystemCodePage; |
|
function isInvalidUTF8(const s: string): boolean; |
|
Types
TXQHashKeyString = TFLRERawByteString; |
|
TTreeNodeType = (...); |
The type of a tree element. <Open>, text, or </close>
Values
-
tetOpen:
-
tetClose:
-
tetText:
-
tetComment:
-
tetProcessingInstruction:
-
tetAttribute:
-
tetDocument:
-
tetInternalDoNotUseCDATAText:
-
tetNamespace:
|
TTreeNodeFindOptions = set of (tefoIgnoreType, tefoIgnoreText, tefoCaseSensitive, tefoNoChildren, tefoNoGrandChildren); |
Controls the search for a tree element. ignore type: do not check for a matching type, ignore text: do not check for a matching text, case sensitive: do not ignore the case, no descend: only check elements that direct children of the current node
|
TStringComparisonFunc = function (const a,b: string): boolean of object; |
|
TBasicParsingState = (...); |
Values
-
bpmBeforeHtml:
-
bpmBeforeHead:
-
bpmInHead:
-
bpmAfterHead:
-
bpmInBody:
-
bpmInFrameset:
-
bpmAfterBody:
-
bpmAfterAfterBody:
|
TParsingModel = (...); |
Parsing model used to interpret the document pmStrict: every tag must be closed explicitely (otherwise an exception is raised) pmHtml: accept everything, tries to create the best fitting tree using a heuristic to recover from faulty documents (no exceptions are raised), detect encoding
Values
-
pmStrict:
-
pmHTML:
-
pmUnstrictXML:
|
TInternetToolsFormat = (...); |
Values
-
itfXML:
-
itfHTML:
-
itfJSON:
-
itfXMLPreparsedEntity:
|
Constants
XMLNamespaceUrl_XML = 'http://www.w3.org/XML/1998/namespace'; |
|
XMLNamespaceUrl_XMLNS = 'http://www.w3.org/2000/xmlns/'; |
|
TreeNodesWithChildren = [tetOpen, tetDocument]; |
|
Variables
Author
Generated by PasDoc 0.14.0.
|