And non validating parsers


15-Aug-2015 16:19

and non validating parsers-18

dating cold

xml Valid Ctxt vctxt : The validity context xml Parser Input State instate : current type of input int token : next char look-ahead char * directory : the data directory Node name stack const xml Char * name : Current parsed Node int name Nr : Depth of the parsing stack int name Max : Max depth of the parsing stack const xml Char * * name Tab : array of nodes long nb Chars : number of xml Char processed long check Index : used by progressive parsing lookup int keep Blanks : ugly but ...

int disable SAX : SAX callbacks are disabled int in Subset : Parsing is in int 1/ext 2 subset const xml Char * int Sub Name : name of subset xml Char * ext Sub URI : URI of external subset xml Char * ext Sub System : SYSTEM ID of external subset xml:space int * space : Should the parser preserve spaces int space Nr : Depth of the parsing stack int space Max : Max depth of the parsing stack int * space Tab : array of space infos int depth : to prevent entity substitution loops xml Parser Input Ptr entity : used to check entities boundaries int charset : encoding of the in-memory content actua int nodelen : Those two fields are there to int nodemem : Speed up large node parsing int pedantic : signal pedantic warnings void * _private : For user data, libxml won't touch it int loadsubset : should the external subset be loaded int linenumbers : set line number in element content void * catalogs : document's own catalog int recovery : run in recovery mode int progressive : is this a progressive parsing xml Dict Ptr dict : dictionary for the parser const xml Char * * atts : array for the attributes callbacks int maxatts : the size of the array int docdict : * pre-interned strings * const xml Char * str_xml const xml Char * str_xmlns const xml Char * str_xml_ns : * Everything below is used only by the n int sax2 : operating in the new SAX mode int ns Nr : the number of inherited namespaces int ns Max : the size of the arrays const xml Char * * ns Tab : the array of prefix/namespace name int * attallocs : which attribute were allocated void * * push Tab : array of data for push xml Hash Table Ptr atts Default : defaulted attributes if any xml Hash Table Ptr atts Special : non-CDATA attributes if any int ns Well Formed : is the document XML Nanespace okay int options : * Those fields are needed only for tream int dict Names : Use dictionary names for the tree int free Elems Nr : number of freed element nodes xml Node Ptr free Elems : List of freed element nodes int free Attrs Nr : number of freed attributes nodes xml Attr Ptr free Attrs : * the complete error informations for th xml Error last Error xml Parser Mode parse Mode : the parser mode unsigned long nbentities : number of entities references unsigned long sizeentities : size of parsed entities for use by HTML xml Parser Node Info * node Info : Current Node Info int node Info Nr : Depth of the parsing stack int node Info Max : Max depth of the parsing stack xml Parser Node Info * node Info Tab : array of node Infos int input_id : we need to label inputs unsigned long sizeentcopy : volume of entity copy }Structure xml Parser Inputstruct _xml Parser Input { xml Parser Input Buffer Ptr buf : UTF-8 encoded buffer const char * filename : The file analyzed, if any const char * directory : the directory/base of the file const xml Char * base : Base of the array to parse const xml Char * cur : Current char being parsed const xml Char * end : end of the array to parse int length : length if known int line : Current line int col : * NOTE: consumed is only tested for equa unsigned long consumed : How many xml Chars already consumed xml Parser Input Deallocate free : function to deallocate the base const xml Char * encoding : the encoding string for entity const xml Char * version : the version string for entity int standalone : Was that entity marked standalone int id : an unique identifier for the entity }Structure xml Parser Input Bufferstruct _xml Parser Input Buffer { void * context xml Input Read Callback readcallback xml Input Close Callback closecallback xml Char Encoding Handler Ptr encoder : I18N conversions to UTF-8 xml Buf Ptr buffer : Local buffer encoded in UTF-8 xml Buf Ptr raw : if encoder !

const xml Char * name : Name of the DTD struct _xml Node * children : the value of the property link struct _xml Node * last : last child link struct _xml Doc * parent : child-parent link struct _xml Node * next : next sibling link struct _xml Node * prev : previous sibling link struct _xml Doc * doc : the containing document End of common p void * notations : Hash table for notations if any void * elements : Hash table for elements if any void * attributes : Hash table for attributes if any void * entities : Hash table for entities if any const xml Char * External ID : External identifier for PUBLIC DTD const xml Char * System ID : URI for a SYSTEM or PUBLIC DTD void * pentities : Hash table for param entities if any }Structure xml Elementstruct _xml Element { void * _private : application data xml Element Type type : XML_ELEMENT_DECL, must be second !

const xml Char * name : Element name struct _xml Node * children : NULL struct _xml Node * last : NULL struct _xml Dtd * parent : - DTD struct _xml Node * next : next sibling link struct _xml Node * prev : previous sibling link struct _xml Doc * doc : the containing document xml Element Type Val etype : The type xml Element Content Ptr content : the allowed element content xml Attribute Ptr attributes : List of the declared attributes const xml Char * prefix : the namespace prefix if any xml Regexp Ptr cont Model : the validating regexp void * cont Model }Structure xml Element Contentstruct _xml Element Content { xml Element Content Type type : PCDATA, ELEMENT, SEQ or OR xml Element Content Occur ocur : ONCE, OPT, MULT or PLUS const xml Char * name : Element name struct _xml Element Content * c1 : first child struct _xml Element Content * c2 : second child struct _xml Element Content * parent : parent const xml Char * prefix : Namespace prefix }Enum xml Element Type { XML_ELEMENT_NODE = 1 XML_ATTRIBUTE_NODE = 2 XML_TEXT_NODE = 3 XML_CDATA_SECTION_NODE = 4 XML_ENTITY_REF_NODE = 5 XML_ENTITY_NODE = 6 XML_PI_NODE = 7 XML_COMMENT_NODE = 8 XML_DOCUMENT_NODE = 9 XML_DOCUMENT_TYPE_NODE = 10 XML_DOCUMENT_FRAG_NODE = 11 XML_NOTATION_NODE = 12 XML_HTML_DOCUMENT_NODE = 13 XML_DTD_NODE = 14 XML_ELEMENT_DECL = 15 XML_ATTRIBUTE_DECL = 16 XML_ENTITY_DECL = 17 XML_NAMESPACE_DECL = 18 XML_XINCLUDE_START = 19 XML_XINCLUDE_END = 20 XML_DOCB_DOCUMENT_NODE = 21 } Structure xml Entitystruct _xml Entity { void * _private : application data xml Element Type type : XML_ENTITY_DECL, must be second !

= NULL buffer for output int written : total number of byte written int error }Structure xml Parser Ctxtstruct _xml Parser Ctxt { struct _xml SAXHandler * sax : The SAX handler void * user Data : For SAX interface only, used by DOM bui xml Doc Ptr my Doc : the document being built int well Formed : is the document well formed int replace Entities : shall we replace entities ?

const xml Char * version : the XML version string const xml Char * encoding : the declared encoding, if any int standalone : standalone document int html : an HTML(1)/Docbook(2) document * 3 is H xml Parser Input Ptr input : Current input stream int input Nr : Number of current input streams int input Max : Max number of input streams xml Parser Input Ptr * input Tab : stack of inputs Node analysis stack onl xml Node Ptr node : Current parsed Node int node Nr : Depth of the parsing stack int node Max : Max depth of the parsing stack xml Node Ptr * node Tab : array of nodes int record_info : Whether node info should be kept xml Parser Node Info Seq node_seq : info about each node parsed int err No : error code int has External Subset : reference and external subset int has PErefs : the internal subset has PE refs int external : are we parsing an external entity int valid : is the document valid int validate : shall we try to validate ?

and non validating parsers-76

Dirty sex chatzy

char * name : name/filename/URI of the document struct _xml Node * children : the document tree struct _xml Node * last : last child link struct _xml Node * parent : child-parent link struct _xml Node * next : next sibling link struct _xml Node * prev : previous sibling link struct _xml Doc * doc : autoreference to itself End of common p int compression : level of zlib compression int standalone : standalone document (no external refs) struct _xml Dtd * int Subset : the document internal subset struct _xml Dtd * ext Subset : the document external subset struct _xml Ns * old Ns : Global namespace, the old way const xml Char * version : the XML version string const xml Char * encoding : external initial encoding, if any void * ids : Hash table for ID attributes if any void * refs : Hash table for IDREFs attributes if any const xml Char * URL : The URI for that document int charset : encoding of the in-memory content actua struct _xml Dict * dict : dict used to allocate names or NULL void * psvi : for type/PSVI informations int parse Flags : set of xml Parser Option used to parse th int properties : set of xml Doc Properties for this docume }Enum xml Doc Properties { XML_DOC_WELLFORMED = 1 : document is XML well formed XML_DOC_NSVALID = 2 : document is Namespace valid XML_DOC_OLD10 = 4 : parsed with old XML-1.0 parser XML_DOC_DTDVALID = 8 : DTD validation was successful XML_DOC_XINCLUDE = 16 : XInclude substitution was done XML_DOC_USERBUILT = 32 : Document was built using the API and not by parsing an instance XML_DOC_INTERNAL = 64 : built for internal processing XML_DOC_HTML = 128 : parsed or built HTML document } Structure xml Dtdstruct _xml Dtd { void * _private : application data xml Element Type type : XML_DTD_NODE, must be second !const xml Char * name : Entity name struct _xml Node * children : First child link struct _xml Node * last : Last child link struct _xml Dtd * parent : - DTD struct _xml Node * next : next sibling link struct _xml Node * prev : previous sibling link struct _xml Doc * doc : the containing document xml Char * orig : content without ref substitution xml Char * content : content or ndata if unparsed int length : the content length xml Entity Type etype : The entity type const xml Char * External ID : External identifier for PUBLIC const xml Char * System ID : URI for a SYSTEM or PUBLIC Entity struct _xml Entity * nexte : unused const xml Char * URI : the full URI as computed int owner : does the entity own the childrens int checked : was the entity content checked this is }Structure xml IDstruct _xml ID { struct _xml ID * next : next ID const xml Char * value : The ID name xml Attr Ptr attr : The attribute holding it const xml Char * name : The attribute if attr is not available int lineno : The line number if attr is not availabl struct _xml Doc * doc : The document holding the ID }Structure xml Nodestruct _xml Node { void * _private : application data xml Element Type type : type number, must be second !const xml Char * name : the name of the node, or the entity struct _xml Node * children : parent-parent link struct _xml Node * next : next sibling link struct _xml Node * prev : previous sibling link struct _xml Doc * doc : the containing document End of common p xml Ns * ns : pointer to the associated namespace xml Char * content : the content struct _xml Attr * properties : properties list xml Ns * ns Def : namespace definitions on this node void * psvi : for type/PSVI informations unsigned short line : line number unsigned short extra : extra data for XPath/XSLT }Structure xml Nsstruct _xml Ns { struct _xml Ns * next : next Ns link for this node xml Ns Type type : global or local const xml Char * href : URL for the namespace const xml Char * prefix : prefix for the namespace void * _private : application data struct _xml Doc * context : normally an xml Doc }Structure xml Output Bufferstruct _xml Output Buffer { void * context xml Output Write Callback writecallback xml Output Close Callback closecallback xml Char Encoding Handler Ptr encoder : I18N conversions to UTF-8 xml Buf Ptr buffer : Local buffer encoded in UTF-8 or ISOLat xml Buf Ptr conv : if encoder !Add a new node @elem as the next sibling of @cur If the new node was already inserted in a document it is first unlinked from its existing context. If the new node is ATTRIBUTE, it is added into properties instead of children.

Structure xml Attrstruct _xml Attr { void * _private : application data xml Element Type type : XML_ATTRIBUTE_NODE, must be second !const xml Char * name : the name of the property struct _xml Node * children : the value of the property struct _xml Node * last : NULL struct _xml Node * parent : child-parent link struct _xml Attr * next : next sibling link struct _xml Attr * prev : previous sibling link struct _xml Doc * doc : the containing document xml Ns * ns : pointer to the associated namespace xml Attribute Type atype : the attribute type if validating void * psvi : for type/PSVI informations }Structure xml Attributestruct _xml Attribute { void * _private : application data xml Element Type type : XML_ATTRIBUTE_DECL, must be second !