Research on Addressing Method in XML File Based on XPointer

XML is a markup language that describes data and is widely used to exchange data across platforms. Currently, there are three main tools for addressing XML document: XLink, XPath, and XPointer. Among them, XPointer is an advanced addressing tool, which can not only address elements, but also can address specific data such as strings, points and ranges in XML documents. However, for the positioning based on XPointer, how to extract the non-well-formed data content in XML document is still a problem. This paper focuses on the extraction of non-well-formed data content in XML documents. Based on XPath 3.0, extracting and filtering nodes was analyzed. Based on XSLT template, the content of XML documents was selectively output. Finally, a location system based on XPointer was derived, and ultimately achieved advanced addressing for XML documents. 20 XML files were selected as experiment samples to verify the model proposed in the paper. The experimental results demonstrate that the proposed method can locate and represent non-well-formed as well as well-formed data content in XML documents.