Measuring the similarity of PML documents with RFID-based sensors

The electronic product code (EPC) network is an important part of the internet of things. The physical mark-up language (PML) is to represent and describe data related to objects in EPC network. The PML documents of each component to exchange data in EPC network system are XML documents based on PML Core schema. For managing theses huge amount of PML documents of tags captured by radio frequency identification (RFID) readers, it is inevitable to develop the high-performance technology, such as filtering and integrating these tag data. So in this paper, we propose an approach for measuring the similarity of PML documents based on Bayesian network of several sensors. With respect to the features of PML, while measuring the similarity, we firstly reduce the redundancy data except information of EPC. On the basis of this, the Bayesian network model derived from the structure of the PML documents being compared is constructed.

[1]  Carlos Alberto Heuser,et al.  Matching XML documents in highly dynamic applications , 2008, DocEng '08.

[2]  Richard Chbeir,et al.  An overview on XML similarity: Background, current trends and future directions , 2009, Comput. Sci. Rev..

[3]  Tomi Silander,et al.  A Simple Approach for Finding the Globally Optimal Bayesian Network Structure , 2006, UAI.

[4]  Felix Naumann,et al.  DogmatiX tracks down duplicates in XML , 2005, SIGMOD '05.

[5]  A. Hasman,et al.  Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[6]  Judea Pearl,et al.  Bayesian Networks , 1998, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[7]  Joe Marini,et al.  Document Object Model , 2002, Encyclopedia of GIS.

[8]  Athanasios V. Vasilakos,et al.  Joint Forensics-Scheduling Strategy for Delay-Sensitive Multimedia Applications over Heterogeneous Networks , 2011, IEEE Journal on Selected Areas in Communications.

[9]  Edleno Silva de Moura,et al.  Measuring similarity between collection of values , 2004, WIDM '04.

[10]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[11]  Timos K. Sellis,et al.  A methodology for clustering XML documents by structure , 2006, Inf. Syst..

[12]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[13]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[14]  H. V. Jagadish,et al.  Evaluating Structural Similarity in XML Documents , 2002, WebDB.

[15]  Ron S. Kenett,et al.  Encyclopedia of statistics in quality and reliability , 2007 .

[16]  Haruo Yokota,et al.  LAX: An Efficient Approximate XML Join Based on Clustered Leaf Nodes for XML Data Integration , 2005, BNCOD.

[17]  Damith C. Ranasinghe,et al.  EPC Network Architecture , 2008 .

[18]  Russ Ferguson,et al.  The Document Object Model , 2015 .

[19]  Jati K. Sengupta,et al.  Introduction to Information , 1993 .

[20]  Haim Levkowitz,et al.  Introduction to information retrieval (IR) , 2008 .

[21]  Pável Calado,et al.  Structure-based inference of xml similarity for fuzzy duplicate detection , 2007, CIKM '07.

[22]  O. Kitao,et al.  I: METHODOLOGY , 2003, Deception: Counterdeception and Counterintelligence.