Leveraging Communication Information among Readers for RFID Data Cleaning

Radio Frequency Identification (RFID) technologies are used in many applications for data collection. However, raw RFID readings are usually of low quality due to frequent occurrences of false negative, false positive and duplicate readings. A number of RFID data cleaning techniques are proposed to solve the problem. In this paper we explore to use communication information for RFID data cleaning and make RFID readers produce less dirty data at the early stage. First, we devise a reader communication protocol for efficiently utilizing the communication information among readers. Then, the cell event sequence tree with parameters is proposed. Finally, we present three novel RFID data cleaning methods, respectively for duplicate readings, false positive readings and data interpolating. To the best of our knowledge, this is the first work utilizing the communication information among readers in RFID data cleaning. We conduct extensive experiments, and the experimental results demonstrate the feasibility and effectiveness of our methods.

[1]  Gustavo Alonso,et al.  Declarative Support for Sensor Data Cleaning , 2006, Pervasive.

[2]  Philip S. Yu,et al.  A Sampling-Based Approach to Information Recovery , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[3]  Uwe Hansmann,et al.  Pervasive Computing , 2003 .

[4]  Amol Deshpande,et al.  Online Filtering, Smoothing and Probabilistic Modeling of Streaming data , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[5]  Fusheng Wang,et al.  Efficiently Filtering RFID Data Streams , 2006, CleanDB.

[6]  Klemens Böhm,et al.  Finding misplaced items in retail by clustering RFID data , 2010, EDBT '10.

[7]  Haixun Wang,et al.  Leveraging spatio-temporal redundancy for RFID data cleansing , 2010, SIGMOD Conference.

[8]  Minos N. Garofalakis,et al.  Adaptive cleaning for RFID data streams , 2006, VLDB.

[9]  Prashant J. Shenoy,et al.  Efficient Data Interpretation and Compression over RFID Streams , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[10]  Gustavo Alonso,et al.  A Pipelined Framework for Online Cleaning of Sensor Data Streams , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[11]  Jiawei Han,et al.  Cost-Conscious Cleaning of Massive RFID Data Sets , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[12]  Josep Domingo-Ferrer,et al.  A distributed architecture for scalable private RFID tag identification , 2007, Comput. Networks.

[13]  Frederick Reiss,et al.  Design Considerations for High Fan-In Systems: The HiFi Approach , 2005, CIDR.

[14]  Dan Suciu,et al.  Towards correcting input data errors probabilistically using integrity constraints , 2006, MobiDE '06.

[15]  Jun Rao,et al.  A deferred cleansing method for RFID data analytics , 2006, VLDB.

[16]  Beng Chin Ooi,et al.  Efficient RFID Data Imputation by Analyzing the Correlations of Monitored Objects , 2009, DASFAA.