Feature Data Selection for Improving the Performance of Entity Similarity Searches in the Internet of Things

Sensors are used to sense the state information of physical entities in the Internet of Things (IoT). Thus, a large amount of dynamic real-time data is generated. The entity similarity search based on the quantitative dynamic sensor data is thus worth studying. To the best of our knowledge, there is no research on the entity similarity search based on feature data selection for the quantitative dynamic sensor data in the IoT. This paper proposes a selection mechanism for the entity main features (SMEF). The SMEF is a feature data selection method based on the quantitative dynamic sensor data. It uses the feature matrix to delete the irrelevant entity features, applies an improved relief algorithm (iRelief) to calculate the feature data relevance and proposes a three-component storage relation table of the entities, models, and features (TEMF) for the dynamic feature weights calculation. The experimental results show that the similarity search algorithm based on feature data selection can improve the average search accuracy by more than 10%, as well as increase the search speed and reduce the data transmission and storage costs.

[1]  Peng Li,et al.  Similarity Search Algorithm over Data Supply Chain Based on Key Points , 2016, BigCom.

[2]  Fan Wu,et al.  Low-Overhead and High-Precision Prediction Model for Content-Based Sensor Search in the Internet of Things , 2016, IEEE Communications Letters.

[3]  Kay Römer,et al.  Fuzzy-based sensor search in the Web of Things , 2012, 2012 3rd IEEE International Conference on the Internet of Things.

[4]  Terrance E. Boult,et al.  Multi-attribute spaces: Calibration for attribute fusion and similarity search , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Reynold Cheng,et al.  Scalable Algorithms for Nearest-Neighbor Joins on Big Trajectory Data , 2016, IEEE Transactions on Knowledge and Data Engineering.

[6]  Xiangjun Dong,et al.  An Industrial Internet of Things Feature Selection Method Based on Potential Entropy Evaluation Criteria , 2018, IEEE Access.

[7]  Jaime Lloret,et al.  Intelligent IoT Traffic Classification Using Novel Search Strategy for Fast-Based-Correlation Feature Selection in Industrial Environments , 2018, IEEE Internet of Things Journal.

[8]  Xuyuan Kang,et al.  High-Accuracy Entity State Prediction Method Based on Deep Belief Network Toward IoT Search , 2019, IEEE Wireless Communications Letters.

[9]  Samina Khalid,et al.  A survey of feature selection and feature extraction techniques in machine learning , 2014, 2014 Science and Information Conference.

[10]  Wolfgang Kellerer,et al.  A real-time search engine for the Web of Things , 2008, 2010 Internet of Things (IOT).

[11]  Dave Evans,et al.  How the Next Evolution of the Internet Is Changing Everything , 2011 .

[12]  Qun Li,et al.  Snoogle: A Search Engine for Pervasive Environments , 2010, IEEE Transactions on Parallel and Distributed Systems.

[13]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[14]  Guanglu Sun,et al.  Feature selection for IoT based on maximal information coefficient , 2018, Future Gener. Comput. Syst..

[15]  Oscar Corcho,et al.  Semantic Sensor Network XG Final Report , 2011 .