Data Mining for Internet of Things: A Survey

It sounds like mission impossible to connect everything on the Earth together via Internet, but Internet of Things (IoT) will dramatically change our life in the foreseeable future, by making many "impossibles" possible. To many, the massive data generated or captured by IoT are considered having highly useful and valuable information. Data mining will no doubt play a critical role in making this kind of system smart enough to provide more convenient services and environments. This paper begins with a discussion of the IoT. Then, a brief review of the features of "data from IoT" and "data mining for IoT' is given. Finally, changes, potentials, open issues, and future trends of this field are addressed.

[1]  Elio Masciari,et al.  A Framework for Outlier Mining in RFID data , 2007, 11th International Database Engineering and Applications Symposium (IDEAS 2007).

[2]  Samuel Pierre,et al.  A distributed energy-efficient clustering protocol for wireless sensor networks , 2010, Comput. Electr. Eng..

[3]  Frank Siegemund,et al.  A Context-Aware Communication Platform for Smart Objects , 2004, Pervasive.

[4]  Alexandra Brintrup,et al.  Resource Management in the Internet of Things: Clustering, Synchronisation and Software Agents , 2011, Architecting the Internet of Things.

[5]  Gennaro Boggia,et al.  Standardized Protocol Stack for the Internet of (Important) Things , 2013, IEEE Communications Surveys & Tutorials.

[6]  Johannes Stallkamp,et al.  A video-based door monitoring system using local appearance-based face models , 2010, Comput. Vis. Image Underst..

[7]  Sherali Zeadally,et al.  TMS-RFID: Temporal management of large-scale RFID applications , 2011, Inf. Syst. Frontiers.

[8]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[9]  Michel Vacher,et al.  SVM-Based Multimodal Classification of Activities of Daily Living in Health Smart Homes: Sensors, Algorithms, and First Experimental Results , 2010, IEEE Transactions on Information Technology in Biomedicine.

[10]  Douglas C. Schmidt,et al.  Using Smartphones to Detect Car Accidents and Provide Situational Awareness to Emergency Responders , 2010, MOBILWARE.

[11]  James R. Larus,et al.  Imagining the Future: Thoughts on Computing , 2012, Computer.

[12]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[13]  N. Noury,et al.  Supervised classification of activities of daily living in health smart homes using SVM , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[14]  George C. Polyzos,et al.  Monitoring and Modeling Simple Everyday Activities of the Elderly at Home , 2010, 2010 7th IEEE Consumer Communications and Networking Conference.

[15]  Sudipto Guha,et al.  Clustering Data Streams: Theory and Practice , 2003, IEEE Trans. Knowl. Data Eng..

[16]  Richard G Baraniuk,et al.  More Is Less: Signal Processing and the Data Deluge , 2011, Science.

[17]  Chris H. Q. Ding,et al.  K-means clustering via principal component analysis , 2004, ICML.

[18]  Sydney Katz Assessing Self‐maintenance: Activities of Daily Living, Mobility, and Instrumental Activities of Daily Living , 1983, Journal of the American Geriatrics Society.

[19]  Makoto Takizawa,et al.  A Survey on Clustering Algorithms for Wireless Sensor Networks , 2010, 2010 13th International Conference on Network-Based Information Systems.

[20]  Viktor Mayer-Schnberger,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2013 .

[21]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[22]  M. Lawton,et al.  Assessment of Older People: Self-Maintaining and Instrumental Activities of Daily Living , 1969 .

[23]  Florian Michahelles,et al.  An Architectural Approach Towards the Future Internet of Things , 2011, Architecting the Internet of Things.

[24]  Graham Clarke,et al.  Affect-aware behaviour modelling and control inside an intelligent environment , 2010, Pervasive Mob. Comput..

[25]  Byeong-Hee Roh,et al.  Tree-Based Classification Algorithm for Heterogeneous Unique Item ID Schemes , 2005, EUC Workshops.

[26]  Frank Romano,et al.  An Investigation into printing industry trends , 2004 .

[27]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[28]  Chunming Qiao,et al.  A plant-and-play wireless sensor network system for gate monitoring , 2009, 2009 IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks & Workshops.

[29]  Diane J. Cook,et al.  Discovering Temporal Features and Relations of Activity Patterns , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[30]  Li Wang,et al.  Using the relationship of shared neighbors to find hierarchical overlapping communities for effective connectivity in IoT , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[31]  Diane J. Cook,et al.  Temporal pattern discovery for anomaly detection in a smart home , 2007 .

[32]  Lida Xu,et al.  The internet of things: a survey , 2014, Information Systems Frontiers.

[33]  M. Lawton,et al.  Assessment of older people: self-maintaining and instrumental activities of daily living. , 1969, The Gerontologist.

[34]  Luca Lombardi,et al.  Challenges for Data Mining in Distributed Sensor Networks , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[35]  Christian Blum,et al.  Metaheuristics in combinatorial optimization: Overview and conceptual comparison , 2003, CSUR.

[36]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[37]  Ganesh K. Venayagamoorthy,et al.  Computational Intelligence in Wireless Sensor Networks: A Survey , 2011, IEEE Communications Surveys & Tutorials.

[38]  Matthew B. Jones,et al.  Challenges and Opportunities of Open Data in Ecology , 2011, Science.

[39]  Alexander Klapproth,et al.  Prometheus — Fuzzy information retrieval for semantic homes and environments , 2010, 3rd International Conference on Human System Interaction.

[40]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[41]  James F. Allen,et al.  Actions and Events in Interval Temporal Logic , 1994, J. Log. Comput..

[42]  Yang Xiao,et al.  A survey of communication/networking in Smart Grids , 2012, Future Gener. Comput. Syst..

[43]  C. Brodley,et al.  Decision tree classification of land cover from remotely sensed data , 1997 .

[44]  Georgios Kalogridis,et al.  Smart Grid Privacy via Anonymization of Smart Metering Data , 2010, 2010 First IEEE International Conference on Smart Grid Communications.

[45]  Ossama Younis,et al.  HEED: a hybrid, energy-efficient, distributed clustering approach for ad hoc sensor networks , 2004, IEEE Transactions on Mobile Computing.

[46]  R. K. Rayudu,et al.  Wellness determination of inhabitant based on daily activity behaviour in real-time monitoring using Sensor Networks , 2011, 2011 Fifth International Conference on Sensing Technology.

[47]  Najah AbuAli,et al.  Data management for the Internet of Things: Green directions , 2012, 2012 IEEE Globecom Workshops.

[48]  Diane J. Cook,et al.  PREDIcting inhabitant action using action and task models with application to smart homes , 2004, Int. J. Artif. Intell. Tools.

[49]  Diane J. Cook,et al.  An Adaptive Sensor Mining Framework for Pervasive Computing Applications , 2008, KDD Workshop on Knowledge Discovery from Sensor Data.

[50]  Charu C. Aggarwal,et al.  The Internet of Things: A Survey from the Data-Centric Perspective , 2013, Managing and Mining Sensor Data.

[51]  Maurizio Tomasella,et al.  Vision and Challenges for Realising the Internet of Things , 2010 .

[52]  Niall Rooney,et al.  Temporal Data Mining for Smart Homes , 2006, Designing Smart Homes.

[53]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[54]  Ghadir Radman,et al.  Survey on Smart Grid , 2010, Proceedings of the IEEE SoutheastCon 2010 (SoutheastCon).

[55]  Sachchidanand Singh,et al.  Big Data analytics , 2012 .

[56]  Ian F. Akyildiz,et al.  Wireless sensor networks: a survey , 2002, Comput. Networks.

[57]  Svetha Venkatesh,et al.  Recognition of emergent human behaviour in a smart home: A data mining approach , 2007, Pervasive Mob. Comput..

[58]  Guangwei Ren,et al.  The Research of Data Mining Technology of Privacy Preserving in Sharing Platform of Internet of Things , 2012 .

[59]  Artemis Moroni,et al.  Vision and Challenges for Realising the Internet of Things , 2010 .

[60]  Kevin Ashton,et al.  That ‘Internet of Things’ Thing , 1999 .

[61]  Yen-Kuang Chen,et al.  Challenges and opportunities of internet of things , 2012, 17th Asia and South Pacific Design Automation Conference.

[62]  Dan Istrate,et al.  Sound Processing for Health Smart Home , 2004 .

[63]  Diane J. Cook,et al.  Using Association Rule Mining to Discover Temporal Relations of Daily Activities , 2011, ICOST.

[64]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[65]  Dongkyoo Shin,et al.  Ubiquitous Intelligent Sensing System for a Smart Home , 2006, SSPR/SPR.

[66]  Jiawei Han,et al.  CLARANS: A Method for Clustering Objects for Spatial Data Mining , 2002, IEEE Trans. Knowl. Data Eng..

[67]  Wenyuan Xu,et al.  Temporal privacy in wireless sensor networks: Theory and practice , 2009, TOSN.

[68]  Martin Hilbert,et al.  The World’s Technological Capacity to Store, Communicate, and Compute Information , 2011, Science.

[69]  Florian Michahelles,et al.  Architecting the Internet of Things , 2011 .

[70]  Parag A. Pathak,et al.  Massachusetts Institute of Technology , 1964, Nature.

[71]  Keun Ho Ryu,et al.  A framework of spatial co-location pattern mining for ubiquitous GIS , 2014, Multimedia Tools and Applications.

[72]  Eric Horvitz,et al.  Prediction, Expectation, and Surprise: Methods, Designs, and Study of a Deployed Traffic Forecasting Service , 2005, UAI.

[73]  Lawrence B. Holder,et al.  Discovering Activities to Recognize and Track in a Smart Environment , 2011, IEEE Transactions on Knowledge and Data Engineering.

[74]  Joel J. P. C. Rodrigues,et al.  Metaheuristic Scheduling for Cloud: A Survey , 2014, IEEE Systems Journal.

[75]  Gaurav S. Sukhatme,et al.  Coarse In-Building Localization with Smartphones , 2009, MobiCASE.

[76]  Chu-Sing Yang,et al.  PREACO: A fast ant colony optimization for codebook generation , 2013, Appl. Soft Comput..

[77]  Michel Vacher,et al.  A wavelet-based pattern recognition algorithm to classify postural transitions in humans , 2009, 2009 17th European Signal Processing Conference.

[78]  Euiho Suh,et al.  Context-aware system for proactive personalized service based on context history , 2009, Expert Syst. Appl..

[79]  Peter Friess,et al.  Internet of Things Strategic Research Roadmap , 2011 .

[80]  Sasikanth Avancha,et al.  Security for Sensor Networks , 2004 .

[81]  Mari Carmen Domingo,et al.  An overview of the Internet of Things for people with disabilities , 2012, J. Netw. Comput. Appl..

[82]  Yong Zhao,et al.  Cloud Computing and Grid Computing 360-Degree Compared , 2008, GCE 2008.

[83]  Pedro Faria,et al.  Intelligent energy resource management considering vehicle-to-grid: A Simulated Annealing approach , 2012, 2012 IEEE Power and Energy Society General Meeting.

[84]  Srinivas Sampalli,et al.  An Intelligent RFID System for Consumer Businesses , 2010, 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing.

[85]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[86]  Sajal K. Das,et al.  A framework for energy-saving data gathering using two-phase clustering in wireless sensor networks , 2004, The First Annual International Conference on Mobile and Ubiquitous Systems: Networking and Services, 2004. MOBIQUITOUS 2004..

[87]  Dongkyoo Shin,et al.  Real-Time Human Tracker Based on Location and Motion Recognition of User for Smart Home , 2009, 2009 Third International Conference on Multimedia and Ubiquitous Engineering.

[88]  M. Narasimha Murty,et al.  Genetic K-means algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[89]  Rodrigo Roman,et al.  On the features and challenges of security and privacy in distributed internet of things , 2013, Comput. Networks.

[90]  Ning Yang,et al.  A non-contact health monitoring model based on the Internet of things , 2012, 2012 8th International Conference on Natural Computation.

[91]  Young-Kyu Yang,et al.  A Distributed Data Mining System for a Novel Ubiquitous Healthcare Framework , 2007, International Conference on Computational Science.

[92]  Damith Chinthana Ranasinghe,et al.  Taxonomy, technology and applications of smart objects , 2011, Inf. Syst. Frontiers.

[93]  Min Zhang,et al.  Application Study of Precision Agriculture Based on Ontology in the Internet of Things Environment , 2011, ICAIC.

[94]  Nada Golmie,et al.  NIST Framework and Roadmap for Smart Grid Interoperability Standards, Release 3.0 , 2014 .

[95]  William J. Knottenbelt,et al.  Deriving generalised stochastic Petri net performance models from high-precision location tracking data , 2011, VALUETOOLS.

[96]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[97]  Diane J. Cook,et al.  MavHome: an agent-based smart home , 2003, Proceedings of the First IEEE International Conference on Pervasive Computing and Communications, 2003. (PerCom 2003)..

[98]  Jin Hyun Son,et al.  User-pattern analysis framework to predict future service in intelligent home network , 2009, 2009 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[99]  Gerd Kortuem,et al.  Smart objects as building blocks for the Internet of things , 2010, IEEE Internet Computing.

[100]  A. Bolz,et al.  Real time heart ischemia detection in the smart home care system , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[101]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[102]  Jun Zheng,et al.  Wireless Sensor Networks: A Networking Perspective , 2009 .

[103]  Imrich Chlamtac,et al.  Internet of things: Vision, applications and research challenges , 2012, Ad Hoc Networks.

[104]  Seokho Chi,et al.  Automated Object Identification Using Optical Video Cameras on Construction Sites , 2011, Comput. Aided Civ. Infrastructure Eng..

[105]  Majid Sarrafzadeh,et al.  Optimal Energy Aware Clustering in Sensor Networks , 2002 .

[106]  Mohamed S. Kamel,et al.  Efficient phrase-based document indexing for Web document clustering , 2004, IEEE Transactions on Knowledge and Data Engineering.

[107]  Volodymyr Vasyutynskyy,et al.  Simulation and analysis of buying behavior in supermarkets , 2010, 2010 IEEE 15th Conference on Emerging Technologies & Factory Automation (ETFA 2010).

[108]  Haiying Zhou,et al.  Real-Time Automatic ECG Diagnosis Method Dedicated to Pervasive Cardiac Care , 2009, Wirel. Sens. Netw..

[109]  Nada Golmie,et al.  NIST Framework and Roadmap for Smart Grid Interoperability Standards, Release 2.0 , 2012 .

[110]  Victor Kardeby,et al.  Automatic sensor clustering : connectivity for the internet of things , 2011 .

[111]  Samuel Madden,et al.  From Databases to Big Data , 2012, IEEE Internet Comput..

[112]  Damith Chinthana Ranasinghe,et al.  Adding sense to the Internet of Things , 2011, Personal and Ubiquitous Computing.

[113]  Wendi Heinzelman,et al.  Energy-efficient communication protocol for wireless microsensor networks , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[114]  Jinhua Jiang,et al.  Nodes Social Relations Cognition for Mobility-Aware in the Internet of Things , 2011, 2011 International Conference on Internet of Things and 4th International Conference on Cyber, Physical and Social Computing.

[115]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[116]  Padhraic Smyth,et al.  A Spectral Clustering Approach To Finding Communities in Graph , 2005, SDM.

[117]  P. Barralon,et al.  Walk Detection With a Kinematic Sensor: Frequency and Wavelet Comparison , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[118]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[119]  A. Govardhan,et al.  Association of Data Mining and healthcare domain: Issues and current state of the art , 2011 .

[120]  Dileeka Dias,et al.  An intelligent driver guidance tool using location based services , 2011, Proceedings 2011 IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services.

[121]  Anantha P. Chandrakasan,et al.  An application-specific protocol architecture for wireless microsensor networks , 2002, IEEE Trans. Wirel. Commun..

[122]  Eric Gossett,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2015 .

[123]  荒木英俊 長寿者の居住地特性と日常生活動作能力(Activities of Daily Living: ADL) , 1997 .

[124]  Ossama Younis,et al.  Distributed clustering in ad-hoc sensor networks: a hybrid, energy-efficient approach , 2004, IEEE INFOCOM 2004.

[125]  Sourav S. Bhowmick,et al.  Sequential Pattern Mining: A Survey , 2003 .

[126]  Jaydip Sen,et al.  Internet of Things - Applications and Challenges in Technology and Standardization , 2011 .

[127]  Andries Petrus Engelbrecht,et al.  Fundamentals of Computational Swarm Intelligence , 2005 .

[128]  Xue Li,et al.  Time To Live: Temporal Management of Large-Scale RFID Applications , 2008 .

[129]  Duoduo Liao,et al.  On clusterization of "big data" streams , 2012, COM.Geo '12.

[130]  Pat Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.

[131]  Shen Bin,et al.  Research on data mining models for the internet of things , 2010, 2010 International Conference on Image Analysis and Signal Processing.

[132]  Melnned M. Kantardzic Big Data Analytics , 2013, Lecture Notes in Computer Science.

[133]  Rolf H. Weber,et al.  Internet of Things - New security and privacy challenges , 2010, Comput. Law Secur. Rev..

[134]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[135]  Ping-Min Lin,et al.  A fall detection system using k-nearest neighbor classifier , 2010, Expert Syst. Appl..

[136]  Marimuthu Palaniswami,et al.  Internet of Things (IoT): A vision, architectural elements, and future directions , 2012, Future Gener. Comput. Syst..

[137]  Shashi Shekhar,et al.  A Joinless Approach for Mining Spatial Colocation Patterns , 2006, IEEE Transactions on Knowledge and Data Engineering.

[138]  S. Kumar,et al.  Infrastructure for data-driven agriculture: identifying management zones for cotton using statistical modeling and machine learning techniques , 2011, 2011 8th International Conference & Expo on Emerging Technologies for a Smarter World.

[139]  Shivakant Mishra,et al.  Intrusion tolerance and anti-traffic analysis strategies for wireless sensor networks , 2004, International Conference on Dependable Systems and Networks, 2004.

[140]  Huan Chen,et al.  Smart Home Sensor Networks Pose Goal-Driven Solutions to Wireless Vacuum Systems , 2006, 2006 International Conference on Hybrid Information Technology.

[141]  Kyung-Jin Kang,et al.  A service scenario generation scheme based on association rule mining for elderly surveillance system in a smart home environment , 2012, Eng. Appl. Artif. Intell..

[142]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[143]  Jiawei Han,et al.  Discovery of Spatial Association Rules in Geographic Information Databases , 1995, SSD.

[144]  Mo-Yuen Chow,et al.  Performance Evaluation of an EDA-Based Large-Scale Plug-In Hybrid Electric Vehicle Charging Algorithm , 2012, IEEE Transactions on Smart Grid.

[145]  Thorben Keller,et al.  Mining the Internet of Things: Detection of False-Positive RFID Tag Reads using Low-Level Reader Data , 2011 .

[146]  Chu-Sing Yang,et al.  A time-efficient pattern reduction algorithm for k-means clustering , 2011, Inf. Sci..