Methodology for Knowledge Extraction from Mobility Big Data

The spread of sensor-equipped mobile devices, together with mobile communication, generates huge volumes of real-time data (big data) about users’ mobility habits, which must be analysed correctly to extract useful knowledge. In this research we explore a data mining approach based on a Naive Bayes (NB) classifier applied to different big data sources. To this end, we propose a methodology built on four processes that collect data and merge the different data sources into pre-defined data classes. The methodology can be applied to different big data sources to extract diverse knowledge that supports the development of dedicated applications and decision processes in intelligent transportation systems, such as route advice, CO2 emissions reduction through fuel savings, and smart advice on public transportation usage.
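To make the classification step concrete, the following is a minimal sketch of a categorical Naive Bayes classifier with Laplace smoothing, applied to records that have already been merged into discrete classes. It is not the authors' implementation: the feature names (speed band, time of day, proximity to a transit stop), the transport-mode labels, and the six training records are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


def train_nb(records, labels):
    """Count class and per-feature value frequencies for categorical Naive Bayes."""
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)  # (feature index, label) -> value counts
    vocab = defaultdict(set)             # feature index -> distinct values seen
    for rec, lab in zip(records, labels):
        for i, value in enumerate(rec):
            value_counts[(i, lab)][value] += 1
            vocab[i].add(value)
    return class_counts, value_counts, vocab


def predict_nb(model, rec):
    """Return the label with the highest log-posterior under the NB assumption."""
    class_counts, value_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for lab, n in class_counts.items():
        score = math.log(n / total)  # log prior
        for i, value in enumerate(rec):
            count = value_counts[(i, lab)][value]
            # Laplace (add-one) smoothing avoids zero probabilities.
            score += math.log((count + 1) / (n + len(vocab[i])))
        if score > best_score:
            best_label, best_score = lab, score
    return best_label


# Hypothetical, pre-discretised records: (speed band, time of day, near transit stop)
RECORDS = [
    ("low", "morning", "yes"),
    ("low", "evening", "no"),
    ("medium", "morning", "yes"),
    ("medium", "evening", "yes"),
    ("high", "morning", "no"),
    ("high", "evening", "no"),
]
LABELS = ["walk", "walk", "bus", "bus", "car", "car"]

MODEL = train_nb(RECORDS, LABELS)
print(predict_nb(MODEL, ("high", "morning", "no")))
```

In practice the discretisation into classes (here, the speed bands) would come from the data collection and merging processes of the methodology, and a library implementation could replace the hand-written counting.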
