Linked Data Analytics in Interdisciplinary Studies: The Health Impact of Air Pollution in Urban Areas

The design of solutions that are able to exploit the available data collected in smart cities environments can lead to insights that can guide the implementation of approaches that have the potential to significantly improve the quality of life within a city. Such solutions include tools for the production of advanced analytics considering data fusion challenges. The preparation of qualitative input data sets, collected in many cases through heterogeneous sources and represented in various formats, constitute a very important step toward a meaningful analysis. Such input data sets, combined with approaches that reduce the data processing burden and support the easy and flexible-in terms of configuration-replication of an analysis, can lead to the next generation analytics tools. In this paper, a novel approach toward the production and consumption of linked data analytics in urban environments is presented. The approach is based on the exploitation of linked data principles, enhancing the ability of managing and processing of data, in ways not available before. In addition to the description of the overall technical approach, the application of the proposed solution into a real-life scenario for examining the health impact of outdoor air pollution in urban areas within an international, national, and regional perspective is detailed. A set of interesting results are produced along with their interpretation toward the provision of suggestions for policy making purposes.

[1]  Joel Schwartz,et al.  Chronic Exposure to Fine Particles and Mortality: An Extended Follow-up of the Harvard Six Cities Study from 1974 to 2009 , 2012, Environmental health perspectives.

[2]  A. Psyllidis,et al.  A Platform for Urban Analytics and Semantic Data Integration in City Planning , 2015, CAAD Futures.

[3]  Deborah L. McGuinness,et al.  PROV-O: The PROV Ontology , 2013 .

[4]  Sören Auer,et al.  LinkDaViz - Automatic Binding of Linked Data to Visualizations , 2015, SEMWEB.

[5]  J McGlade,et al.  The European Environment: State and Outlook 2005 , 2005 .

[6]  Eleni Fotopoulou,et al.  Exploiting Linked Data Towards the Production of Added-Value Business Analytics and Vice-versa , 2015, DATA.

[7]  Annunziata Faustini,et al.  Which specific causes of death are associated with short term exposure to fine and coarse particles in Southern Europe? Results from the MED-PARTICLES project. , 2014, Environment international.

[8]  Jens Borken-Kleefeld,et al.  The final policy scenarios of the EU Clean Air Policy Package , 2014 .

[9]  Markus Amann Future emissions of air pollutants in Europe - Current legislation baseline and the scope for further reductions , 2012 .

[10]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[11]  Philip S. Yu,et al.  Privacy-preserving data publishing: A survey of recent developments , 2010, CSUR.

[12]  Chunxiao Jiang,et al.  Information Security in Big Data: Privacy and Data Mining , 2014, IEEE Access.

[13]  J. Schwartz,et al.  Reduction in fine particulate air pollution and mortality: Extended follow-up of the Harvard Six Cities study. , 2006, American journal of respiratory and critical care medicine.

[14]  Bianca Zadrozny,et al.  USapiens: A System for Urban Trajectory Data Analytics , 2015, 2015 16th IEEE International Conference on Mobile Data Management.

[15]  Spyros Kotoulas,et al.  QuerioCity: A Linked Data Platform for Urban Information Management , 2012, International Semantic Web Conference.

[16]  Andrew Raij,et al.  A Survey of Incentive Techniques for Mobile Crowd Sensing , 2015, IEEE Internet of Things Journal.

[17]  S. S. Sherekar,et al.  Comparative Analysis of Data Mining Tools and Techniques for Evaluating Performance of Database System , 2013 .

[18]  Andreas Schmidt,et al.  Data mining and linked open data – New perspectives for data analysis in environmental research , 2015 .

[19]  Sally A. McKee,et al.  Characterizing and subsetting big data workloads , 2014, 2014 IEEE International Symposium on Workload Characterization (IISWC).

[20]  Markus Amann,et al.  Modelling PM2.5 impact indicators in Europe: Health effects and legal compliance , 2015, Environ. Model. Softw..

[21]  Jens Lehmann,et al.  Quality assessment for Linked Data: A Survey , 2015, Semantic Web.