Big Data Challenges and Hazards Modeling

In this work we present an overview of the challenges presented by remote sensing and other big data sources for hazards modeling and response in the world today. Big data not only provides vital information for rapid and efficient assessment of the effects and impacts of natural and anthropogenic effects, but is also an important boundary object facilitating communication and interaction between the relevant scientific, business, and governmental organizations. To effectively serve that role, big data must be credible, salient, and legitimate. The characteristics of big data are examined and we conclude that the most important ones for this application are volume, velocity, variety, and value. We present two different applications from the fields of climate and the solid earth science that are designed to solve these challenges for big data science.

[1]  Kerstin Kleese van Dam,et al.  Challenges in Data Intensive Analysis at Scientific Experimental User Facilities , 2011 .

[2]  Alain Biem,et al.  A Pipelining Implementation for Parsing X-ray Diffraction Source Data and Removing the Background Noise , 2010 .

[3]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[4]  R.M.W. Musson,et al.  Methodological considerations of probabilistic seismic hazard mapping , 2001 .

[5]  Ezio Faccioli,et al.  Engineering seismic risk analysis of the Friuli region , 1979 .

[6]  Russ Rew,et al.  NetCDF: an interface for scientific data access , 1990, IEEE Computer Graphics and Applications.

[7]  David M. Lawrence,et al.  Permafrost response to increasing Arctic shrub abundance depends on the relative influence of shrubs on local soil cooling versus large-scale climate warming , 2011 .

[8]  Reagan Moore,et al.  Data-intensive computing and digital libraries , 1998, CACM.

[9]  G. Nolan,et al.  Computational solutions to large-scale data management and analysis , 2010, Nature Reviews Genetics.

[10]  L. Dilling,et al.  Creating usable science: Opportunities and constraints for climate knowledge use and their implications for science policy , 2011 .

[11]  C. Lynch Big data: How do your data grow? , 2008, Nature.

[12]  R. Mcguire Seismic Hazard and Risk Analysis , 2004 .

[13]  Jinhui Qin,et al.  A Pipelining Implementation for High Resolution Seismic Hazard Maps Production , 2015, ICCS.

[14]  Karl E. Taylor,et al.  An overview of CMIP5 and the experiment design , 2012 .

[15]  Derya Maktav,et al.  Foreword to the Special Issue on “Human Settlements: A Global Remote Sensing Challenge” , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[16]  Martin Hilbert,et al.  The World’s Technological Capacity to Store, Communicate, and Compute Information , 2011, Science.

[17]  F. Giorgi,et al.  Addressing climate information needs at the regional level: the CORDEX framework , 2009 .

[18]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .

[19]  R. Musson DETERMINATION OF DESIGN EARTHQUAKES IN SEISMIC HAZARD ANALYSIS THROUGH MONTE CARLO SIMULATION , 1999 .

[20]  Navarun Gupta,et al.  Seven V's of Big Data understanding Big Data to extract value , 2014, Proceedings of the 2014 Zone 1 Conference of the American Society for Engineering Education.

[21]  D. Guston Boundary Organizations in Environmental Policy and Science: An Introduction , 2001 .

[22]  V. Marx Biology: The big challenges of big data , 2013, Nature.

[23]  Susan Leigh Star,et al.  Institutional Ecology, `Translations' and Boundary Objects: Amateurs and Professionals in Berkeley's Museum of Vertebrate Zoology, 1907-39 , 1989 .

[24]  Ian Gorton,et al.  The Changing Paradigm of Data-Intensive Computing , 2009, Computer.

[25]  Richard P. Mount The Office of Science Data-Management Challenge , 2005 .

[26]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[27]  Erik Meijer The world according to LINQ , 2011, CACM.

[28]  Muhammad Ali Babar,et al.  Software Architecture Review: The State of Practice , 2009, Computer.

[29]  Kalle Lyytinen,et al.  Crossing boundaries and conscripting participation: representing and integrating knowledge in a paper machinery project , 2001, Eur. J. Inf. Syst..

[30]  Richard G. Jones,et al.  A Regional Climate Change Assessment Program for North America , 2009 .

[31]  Rajiv Ranjan,et al.  Towards building a data-intensive index for big data computing - A case study of Remote Sensing data processing , 2015, Inf. Sci..

[32]  Robin K. McGuire,et al.  Probabilistic seismic hazard analysis: Early history , 2008 .

[33]  Yonggang Wen,et al.  Toward Scalable Systems for Big Data Analytics: A Technology Tutorial , 2014, IEEE Access.

[34]  K. Assatourians,et al.  EqHaz: An Open‐Source Probabilistic Seismic‐Hazard Code Based on the Monte Carlo Simulation Approach , 2013 .