Data Mining for Geoinformatics

The rate at which geospatial data is being generated exceeds our computational capability to extract patterns for understanding a dynamically changing world. Geoinformatics and data mining focus on the development and implementation of computational algorithms to solve these problems. This unique volume contains a collection of chapters on state-of-the-art data mining techniques applied to geoinformatic problems of high complexity and important societal value. Data Mining for Geoinformatics addresses current concerns and developments relating to spatio-temporal data mining issues in remotely sensed data; problems in meteorological data, such as tornado formation; estimation of radiation from the Fukushima nuclear power plant; simulations of traffic data using OpenStreetMap; real-time traffic applications of data stream mining; visual analytics of traffic and weather data; and the exploratory visualization of collective, mobile objects, such as the flocking behavior of wild chickens. This book is designed as a reference or secondary textbook for researchers and advanced-level students in computer science, earth science, and geography. Practitioners working in data mining and geoscience will also find it a valuable reference.
