Incidents1M: A Large-Scale Dataset of Images With Natural Disasters, Damage, and Incidents

Natural disasters, such as floods, tornadoes, or wildfires, are increasingly pervasive as the Earth undergoes global warming. It is difficult to predict when and where an incident will occur, so timely emergency response is critical to saving the lives of those endangered by destructive events. Fortunately, technology can play a role in these situations. Social media posts can be used as a low-latency data source to understand the progression and aftermath of a disaster, but parsing this data is tedious without automated methods. Prior work has mostly focused on text-based filtering, while image- and video-based filtering remains largely unexplored. In this work, we present the Incidents1M Dataset, a large-scale multi-label dataset that contains 977,088 images, with 43 incident and 49 place categories. We provide details of the dataset construction, statistics, and potential biases; introduce and train a model for incident detection; and perform image-filtering experiments on millions of images from Flickr and Twitter. We also present some applications to incident analysis to encourage and enable future work in computer vision for humanitarian aid. Code, data, and models are available at http://incidentsdataset.csail.mit.edu.
