Identifying Community Fire Hazards from Citizen Communication by Applying Transfer Learning and Machine Learning Techniques

A cross-region transfer learning method is proposed to identify community (e.g. car parks, public spaces and shopping centers) fire hazards based on text input provided by community members. The key component of the method, which also accounts for data imbalance, is an improved transfer component analysis that is embedded with a local discriminant analysis to transfer non-local rich knowledge to the fire hazard identification of local communities with an insufficient number of samples. In addition, a fire hazard knowledge map is established and applied to supplement the missing key features for fire hazard identification, and ontology modeling is applied to standardize the text features and reduce the effect of semantic ambiguity brought by cross-region knowledge transfer. The proposed method is verified based on the text data of nine fire hazard classes from Lanzhou and Beidaihe in China. Machine learning experiments show that fire hazard identification performance of all nine classes were improved with the overall accuracy, precision, recall, F1 score and AUC increased by 12%, 15%, 16%, 15% and 15%, respectively. Under data imbalance scenarios, the proposed method outperforms the state of the art methods, such as sampling-based methods, FastText and ULMFiT. The results also show that the proposed method can achieve desired performance with only half of the training samples. These findings illustrate that the proposed method can assist regions by improving fire identification results significantly through knowledge transfer. The proposed approach can be followed to build smart systems for community fire risk management with reasonable performance and high efficiency.

[1]  Ivor W. Tsang,et al.  Domain Adaptation via Transfer Component Analysis , 2009, IEEE Transactions on Neural Networks.

[2]  Jian Ma,et al.  An Ontology-Based Text-Mining Method to Cluster Proposals for Research Project Selection , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[3]  Hongfei Lin,et al.  Knowledge transfer based on feature representation mapping for text classification , 2011, Expert Syst. Appl..

[4]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[5]  Jun Huan,et al.  Large margin transductive transfer learning , 2009, CIKM.

[6]  Michael A. Langston,et al.  Robust Inference of Genetic Exchange Communities from Microbial Genomes Using TF-IDF , 2017, Front. Microbiol..

[7]  Jing Xin,et al.  Fire risk analysis of residential buildings based on scenario clusters and its application in fire risk management , 2013 .

[8]  Hui Zhang,et al.  Multi-Modal Description of Public Safety Events Using Surveillance and Social Media , 2019, IEEE Transactions on Big Data.

[9]  Umar Manzoor,et al.  Ontology Based SMS Controller for Smart Phones , 2015, ArXiv.

[10]  Khalid Moinuddin,et al.  Reliability of sprinkler system in Australian shopping centres –A fault tree analysis , 2019 .

[11]  Qiang Yang,et al.  Transfer Learning via Dimensionality Reduction , 2008, AAAI.

[12]  Chinwe Ekenna,et al.  Smart city in crisis: Technology and policy concerns , 2019, Sustainable Cities and Society.

[13]  Zhongliang Jing,et al.  The Exploration of Urban Gridded Management in E-government , 2006, 2006 Fifth International Conference on Grid and Cooperative Computing Workshops.

[14]  Tom Heskes,et al.  Multi-Domain Transfer Component Analysis for Domain Generalization , 2017, Neural Processing Letters.

[15]  Saad A. Al-Ahmadi,et al.  Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification , 2018, Entropy.

[16]  Jun Wang,et al.  Knowledge map construction for question and answer archives , 2020, Expert Syst. Appl..

[17]  Gao Xu,et al.  Video Smoke Detection Based on Deep Saliency Network , 2018, Fire Safety Journal.

[18]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[19]  Bart Merci,et al.  Full-scale and reduced-scale tests on smoke movement in case of car park fire , 2013 .

[20]  Hua Xu,et al.  Chinese comments sentiment classification based on word2vec and SVMperf , 2015, Expert Syst. Appl..

[21]  Faisal Khan,et al.  Dynamic hazard identification and scenario mapping using Bayesian network , 2017 .

[22]  Bai Xiu-yin,et al.  An empirical study on application and efficiency of gridded management in public service supply of Chinese Government , 2017 .

[23]  Qiang Yang,et al.  Bridging Domains Using World Wide Knowledge for Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[24]  Noureddine Bénichou,et al.  How Design Fires Can be Used in Fire Hazard Analysis , 2002 .

[25]  Zhang Yongming,et al.  Erratum to: Video Fire Smoke Detection Using Motion and Color Features , 2010 .

[26]  Sahil Garg,et al.  Structural block driven enhanced convolutional neural representation for relation extraction , 2020, Appl. Soft Comput..

[27]  Faisal Khan,et al.  An ontology-based methodology for hazard identification and causation analysis , 2019, Process Safety and Environmental Protection.

[28]  Mehran Kamkarhaghighi,et al.  Content Tree Word Embedding for document representation , 2017, Expert Syst. Appl..

[29]  Brian Y. Lattimer,et al.  Wildland Fire Spread Modeling Using Convolutional Neural Networks , 2019, Fire Technology.

[30]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[31]  Jonatan Gehandler,et al.  Limit-Based Fire Hazard Model for Evaluating Tunnel Life Safety , 2015 .

[32]  Michael Ng,et al.  Analysis of regularized least squares for functional linear regression model , 2018, J. Complex..

[33]  Jean-Philippe Thiran,et al.  Text identification in complex background using SVM , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[34]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[35]  Jordi Grau-Moya,et al.  Non-Equilibrium Relations for Bounded Rational Decision-Making in Changing Environments , 2017, Entropy.

[36]  Michal Koziarski,et al.  Radial-Based Undersampling for Imbalanced Data Classification , 2019, Pattern Recognit..

[37]  Vinod Patidar,et al.  Image encryption using chaotic logistic map , 2006, Image Vis. Comput..

[38]  X. W. Liang,et al.  LR-SMOTE - An improved unbalanced data set oversampling based on K-means and SVM , 2020, Knowl. Based Syst..

[39]  O. Kuipers,et al.  The Relationship among Tyrosine Decarboxylase and Agmatine Deiminase Pathways in Enterococcus faecalis , 2017, Front. Microbiol..

[40]  Min Xiao,et al.  Semi-Supervised Kernel Matching for Domain Adaptation , 2012, AAAI.