EKGTF: A knowledge-enhanced model for optimizing social network-based meteorological briefings

Abstract With the frequent occurrence of extreme natural phenomena, news about meteorological disasters has increased. As a timely and effective social sensor, social networks have gradually become an important data source for the perception of extreme meteorological events. Meteorological briefing refers to screening valuable knowledge from massive data to provide decision-makers with efficient situational awareness support. However, social network-based briefing content has challenges, including colloquialisms and informal text styles. How to optimize these data in a formal text style is of great significance to improve decision-making efficiency. This paper proposes a meteorological briefing formalization module composed of three models: the text form judgment model, the formalization words detection model, and the event knowledge guided text formalization (EKGTF) model. These models are concatenated to optimize the meteorological briefing, specifically formalizing the briefing content’s language style based on Sina Weibo data. As a knowledge-enhanced model, the EKGTF model focuses on describing the core meteorological event knowledge while formalizing the content. Compared to baseline models, the EKGTF model achieves the best results on the BLEU score. Based on the meteorological briefing formalization module, a meteorological briefing formalization service framework is constructed, which is to be applied to the China Meteorological Administration (CMA) Public Meteorological Service Center.

[1]  Yang Liu,et al.  Fine-tune BERT for Extractive Summarization , 2019, ArXiv.

[2]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[3]  Ayush Singh,et al.  Sentiment Transfer using Seq2Seq Adversarial Autoencoders , 2018, ArXiv.

[4]  Chen Wu,et al.  A Hierarchical Reinforced Sequence Operation Method for Unsupervised Text Style Transfer , 2019, ACL.

[5]  Joshua B. Plotkin,et al.  Information gerrymandering and undemocratic decisions , 2019, Nature.

[6]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[7]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[8]  Jeong-Hwan Kim,et al.  Deep learning for multi-year ENSO forecasts , 2019, Nature.

[9]  Michael H. Glantz,et al.  ENSO as an Integrating Concept in Earth Science , 2006, Science.

[10]  Jonas Mueller,et al.  IMaT: Unsupervised Text Attribute Transfer via Iterative Matching and Translation , 2019, EMNLP/IJCNLP.

[11]  Jinjun Xiong,et al.  Reinforcement Learning Based Text Style Transfer without Parallel Training Corpus , 2019, NAACL.

[12]  Wei Gao,et al.  Detecting Rumors from Microblogs with Recurrent Neural Networks , 2016, IJCAI.

[13]  Tommi S. Jaakkola,et al.  Sequence to Better Sequence: Continuous Revision of Combinatorial Structures , 2017, ICML.

[14]  Y-Lan Boureau,et al.  Zero-Shot Fine-Grained Style Transfer: Leveraging Distributed Continuous Style Representations to Transfer To Unseen Styles , 2019, ArXiv.

[15]  Zhendong Niu,et al.  A Hybrid E-Learning Recommendation Approach Based on Learners’ Influence Propagation , 2020, IEEE Transactions on Knowledge and Data Engineering.

[16]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[17]  Ali A. Ghorbani,et al.  An overview of online fake news: Characterization, detection, and discussion , 2020, Inf. Process. Manag..

[18]  Kripabandhu Ghosh,et al.  Utilizing microblogs for assisting post-disaster relief operations via matching resource needs and availabilities , 2019, Inf. Process. Manag..

[19]  Dean Eckles,et al.  Protecting elections from social media manipulation , 2019, Science.

[20]  Bernard J. Jansen,et al.  Correlation of Brand Mentions in Social Media and Web Searching Before and After Real Life Events: Phase Analysis of Social Media and Search Data for Super Bowl 2015 Commercials , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[21]  Zhiting Hu,et al.  A Survey of Knowledge-enhanced Text Generation , 2020, ACM Comput. Surv..

[22]  Dongyan Zhao,et al.  Style Transfer in Text: Exploration and Evaluation , 2017, AAAI.

[23]  Jason Weston,et al.  A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[24]  Guoqiang Liu,et al.  Inter-basin sources for two-year predictability of the multi-year La Niña event in 2010–2012 , 2017, Scientific Reports.

[25]  Alexander I. Rudnicky,et al.  Automatic Extraction of Briefing Templates , 2008, IJCNLP.

[26]  Xuanjing Huang,et al.  Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation , 2019, ACL.

[27]  Vincent Nguyen,et al.  Investigating the Effect of Lexical Segmentation in Transformer-based Models on Medical Datasets , 2019, ALTA.

[28]  Zhoujun Li,et al.  Harnessing Pre-Trained Neural Networks with Rules for Formality Style Transfer , 2019, EMNLP.

[29]  Lili Mou,et al.  Disentangled Representation Learning for Non-Parallel Text Style Transfer , 2018, ACL.

[30]  Percy Liang,et al.  Delete, Retrieve, Generate: a Simple Approach to Sentiment and Style Transfer , 2018, NAACL.

[31]  Chang-Shing Lee,et al.  A fuzzy ontology and its application to news summarization , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[32]  Rajesh Ranganath,et al.  ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission , 2019, ArXiv.

[33]  Lei Li,et al.  Cross-Modal Commentator: Automatic Machine Commenting Based on Cross-Modal Information , 2019, ACL.

[34]  Jaewoo Kang,et al.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining , 2019, Bioinform..

[35]  Xiaojun Wan,et al.  Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation , 2019, NeurIPS.

[36]  Jie Zhou,et al.  A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer , 2019, IJCAI.

[37]  Samy Bengio,et al.  Content preserving text generation with attribute controls , 2018, NeurIPS.

[38]  Ye Zhang,et al.  SHAPED: Shared-Private Encoder-Decoder for Text Style Adaptation , 2018, NAACL.

[39]  Shujie Liu,et al.  A Dataset for Low-Resource Stylized Sequence-to-Sequence Generation , 2020, AAAI.

[40]  Wei-Hung Weng,et al.  Publicly Available Clinical BERT Embeddings , 2019, Proceedings of the 2nd Clinical Natural Language Processing Workshop.

[41]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[42]  Jimmy J. Lin,et al.  Simple Applications of BERT for Ad Hoc Document Retrieval , 2019, ArXiv.

[43]  Decomposing Textual Information For Style Transfer , 2019, EMNLP.

[44]  Markus Bayer,et al.  Rapid relevance classification of social media posts in disasters and emergencies: A system and evaluation featuring active, incremental and online learning , 2020, Inf. Process. Manag..

[45]  John Yen,et al.  Seeking the trustworthy tweet: Can microblogged data fit the information needs of disaster response and humanitarian relief organizations , 2011, ISCRAM.

[46]  Ion Bica,et al.  Sensing service architecture for smart cities using social network platforms , 2017, Soft Comput..

[47]  Keith Carlson,et al.  Evaluating prose style transfer with the Bible , 2017, Royal Society Open Science.

[48]  Arkaitz Zubiaga,et al.  Detection and Resolution of Rumours in Social Media , 2017, ACM Comput. Surv..

[49]  Hao Lu,et al.  Automatic generation of meteorological briefing by event knowledge guided summarization model , 2020, Knowl. Based Syst..

[50]  Xu Sun,et al.  Learning Sentiment Memories for Sentiment Modification without Parallel Data , 2018, EMNLP.

[51]  Zhe Gan,et al.  Adversarial Text Generation via Feature-Mover's Distance , 2018, NeurIPS.

[52]  Alexander M. Rush,et al.  Adversarially Regularized Autoencoders , 2017, ICML.

[53]  Miguel Angel Ferrer-Ballester,et al.  A Perspective Analysis of Handwritten Signature Technology , 2019, ACM Comput. Surv..

[54]  Eric P. Xing,et al.  Unsupervised Text Style Transfer using Language Models as Discriminators , 2018, NeurIPS.

[55]  Yu Cheng,et al.  Domain Adaptive Text Style Transfer , 2019, EMNLP.

[56]  Robert Balzer,et al.  The Briefing Associate: Easing Authors into the Semantic Web , 2002, IEEE Intell. Syst..

[57]  Jimmy J. Lin,et al.  Cross-Domain Modeling of Sentence-Level Evidence for Document Retrieval , 2019, EMNLP.

[58]  Ramesh Nallapati,et al.  Domain Adaptation with BERT-based Domain Classification and Data Selection , 2019, EMNLP.

[59]  Zaher Al Aghbari,et al.  SNSJam: Road traffic analysis and prediction by fusing data from multiple social networks , 2020, Inf. Process. Manag..

[60]  Hao Lu,et al.  Wide-grained capsule network with sentence-level feature to detect meteorological event in social network , 2020, Future Gener. Comput. Syst..

[61]  Houfeng Wang,et al.  Unpaired Sentiment-to-Sentiment Translation: A Cycled Reinforcement Learning Approach , 2018, ACL.

[62]  Naren Ramakrishnan,et al.  Neural Abstractive Text Summarization with Sequence-to-Sequence Models , 2018, Trans. Data Sci..

[63]  Dayiheng Liu,et al.  Revision in Continuous Space: Fine-Grained Control of Text Style Transfer , 2019, ArXiv.

[64]  Inderjeet Mani,et al.  Automated Briefing Production for Lessons Learned Systems , 2000 .

[65]  Mehmet A. Orgun,et al.  Real-time event detection from the Twitter data stream using the TwitterNews+ Framework , 2019, Inf. Process. Manag..

[66]  Inderjeet Mani,et al.  Using Summarization for Automatic Briefing Generation , 2000 .

[67]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[68]  Graham Neubig,et al.  A Probabilistic Formulation of Unsupervised Text Style Transfer , 2020, ICLR.

[69]  Zhendong Niu,et al.  Sensing Urban Transportation Events from Multi-Channel Social Signals with the Word2vec Fusion Model , 2018, Sensors.

[70]  Gerasimos Spanakis,et al.  Towards Controlled Transformation of Sentiment in Sentences , 2019, ICAART.

[71]  D. Raabe,et al.  Hydrogen enhances strength and ductility of an equiatomic high-entropy alloy , 2017, Scientific Reports.

[72]  Xu Sun,et al.  Parallel Data Augmentation for Formality Style Transfer , 2020, ACL.

[73]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[74]  Joel R. Tetreault,et al.  Dear Sir or Madam, May I Introduce the GYAFC Dataset: Corpus, Benchmarks and Metrics for Formality Style Transfer , 2018, NAACL.

[75]  Jason J. Jung,et al.  Social big data: Recent achievements and new challenges , 2015, Information Fusion.

[76]  Eric P. Xing,et al.  Toward Controlled Generation of Text , 2017, ICML.

[77]  Antonio Torralba,et al.  Using AI and Social Media Multimodal Content for Disaster Response and Management: Opportunities, Challenges, and Future Directions , 2020, Inf. Process. Manag..

[78]  Jim Hunter,et al.  Choosing words in computer-generated weather forecasts , 2005, Artif. Intell..

[79]  Yulia Tsvetkov,et al.  Style Transfer Through Back-Translation , 2018, ACL.

[80]  Cícero Nogueira dos Santos,et al.  Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer , 2018, ACL.

[81]  Regina Barzilay,et al.  Style Transfer from Non-Parallel Text by Cross-Alignment , 2017, NIPS.

[82]  Guillaume Lample,et al.  Multiple-Attribute Text Style Transfer , 2018, ArXiv.

[83]  Min-Yen Kan,et al.  Customization in a unified framework for summarizing medical literature , 2005, Artif. Intell. Medicine.

[84]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[85]  Zhendong Niu,et al.  Knowledge-based recommendation: a review of ontology-based recommender systems for e-learning , 2017, Artificial Intelligence Review.

[86]  Vishal Gupta,et al.  Recent automatic text summarization techniques: a survey , 2016, Artificial Intelligence Review.

[87]  Zhendong Niu,et al.  Using Adverse Weather Data in Social Media to Assist with City-Level Traffic Situation Awareness and Alerting , 2018, Applied Sciences.

[88]  Dejian Yang,et al.  Progress in ENSO prediction and predictability study , 2018, National Science Review.

[89]  M. Baqer,et al.  S-Sensors: Integrating physical world inputs with social networks using wireless sensor networks , 2009, 2009 International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP).

[90]  Amar Prakash Azad,et al.  Unsupervised Controllable Text Formalization , 2018, AAAI.

[91]  Benjamin C. M. Fung,et al.  Detecting breaking news rumors of emerging topics in social media , 2020, Inf. Process. Manag..

[92]  Akhilesh Sudhakar,et al.  “Transforming” Delete, Retrieve, Generate Approach for Controlled Text Style Transfer , 2019, EMNLP.

[93]  Krys J. Kochut,et al.  Text Summarization Techniques: A Brief Survey , 2017, International Journal of Advanced Computer Science and Applications.

[94]  Harsh Jhamtani,et al.  Shakespearizing Modern Language Using Copy-Enriched Sequence to Sequence Models , 2017, Proceedings of the Workshop on Stylistic Variation.

[95]  Alexander M. Rush,et al.  Structured Attention Networks , 2017, ICLR.

[96]  Muhammad Imran,et al.  Automatic identification of eyewitness messages on twitter during disasters , 2020, Inf. Process. Manag..

[97]  Karl Aberer,et al.  Cloud based social and sensor data fusion , 2012, 2012 15th International Conference on Information Fusion.

[98]  Tao Zhang,et al.  Mask and Infill: Applying Masked Language Model for Sentiment Transfer , 2019, IJCAI.

[99]  Alexander I. Rudnicky,et al.  Learning from the Report-writing Behavior of Individuals , 2007, IJCAI.