论文信息 - Leveraging Event Specific and Chunk Span features to Extract COVID Events from tweets

Leveraging Event Specific and Chunk Span features to Extract COVID Events from tweets

Twitter has acted as an important source of information during disasters and pandemic, especially during the times of COVID-19. In this paper, we describe our system entry for WNUT 2020 Shared Task-3. The task was aimed at automating the extraction of a variety of COVID-19 related events from Twitter, such as individuals who recently contracted the virus, someone with symptoms who were denied testing and believed remedies against the infection. The system consists of separate multi-task models for slot-filling subtasks and sentence-classification subtasks while leveraging the useful sentence-level information for the corresponding event. The system uses COVID-Twitter-Bert with attention-weighted pooling of candidate slot-chunk features to capture the useful information chunks. The system ranks 1st at the leader-board with F1 of 0.6598, without using any ensembles or additional datasets. The code and trained models are available at this https url1.

Ayush Kaushal | Tejas Vaidhya

[1] Durga Toshniwal,et al. Sub-event detection during natural hazards using features of social media data , 2013, WWW.

[2] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[3] Rohit Kumar,et al. WordTokenizers.jl: Basic tools for tokenizing natural language in Julia , 2020, J. Open Source Softw..

[4] Regina Barzilay,et al. Event Discovery in Social Media Feeds , 2011, ACL.

[5] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6] Yangming Li,et al. A Stack-Propagation Framework with Token-Level Intent Detection for Spoken Language Understanding , 2019, EMNLP.

[7] Christopher D. Manning,et al. Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.

[8] Andrew L. Maas. Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[9] Ritam Dutt,et al. Analysing the Extent of Misinformation in Cancer Related Tweets , 2020, ICWSM.

[10] Muhammad Imran,et al. Identifying Sub-events and Summarizing Disaster-Related Information from Microblogs , 2018, SIGIR.

[11] Bowen Zhou,et al. Leveraging Sentence-level Information with Encoder LSTM for Semantic Slot Filling , 2016, EMNLP.