Twitter provides important information for emergency responders in the rescue process during disasters. However, tweets containing relevant information are sparse and are usually hidden in a vast set of noisy contents. This leads to inherent challenges in generating suitable training data that are required for neural network models. In this paper, we study the problem of retrieving the infrastructure damage information from tweets generated from different location during crisis using the model actively trained on past but similar events. We combine RNN and GRU based model coupled with active learning that gets trained on most uncertain samples and captures the latent features of different data distribution. It reduces the uses of around 90% less training data, thereby significantly reducing the manual annotation efforts. We use the model pre-trained using active learning based approach to retrieve the infrastructure damage tweets originated from different regions. We obtain a minimum of 18% gain on F1-measure and considerably on other metrics over recent state-of-the-art IR techniques.
[1]
Somprakash Bandyopadhyay,et al.
Identifying Post-Disaster Resource Needs and Availabilities from Microblogs
,
2017,
2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).
[2]
Joydeep Chandra,et al.
Characterizing Infrastructure Damage After Earthquake: A Split-Query Based IR Approach
,
2018,
2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).
[3]
Carlos Castillo,et al.
AIDR: artificial intelligence for disaster response
,
2014,
WWW.
[4]
Joydeep Chandra,et al.
Where should one get news updates: Twitter or Reddit
,
2019,
Online Soc. Networks Media.