Text Classification of Flu-Related Tweets Using FastText with Sentiment and Keyword Features
暂无分享,去创建一个
In this study, we present a framework for fluprediction/detection based on the available data of Social Networking Sites (SNS). The framework uses a state-of-the-art text classifier, which is FastText, to classify Twitter posts into flu-related or flu-unrelated posts. The FastText based framework is trained and tested using a pre-labeled dataset and utilizing the features of sentiment analysis and predefined keyword occurrences in addition to textual features. Results show that the framework improves the accuracy, in addition to the efficiency of flu disease surveillance systems that use unstructured data such as posts of Social Networking Sites.
[1] Miad Faezipour,et al. A review of influenza detection and prediction through social networking sites , 2018, Theoretical Biology and Medical Modelling.
[2] Mark Dredze,et al. Separating Fact from Fear: Tracking Flu Infections on Twitter , 2013, NAACL.
[3] Steven Bird,et al. NLTK: The Natural Language Toolkit , 2002, ACL.
[4] Tomas Mikolov,et al. Bag of Tricks for Efficient Text Classification , 2016, EACL.