论文信息 - Automated Identification of Potential Conflict-of-Interest in Biomedical Articles Using Hybrid Deep Neural Network

Automated Identification of Potential Conflict-of-Interest in Biomedical Articles Using Hybrid Deep Neural Network

Conflicts-of-interest (COI) in biomedical research may cause ethical risks, including pro-industry conclusions, restrictions on the behavior of investigators, and the use of biased study designs. To ensure the impartiality and objectivity in research, many journal publishers require authors to provide a COI statement within the body text of their articles at the time of peer-review and publication. However, author’s self-reported COI disclosure often does not explicitly appear in their article, and may not be very accurate or reliable. In this study, we present a two-stage machine learning scheme using a hybrid deep learning neural network (HDNN) that combines a multi-channel convolutional neural network (CNN) and a feed-forward neural network (FNN), to automatically identify a potential COI in online biomedical articles. HDNN is designed to simultaneously learn a syntactic and semantic representation of text, relationships between neighboring words in a sentence, and handcrafted input features, and achieves a better performance overall (accuracy exceeding 96.8%) than other classifiers such as support vector machine (SVM), single/multi-channel CNNs, Long Short-term Memory (LSTM), and an Ensemble model in a series of classification experiments.

In-Cheol Kim | George R. Thoma

[1] Robert E. Mercer,et al. A Design Methodology for a Biomedical Literature Indexing Tool Using the Rhetoric of Science , 2004, HLT-NAACL 2004.

[2] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[3] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[4] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[5] Phil Blunsom,et al. A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[6] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[7] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[8] Maria Simi,et al. Experiments on the Use of Feature Selection and Negative Evidence in Automated Text Categorization , 2000, ECDL.

[9] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[10] Xiang Zhang,et al. Character-level Convolutional Networks for Text Classification , 2015, NIPS.

[11] Hung Hum,et al. Is Naïve Bayes a Good Classifier for Document Classification , 2011 .

[12] Pushpak Bhattacharyya,et al. IITP at SemEval-2017 Task 5: An Ensemble of Deep Learning and Feature Based Models for Financial Sentiment Analysis , 2017, *SEMEVAL.

[13] Awais Athar,et al. Sentiment Analysis of Citations using Sentence Structure-Based Features , 2011, ACL.

[14] Cícero Nogueira dos Santos,et al. Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts , 2014, COLING.

[15] Ming Zhou,et al. Question Answering over Freebase with Multi-Column Convolutional Neural Networks , 2015, ACL.

[16] Mihai Surdeanu,et al. The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[17] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[18] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..