Factorization machines and deep views-based co-training for improving answer quality prediction in online health expert question-answering services

In online health expert question-answering (HQA) services, it is significant to automatically determine the quality of the answers. There are two prominent challenges in this task. First, the answers are usually written in short text, which makes it difficult to absorb the text semantic information. Second, it usually lacks sufficient labeled data but contains a huge amount of unlabeled data. To tackle these challenges, we propose a novel deep co-training framework based on factorization machines (FM) and deep textual views to intelligently and automatically identify the quality of HQA systems. More specifically, we exploit additional domain-specific semantic information from domain-specific word embeddings to expand the semantic space of short text and apply FM to excavate the non-independent interaction relationships among diverse features within individual views for improving the performance of the base classifier via co-training. Our learned deep textual views, the convolutional neural networks (CNN) view which focuses on extracting local features using convolution filters to locally model short text and the dependency-sensitive convolutional neural networks (DSCNN) view which focuses on capturing long-distance dependency information within the text to globally model short text, can then overcome the challenge of feature sparseness in the short text answers from the doctors. The developed co-training framework can effectively mine the highly non-linear semantic information embedded in the unlabeled data and expose the highly non-linear relationships between different views, which minimizes the labeling effort. Finally, we conduct extensive empirical evaluations and demonstrate that our proposed method can significantly improve the predictive performance of the answer quality in the context of HQA services.

[1]  Ma Jian-fen Modify the Method of Feature's Weight in Text Classfication , 2009 .

[2]  J. Oh,et al.  Research agenda for social Q&A , 2009 .

[3]  Ee-Peng Lim,et al.  Web classification of conceptual entities using co-training , 2011, Expert Syst. Appl..

[4]  Qing Chen,et al.  Predicting the quality of online health expert question-answering services with temporal features in a deep learning framework , 2018, Neurocomputing.

[5]  Truyen Tran,et al.  Predicting healthcare trajectories from medical records: A deep learning approach , 2017, J. Biomed. Informatics.

[6]  Ludmila I. Kuncheva,et al.  A Theoretical Study on Six Classifier Fusion Strategies , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[8]  Haifeng Hu,et al.  Exploring social features for answer quality prediction in CQA portals , 2013, 2013 International Conference on Machine Learning and Cybernetics.

[9]  Yuan Luo,et al.  Recurrent Neural Networks for Classifying Relations in Clinical Notes , 2017, AMIA.

[10]  Susumu Horiguchi,et al.  Learning to classify short and sparse text & web with hidden topics from large-scale data collections , 2008, WWW.

[11]  Kilian Q. Weinberger,et al.  From sBoW to dCoT marginalized encoders for text representation , 2012, CIKM '12.

[12]  Robert Kincaid,et al.  Nicky: Toward a Virtual Assistant for Test and Measurement Instrument Recommendations , 2017, 2017 IEEE 11th International Conference on Semantic Computing (ICSC).

[13]  Mehran Sahami,et al.  A web-based kernel function for measuring the similarity of short text snippets , 2006, WWW '06.

[14]  Tengfei Liu,et al.  Recurrent networks with attention and convolutional networks for sentence representation and classification , 2018, Applied Intelligence.

[15]  Ming Dong,et al.  A study of the effectiveness of machine learning methods for classification of clinical interview fragments into a large number of categories , 2016, J. Biomed. Informatics.

[16]  Eugene Agichtein,et al.  Learning to recognize reliable users and content in social media with coupled mutual reinforcement , 2009, WWW '09.

[17]  W. Bruce Croft,et al.  A framework to predict the quality of answers with non-textual features , 2006, SIGIR.

[18]  Bin He,et al.  De-identification of medical records using conditional random fields and long short-term memory networks , 2017, Journal of biomedical informatics.

[19]  Stan Matwin,et al.  Email classification with co-training , 2011, CASCON.

[20]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[21]  Shuang Wu,et al.  An integrated neural model for sentence classification , 2018, 2018 Chinese Control And Decision Conference (CCDC).

[22]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Michael R. Lyu,et al.  Boosting Response Aware Model-Based Collaborative Filtering , 2015, IEEE Transactions on Knowledge and Data Engineering.

[24]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[25]  Jian Feng,et al.  Predicting the quality of user-generated answers using co-training in community-based question answering portals , 2015, Pattern Recognit. Lett..

[26]  Ausif Mahmood,et al.  Convolutional Recurrent Deep Learning Model for Sentence Classification , 2018, IEEE Access.

[27]  Xiaohui Yan,et al.  A biterm topic model for short texts , 2013, WWW.

[28]  R. Bharat Rao,et al.  Bayesian Co-Training , 2007, J. Mach. Learn. Res..

[29]  Chen Change Loy,et al.  Development of fine-grained pill identification algorithm using deep convolutional network , 2017, J. Biomed. Informatics.

[30]  Pavel Braslavski,et al.  Towards Automatic Evaluation of Health-Related CQA Data , 2014, CLEF.

[31]  Antonio Jimeno Yepes,et al.  Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation , 2016, 1604.02506.

[32]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[33]  Xiaolong Wang,et al.  De-identification of clinical notes via recurrent neural network and conditional random field. , 2017, Journal of biomedical informatics.

[34]  Bin Sheng,et al.  Computer-Assisted Decision Support System in Pulmonary Cancer detection and stage classification on CT images , 2018, J. Biomed. Informatics.

[35]  William Speier,et al.  Using phrases and document metadata to improve topic modeling of clinical reports , 2016, J. Biomed. Informatics.

[36]  Elizabeth Sklar,et al.  Longitudinal analysis of discussion topics in an online breast cancer community using convolutional neural networks , 2016, J. Biomed. Informatics.

[37]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[38]  Wai-Tat Fu,et al.  Empathic Virual Assistant for Healthcare Information with Positive Emotional Experience , 2016, 2016 IEEE International Conference on Healthcare Informatics (ICHI).

[39]  Il-Chul Moon,et al.  Identifying prescription patterns with a topic model of diseases and medications , 2017, J. Biomed. Informatics.

[40]  Hakan Ferhatosmanoglu,et al.  Short text classification in twitter to improve information filtering , 2010, SIGIR.

[41]  Mourad Gridach,et al.  Character-level neural network for biomedical named entity recognition , 2017, J. Biomed. Informatics.

[42]  Artem Kuznetsov,et al.  Characterizing Health-Related Community Question Answering , 2013, ECIR.

[43]  Tung Tran,et al.  Predicting mental conditions based on "history of present illness" in psychiatric notes with deep neural networks. , 2017, Journal of biomedical informatics.

[44]  Wai-Tat Fu,et al.  Mindbot: A Social-Based Medical Virtual Assistant , 2016, 2016 IEEE International Conference on Healthcare Informatics (ICHI).

[45]  Joel J. P. C. Rodrigues,et al.  Mobile-health: A review of current state in 2015 , 2015, J. Biomed. Informatics.

[46]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[47]  Haiqin Yang,et al.  A deep learning approach for predicting the quality of online health expert question-answering services , 2016, J. Biomed. Informatics.

[48]  Enrico W. Coiera,et al.  A shared latent space matrix factorisation method for recommending new trial evidence for systematic review updates , 2017, J. Biomed. Informatics.

[49]  Haiqin Yang,et al.  Heterogeneous Features Integration in Deep Knowledge Tracing , 2017, ICONIP.

[50]  Ramakanth Kavuluru,et al.  Ordinal convolutional neural networks for predicting RDoC positive valence psychiatric symptom severity scores. , 2017, Journal of biomedical informatics.

[51]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[52]  Sharma Chakravarthy,et al.  Answer Quality Prediction in Q/A Social Networks by Leveraging Temporal Features , 2013, Int. J. Next Gener. Comput..

[53]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[54]  Ming Li,et al.  The best answer prediction by exploiting heterogeneous data on software development Q&A forum , 2017, Neurocomputing.

[55]  Jung-Tae Lee,et al.  Predicting the Quality of Answers Using Surface Linguistic Features , 2007, Sixth International Conference on Advanced Language Processing and Web Information Technology (ALPIT 2007).

[56]  Adam Worrall,et al.  Quality of health answers in social Q&A , 2012, ASIST.

[57]  Lina M. Sulieman,et al.  Classifying patient portal messages using Convolutional Neural Networks , 2017, J. Biomed. Informatics.

[58]  Michael R. Lyu,et al.  Online Nonlinear AUC Maximization for Imbalanced Data Sets , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[59]  Todd R. Johnson,et al.  Improving the utility of MeSH® terms using the TopicalMeSH representation , 2016, J. Biomed. Informatics.

[60]  Zhengxing Huang,et al.  On mining latent topics from healthcare chat logs , 2016, J. Biomed. Informatics.

[61]  Sheizaf Rafaeli,et al.  Predictors of answer quality in online Q&A sites , 2008, CHI.

[62]  Jeffrey Pomerantz,et al.  Evaluating and predicting answer quality in community QA , 2010, SIGIR.

[63]  Eugene Agichtein,et al.  Predicting information seeker satisfaction in community question answering , 2008, SIGIR '08.

[64]  Haiqin Yang,et al.  Implicit Heterogeneous Features Embedding in Deep Knowledge Tracing , 2018, Cognitive Computation.

[65]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[66]  Mengen Chen,et al.  Short Text Classification Improved by Learning Multi-Granularity Topics , 2011, IJCAI.

[67]  Yijia Zhang,et al.  A hybrid model based on neural networks for biomedical relation extraction , 2018, J. Biomed. Informatics.