Term-Based Personalization for Feature Selection in Clinical Handover Form Auto-Filling

Feature learning and selection have been widely applied in many research areas because of their good performance and lower complexity. Traditional methods usually treat all terms with same feature sets, such that performance can be damaged when noisy information is brought via wrong features for a given term. In this paper, we propose a term-based personalization approach to finding the best features for each term. First, features are given as the input so that we focus on selection strategies. Second, the importance of each feature subset to a given term is evaluated by the term-feature probabilistic relevance model. We present a feature searching method to generate feature candidate subsets for each term, since evaluating all the possible feature subsets is computationally intensive. Finally, we obtain the personalized feature set for each term as a subset of all features. Experiments have been conducted on the NICTA Synthetic Nursing Handover dataset and the results show that our approach is promising and effective.

[1]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[2]  Stjepan Oreski,et al.  Genetic algorithm-based heuristic for feature selection in credit risk assessment , 2014, Expert Syst. Appl..

[3]  Thierry Hamon,et al.  Disease and Disorder Template Filling using Rule-based and Statistical Approaches , 2014, CLEF.

[4]  Ellen Riloff,et al.  Stacked Generalization for Medical Concept Extraction from Clinical Notes , 2015, BioNLP@IJCNLP.

[5]  José Orlando Gomes,et al.  Handoff strategies in settings with high consequences for failure: lessons for health care operations. , 2004, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[6]  Evangelos Kanoulas,et al.  Distributional Semantics for Medical Information Extraction , 2016, CLEF.

[7]  D. T. Tran miph,et al.  Classifying nursing errors in clinical management within an Australian hospital , 2010 .

[8]  Yan Ge,et al.  Current Development and Technology in the Information Extraction for Clinical Narrative Text , 2015 .

[9]  Jing Liu,et al.  Clustering-Guided Sparse Structural Learning for Unsupervised Feature Selection , 2014, IEEE Transactions on Knowledge and Data Engineering.

[10]  Jihoon Yang,et al.  Feature Subset Selection Using a Genetic Algorithm , 1998, IEEE Intell. Syst..

[11]  Maximilian Eibl,et al.  Wrappers for Feature Subset Selection in CRF-based Clinical Information Extraction , 2016, CLEF.

[12]  Wei Liu,et al.  Semi-supervised multiview feature selection with label learning for VHR remote sensing images , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[13]  Dominique Estival,et al.  A usability framework for speech recognition technologies in clinical handover: A pre-implementation study , 2014, Journal of Medical Systems.

[14]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[15]  Liyuan Zhou,et al.  Task 1 of the CLEF eHealth Evaluation Lab 2016: Handover Information Extraction , 2016, CLEF.

[16]  Yogesh R. Shepal A Fast Clustering-Based Feature Subset Selection Algorithm for High Dimensional Data , 2014 .

[17]  Kristian Kersting,et al.  Learning Using Unselected Features (LUFe) , 2016, IJCAI.

[18]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[19]  Jenelle Matic,et al.  Review: bringing patient safety to the forefront through structured computerisation during clinical handover. , 2011, Journal of clinical nursing.

[20]  Jayanthi Sivaswamy,et al.  Regenerative Random Forest with Automatic Feature Selection to Detect Mitosis in Histopathological Breast Cancer Images , 2015, MICCAI.

[21]  Ahmad Taher Azar,et al.  Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis , 2014, Comput. Methods Programs Biomed..

[22]  Linda Dawson,et al.  A systematic review of speech recognition technology in health care , 2014, BMC Medical Informatics and Decision Making.

[23]  J. Basilakis,et al.  Comparing nursing handover and documentation: forming one set of patient information. , 2014, International nursing review.

[24]  Zenglin Xu,et al.  Discriminative Semi-Supervised Feature Selection Via Manifold Regularization , 2009, IEEE Transactions on Neural Networks.

[25]  Huan Liu,et al.  Semi-supervised Feature Selection via Spectral Analysis , 2007, SDM.

[26]  Gabriela Ferraro,et al.  Benchmarking Clinical Speech Recognition and Information Extraction: New Data, Methods, and Evaluations , 2015, JMIR medical informatics.

[27]  Louise Deléger,et al.  A sequence labeling approach to link medications and their attributes in clinical notes and clinical trial announcements for information extraction , 2012, J. Am. Medical Informatics Assoc..

[28]  J.C. Rajapakse,et al.  SVM-RFE With MRMR Filter for Gene Selection , 2010, IEEE Transactions on NanoBioscience.

[29]  Filiberto Pla,et al.  Supervised feature selection by clustering using conditional mutual information-based distances , 2010, Pattern Recognit..

[30]  Bruce E. Bray,et al.  Congestive heart failure information extraction framework for automated treatment performance measures assessment , 2017, J. Am. Medical Informatics Assoc..

[31]  R. Sundaram A First Course in Optimization Theory: Optimization in ℝ n , 1996 .

[32]  Huan Liu,et al.  Searching for interacting features in subset selection , 2009, Intell. Data Anal..

[33]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[34]  Yang Song,et al.  Estimating Probability Density of Content Types for Promoting Medical Records Search , 2016, ECIR.

[35]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[36]  Dominique Estival,et al.  Capturing patient information at nursing shift changes: methodological evaluation of speech recognition and information extraction , 2015, J. Am. Medical Informatics Assoc..