BioinQA: metadata-based multi-document QA system for addressing the issues in biomedical domain

Despite the availability of large amount of biomedical literature; extracting relevant information catering to the exact need of the user has been difficult in the absence of efficient domain specific information retrieval tools. Biomedical question answering (QA) systems require special techniques to address domain-specific issues, since a wide variety of user-groups having different information needs; terminology and level of understanding, etc., may access the information. While specialised information retrieval tools are not suitable for beginners, general purpose search engines are not intelligent enough to respond to domain specific questions. This paper presents an intelligent QA system that answers natural language questions while adapting itself to the level of user. The system constructs answers from multiple documents for complex comparison seeking questions. The system utilises metadata knowledge for addressing specific biomedical domain concerns like heterogeneity, acronyms, etc. Experiments ...

[1]  Maria Mirto,et al.  ProGenGrid: a workflow service infrastructure for composing and executing bioinformatics grid services , 2005, 18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05).

[2]  M. Ebell,et al.  Analysis of questions asked by family doctors regarding patient care , 1999, BMJ.

[3]  Vijay K. Vaishnavi,et al.  A methodology for semantic integration of metadata in bioinformatics data sources , 2005, ACM-SE 43.

[4]  Christophe Rigotti,et al.  Combining sequence and itemset mining to discover named entities in biomedical texts: a new type of pattern , 2009, Int. J. Data Min. Model. Manag..

[5]  Jeffrey Pomerantz,et al.  A linguistic analysis of question taxonomies , 2005, J. Assoc. Inf. Sci. Technol..

[6]  Gosse Bouma,et al.  Developing Offline Strategies for Answering Medical Questions , 2005 .

[7]  Lotfi A. Zadeh,et al.  From search engines to question answering systems - The problems of world knowledge, relevance, deduction and precisiation , 2006, Fuzzy Logic and the Semantic Web.

[8]  Alistair Moffat,et al.  Needles and Haystacks: a search engine for personal information collections , 2000, Proceedings 23rd Australasian Computer Science Conference. ACSC 2000 (Cat. No.PR00518).

[9]  Donato Malerba,et al.  A relational perspective on spatial data mining , 2008, Int. J. Data Min. Model. Manag..

[10]  Baoxin Li,et al.  Completing missing views for multiple sources of web media , 2008, Int. J. Data Min. Model. Manag..

[11]  S. Satya‐Murti Evidence-based Medicine: How to Practice and Teach EBM , 1997 .

[12]  Vadlamani Ravi,et al.  Predicting credit card customer churn in banks using data mining , 2008, Int. J. Data Anal. Tech. Strateg..

[13]  Fabio Rinaldi,et al.  Answering Questions in the Genomics Domain , 2004, ACL 2004.

[14]  Stefan Schulz,et al.  Biomedical text retrieval in languages with a complex morphology , 2002, ACL Workshop on Natural Language Processing in the Biomedical Domain.

[15]  Ankush Mittal,et al.  A Fully Automatic Question-Answering System for Intelligent Search in E-Learning Documents , 2005 .

[16]  Panagiotis Stamatopoulos,et al.  Summarization from Medical Documents: A Survey , 2005, Artif. Intell. Medicine.

[17]  George Hripcsak,et al.  Development, implementation, and a cognitive evaluation of a definitional question answering system for physicians , 2007, J. Biomed. Informatics.

[18]  Amr Badr,et al.  A Service Oriented Architecture for Biomedical Question Answering System , 2008, 2008 IEEE Congress on Services Part II (services-2 2008).

[19]  L. Stoimenov,et al.  Integration of GIS data sources over the Internet using mediator and wrapper technology , 2000, 2000 10th Mediterranean Electrotechnical Conference. Information Technology and Electrotechnology for the Mediterranean Countries. Proceedings. MeleCon 2000 (Cat. No.00CH37099).

[20]  Graeme Hirst,et al.  Answering Clinical Questions with Role Identification , 2003, BioNLP@ACL.

[21]  William R. Hersh,et al.  A survey of current work in biomedical text mining , 2005, Briefings Bioinform..

[22]  Michael K. Ng,et al.  SMART: a subspace clustering algorithm that automatically identifies the appropriate number of clusters , 2009, Int. J. Data Min. Model. Manag..

[23]  Young-In Song,et al.  Terminology Indexing and Reweighting methods for Biomedical Text Retrieval , 2004 .

[24]  Pierre Zweigenbaum Question answering in biomedicine , 2003 .

[25]  Xiaohui Liu,et al.  Combining multiple classifiers for wrapper feature selection , 2008, Int. J. Data Min. Model. Manag..

[26]  Pierre Zweigenbaum,et al.  Towards a Medical Question-Answering System: a Feasibility Study , 2003, MIE.

[27]  Nan Wang,et al.  Automatic composite wrapper generation for semi-structured biological data based on table structure identification , 2004, SGMD.

[28]  Ankush Mittal,et al.  Question Processing and Clustering in INDOC: A Biomedical Question Answering System , 2007, EURASIP J. Bioinform. Syst. Biol..

[29]  Brigitte Grau,et al.  EQueR: the French Evaluation campaign of Question-Answering Systems , 2006, LREC.