A study on question answering system using integrated retrieval method

In recent years, Question Answering (QA) system has been researched extensively and automatic question answering has become an interesting research field and resulted in a visible improvement in its performance. Especially, during the last decade, a number of automatic QA systems have emerged, which has been largely driven by the TREC (Text REtrieval Conference) QA Track. The technology of QA relates to a lot of aspects of NLP (Natural Language Processing), such as Information Retrieval (IR), Information Extraction (IE), Automatic Summarization, Conversation Interface, etc. However, recently the QA systems have emerged following two directions: one direction is to use the TREC QA data as a testing corpus and develop their own search engines and answer extraction techniques on top of the corpus; Another direction is to use the WWW as the potential answer source and use generic search engines, such as Google, to extract the answers for the users’ questions. Although, the current trend in Question Answering focus on open domain, the open domain is lacking to treat the restricted domains and all question types, because no restriction is imposed either on the user’s special vocabulary or on the question type, and it is very hard to construct a common knowledge (ontology) base for open domain. From the viewpoint of practicality, many researchers also begin to focus their attentions on the restricted domain QA and have built some advanced QA systems for restricted domain. On the other hand, since Chinese text retrieval has just been developed lately and there are many various specific characteristics in Chinese language, the research of Chinese QA using natural language was developed later than that in western countries and Japan. But, it has been started to pay attention to Chinese Question Answering in recent years. In this thesis, a fundamental start originates from the usability and limitation

[1]  Katsunobu Itou,et al.  Effects of language modeling on speech-driven question answering , 2004, INTERSPEECH.

[2]  Ulf Hermjakob,et al.  Parsing and Question Classification for Question Answering , 2001, ACL 2001.

[3]  Yutao Guo,et al.  Chinese Question Answering with Full-Text Retrieval Re-Visited , 2004 .

[4]  Gregory A. Marton,et al.  Sepia : semantic parsing for named entities , 2003 .

[5]  Richard F. E. Sutcliffe,et al.  Question Answering Using the DLT System at TREC 2002 , 2002, TREC.

[6]  Joseph Weizenbaum,et al.  ELIZA—a computer program for the study of natural language communication between man and machine , 1966, CACM.

[7]  W. Bruce Croft,et al.  Evaluating Question-Answering Techniques in Chinese , 2001, HLT.

[8]  Chin-Yew Lin The Effectiveness of Dictionary and Web-Based Answer Reranking , 2002, COLING.

[9]  Shi-Jim Yen,et al.  The Design and Implementation of Chinese Semantic Search Engine Based on Faq Corpus and Ontology Construction from Information Extraction , 2022 .

[10]  Bernardo Magnini,et al.  Is It the Right Answer? Exploiting Web Redundancy for Answer Validation , 2002, ACL.

[11]  Hiroyuki Kojima,et al.  Chinese Term Extraction from Web Pages Based on Compound Term Productivity , 2004, SIGHAN@ACL.

[12]  Jimmy J. Lin,et al.  Web question answering: is more always better? , 2002, SIGIR '02.

[13]  SaltonGerard,et al.  Term-weighting approaches in automatic text retrieval , 1988 .

[14]  Stephen E. Robertson,et al.  Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.

[15]  Katunobu Itou,et al.  Towards Speech-Driven Question Answering: Experiments Using the NTCIR-3 Question Answering Collection , 2002, NTCIR.

[16]  M. de Rijke,et al.  Tequesta: The University of Amsterdam's Textual Question Answering System , 2001, TREC.

[17]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.

[18]  Hsiu-Hsen Yao,et al.  Chinese question-answering system , 2004, Journal of Computer Science and Technology.

[19]  M. E. Maron,et al.  On Relevance, Probabilistic Indexing and Information Retrieval , 1960, JACM.

[20]  Sanda M. Harabagiu,et al.  LASSO: A Tool for Surfing the Answer Net , 1999, TREC.

[21]  Dell Zhang,et al.  Question classification using support vector machines , 2003, SIGIR.

[22]  Zhang Lin A Chinese Spoken Dialogue System about Real-time Stock Information , 2004 .

[23]  Adwait Ratnaparkhi,et al.  IBM's Statistical Question Answering System , 2000, TREC.

[24]  Ellen M. Voorhees,et al.  The Ninth Text REtrieval Conference (TREC-9) , 2001 .

[25]  Yorick Wilks,et al.  Can We Make Information Extraction More Adaptive? , 1999, SCIE.

[26]  Walter Kintsch,et al.  Comprehension: A Paradigm for Cognition , 1998 .

[27]  Zhu Sheng An Efficient Stochastic Context Free Parsing Algorithm , 1998 .

[28]  Sanda M. Harabagiu,et al.  FALCON: Boosting Knowledge for Answer Engines , 2000, TREC.

[29]  Roger Levy,et al.  Is it Harder to Parse Chinese, or the Chinese Treebank? , 2003, ACL.

[30]  Shingo Kuroiwa,et al.  Chinese Automatic Question Answering System of Specific-domain Based on Vector Space Model , 2005 .

[31]  Boris Katz,et al.  From Sentence Processing to Information Access on the World Wide Web , 1997 .

[32]  Scott Miller,et al.  TREC 2002 QA at BBN: Answer Selection and Confidence Estimation , 2002, TREC.

[33]  Oren Tsur,et al.  BioGrapher: Biography Questions as a Restricted Domain Question Answering Task , 2004 .

[34]  Dell Zhang,et al.  Web Based Question Answering with Aggregation Strategy , 2004, APWeb.

[35]  Alistair Moffat,et al.  Exploring the similarity space , 1998, SIGF.

[36]  Bert F. Green,et al.  Baseball: an automatic question-answerer , 1899, IRE-AIEE-ACM '61 (Western).

[37]  W. Bruce Croft,et al.  Using Probabilistic Models of Document Retrieval without Relevance Information , 1979, J. Documentation.

[38]  Susan T. Dumais,et al.  An Analysis of the AskMSR Question-Answering System , 2002, EMNLP.

[39]  Harris Wu,et al.  Probabilistic question answering on the web , 2002, WWW '02.

[40]  Steven D. Whitehead,et al.  Auto-FAQ: An Experiment in Cyberspace Leveraging , 1995, Comput. Networks ISDN Syst..

[41]  Mark Sanderson,et al.  University of Sheffield TREC-8 Q&A System , 1999, TREC.

[42]  Jimmy J. Lin,et al.  Overview of the TREC 2007 Question Answering Track , 2008, TREC.

[43]  Martin M. Soubbotin,et al.  Use of Patterns for Detection of Likely Answer Strings: A Systematic Approach , 2002, TREC.

[44]  Sanda M. Harabagiu,et al.  Answering Complex, List and Context Questions with LCC's Question-Answering Server , 2001, TREC.

[45]  F. Ren,et al.  Web-based question answering system for restricted domain based of integrating method using semantic information , 2005, 2005 International Conference on Natural Language Processing and Knowledge Engineering.

[46]  M. A. R T A P A L,et al.  The Penn Chinese TreeBank: Phrase structure annotation of a large corpus , 2005, Natural Language Engineering.

[47]  Kazuhide Yamamoto,et al.  Performance Evaluation of Chinese Analyzers with Support Vector Machines , 2003 .

[48]  Lynette Hirschman,et al.  Deep Read: A Reading Comprehension System , 1999, ACL.

[49]  Christian Jacquemin,et al.  Terminological Variants for Document Selection and Question/Answer Matching , 2001, ACL 2001.

[50]  Sanda M. Harabagiu,et al.  Answer Mining by Combining Extraction Techniques with Abductive Reasoning , 2003, Text Retrieval Conference.

[51]  Charles L. A. Clarke,et al.  Statistical Selection of Exact Answers (MultiText Experiments for TREC 2002) , 2002, TREC.

[52]  Kristian J. Hammond,et al.  Question Answering from Frequently Asked Question Files: Experiences with the FAQ FINDER System , 1997, AI Mag..

[53]  Qun Liu,et al.  HHMM-based Chinese Lexical Analyzer ICTCLAS , 2003, SIGHAN.

[54]  Diego Mollá Aliod,et al.  A real world implementation of answer extraction , 1998, Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130).

[55]  Wei Li,et al.  Information Extraction Supported Question Answering , 1999, TREC.

[56]  Cai Dong,et al.  Research on Web-based Chinese Question Answering System and Answer Extraction , 2004 .

[57]  Farah Benamara Cooperative Question Answering in Restricted Domains: the WEBCOOP Experiment , 2004 .

[58]  Fan Xiao-zhong A Study on a Bank-Domain Automatic Chinese Question-Answering System BAQS , 2004 .

[59]  Shingo Kuroiwa,et al.  A Question Answering System on Special Domain and the Implementation of Speech Interface , 2006, CICLing.

[60]  Lynette Hirschman,et al.  Natural language question answering: the view from here , 2001, Natural Language Engineering.

[61]  William A. Woods,et al.  Progress in natural language understanding: an application to lunar geology , 1973, AFIPS National Computer Conference.

[62]  Chengqing Zong,et al.  Chinese Utterance Segmentation in Spoken Language Translation , 2003, CICLing.

[63]  Qun Liu,et al.  Semantic computation in a Chinese Question-Answering system , 2002, Journal of Computer Science and Technology.

[64]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[65]  Edward James Schofield,et al.  A Speech Interface for Open-Domain Question-Answering , 2003, ACL.

[66]  Zhiping Zheng,et al.  AnswerBus question answering system , 2002 .

[67]  Richard M. Schwartz,et al.  An Algorithm that Learns What's in a Name , 1999, Machine Learning.

[68]  Terry Winograd,et al.  Understanding natural language , 1974 .

[69]  Walter Daelemans,et al.  Complex answers: a case study using a WWW question answering system , 2001, Natural Language Engineering.

[70]  Stephen E. Robertson,et al.  On relevance weights with little relevance information , 1997, SIGIR '97.

[71]  Jun Suzuki,et al.  Question Classification using HDAG Kernel , 2003, ACL 2003.

[72]  Xu Bo,et al.  Research on Question Answering & Evaluation: A Survey , 2005 .

[73]  Zimin Wu,et al.  Chinese Text Segmentation for Text Retrieval: Achievements and Problems , 1993, J. Am. Soc. Inf. Sci..

[74]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[75]  Stephen E. Robertson,et al.  Okapi/Keenbow at TREC-8 , 1999, TREC.

[76]  Richard Sproat,et al.  A statistical method for finding word boundaries in Chinese text , 1990 .

[77]  W. Bruce Croft,et al.  Document Retrieval and Routing Using the INQUERY System , 1994, TREC.

[78]  Mario Martín,et al.  Essence: A Portable Methodology for Acquiring Information Extraction Patterns , 2000, ECAI.

[79]  Robert F. Simmons,et al.  Answering English questions by computer: a survey , 1965, CACM.

[80]  Young-In Song,et al.  A Practical QA System in Restricted Domains , 2004 .

[81]  Qiang Zhou,et al.  Using Co-occurrence Statistics as an Information Source for Partial Parsing of Chinese , 2000, ACL 2000.

[82]  Scott B. Huffman,et al.  Learning information extraction patterns from examples , 1995, Learning for Natural Language Processing.

[83]  W. Bruce Croft,et al.  Inference networks for document retrieval , 1989, SIGIR '90.

[84]  Remi Zajac Towards Ontological Question Answering , 2001, ACL 2001.

[85]  Krzysztof Czuba,et al.  One Search Engine or Two for Question-Answering , 2000, TREC.

[86]  Jaime G. Carbonell,et al.  The JAVELIN Question-Answering System at TREC 2003: A Multi-Strategh Approach with Dynamic Planning , 2003, TREC.

[87]  Daniel G. Bobrow,et al.  GUS, A Frame-Driven Dialog System , 1986, Artif. Intell..

[88]  Fredric C. Gey,et al.  Chinese text retrieval without using a dictionary , 1997, SIGIR '97.

[89]  Salim Roukos,et al.  Automatic Derivation of Surface Text Patterns for a Maximum Entropy Based Question Answering System , 2003, NAACL.

[90]  David Chiang,et al.  Two Statistical Parsing Models Applied to the Chinese Treebank , 2000, ACL 2000.

[91]  S. Robertson The probability ranking principle in IR , 1997 .

[92]  Martin M. Soubbotin Patterns of Potential Answer Expressions as Clues to the Right Answers , 2001, TREC.

[93]  Douglas E. Appelt,et al.  SRI International FASTUS SystemMUC-6 Test Results and Analysis , 1995, MUC.

[94]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[95]  W. Bruce Croft,et al.  The INQUERY Retrieval System , 1992, DEXA.

[96]  Charles L. A. Clarke,et al.  Question Answering by Passage Selection (MultiText Experiments for TREC-9) , 2000, TREC.

[97]  Shih-Hung Wu,et al.  ASQA: Academia Sinica Question Answering System for NTCIR-5 CLQA , 2005, NTCIR.

[98]  Shingo Kuroiwa,et al.  A New Question Answering System for Chinese Restricted Domain , 2006, IEICE Trans. Inf. Syst..

[99]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[100]  Oren Etzioni,et al.  Scaling question answering to the Web , 2001, WWW '01.

[101]  Stephen E. Robertson,et al.  GatfordCentre for Interactive Systems ResearchDepartment of Information , 1996 .

[102]  Graeme Hirst,et al.  Analysis of Semantic Classes in Medical Text for Question Answering , 2004 .

[103]  Guan Yi,et al.  The Research on Professional Website Oriented Chinese Question Answering System , 2001 .