Evaluation of Query-Based Arabic Text Summarization System

In this paper, we present and analyze the results of the application of Arabic query-based text summarization system - AQBTSS - in an attempt to produce a query-oriented summary for a single Arabic document. For this task, we adapted the traditional vector space model (VSM) and the cosine similarity measure to find the most relevant passages extracted form Arabic document to produce a text summary. We aim at using the short summaries in some natural language (NL) tasks such as generating answers for Arabic open domain question answering system (AQAS) as well as experimenting with categorizing Arabic scripts. The obtained results indicate that our simple approach for text summarization is promising.

[1]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[2]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[3]  Hongyan Jing,et al.  Sentence Reduction for Automatic Text Summarization , 2000, ANLP.

[4]  Wei Li,et al.  A Question Answering System Supported by Information Extraction , 2000, ANLP.

[5]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[6]  Sanda M. Harabagiu,et al.  Experiments with Open-Domain Textual Question Answering , 2000, COLING.

[7]  Jimmy J. Lin,et al.  Quantitative evaluation of passage retrieval algorithms for question answering , 2003, SIGIR.

[8]  Inderjeet Mani,et al.  The Tipster Summac Text Summarization Evaluation , 1999, EACL.

[9]  Min-Yen Kan,et al.  Applying Natural Language Generation to Indicative Summarization , 2001, EWNLG@ACL.

[10]  Gustave J. Rath,et al.  The formation of abstracts by the selection of sentences , 1961 .

[11]  Claire Cardie,et al.  Multidocument Summarization via Information Extraction , 2001, HLT.

[12]  Satoshi Sekine,et al.  A survey for Multi-Document Summarization , 2003, HLT-NAACL 2003.

[13]  BASSAM HAMMO,et al.  Experimenting with a Question Answering System for the Arabic Language , 2004, Comput. Humanit..

[14]  James P. Callan,et al.  Passage-level evidence in document retrieval , 1994, SIGIR '94.

[15]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[16]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[17]  Sur-Jin Ker,et al.  A Text Categorization Based on a Summarization Extraction , 2000 .

[18]  Hugo Zaragoza,et al.  Information Retrieval: Algorithms and Heuristics , 2002, Information Retrieval.

[19]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .