论文信息 - Using Graph Based Mapping of Co-occurring Words and Closeness Centrality Score for Summarization Evaluation

Using Graph Based Mapping of Co-occurring Words and Closeness Centrality Score for Summarization Evaluation

The use of predefined phrase patterns like: N-grams (N>=2), longest common sub sequences or pre defined linguistic patterns etc do not give any credit to non-matching/smaller-size useful patterns and thus, may result in loss of information. Next, the use of 1-gram based model results in several noisy matches. Additionally, due to presence of more than one topic with different levels of importance in summary, we consider summarization evaluation task as topic based evaluation of information content. Means at first stage, we identify the topics covered in given model/reference summary and calculate their importance. At the next stage, we calculate the information coverage in test /machine generated summary, w.r.t. every identified topic. We introduce a graph based mapping scheme and the concept of closeness centrality measure to calculate the information depth and sense of the co-occurring words in every identified topic. Our experimental results show that devised system is better than/comparable with best results of TAC 2011 AESOP dataset.

K. Srinathan | Vasudeva Varma | Niraj Kumar

[1] K. Srinathan,et al. An Effective Approach for AESOP and Guided Summarization Task , 2010, TAC.

[2] Jun-ichi Fukumoto,et al. Automated Summarization Evaluation with Basic Elements. , 2006, LREC.

[3] Ani Nenkova,et al. Evaluating Content Selection in Summarization: The Pyramid Method , 2004, NAACL.

[4] Ani Nenkova,et al. The Pyramid Method: Incorporating human content selection variation in summarization evaluation , 2007, TSLP.

[5] Rajeev Motwani,et al. The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[6] Hans van Halteren,et al. Evaluating Information Content by Factoid Analysis: Human annotation and stability , 2004, EMNLP.

[7] Eduard H. Hovy,et al. Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[8] Eduard Hovy,et al. Evaluating DUC 2005 using Basic Elements , 2005 .

[9] K. Srinathan,et al. Evaluating Information Coverage in Machine Generated Summary and Variable Length Documents , 2010, COMAD.