Machine Learning Approach for the Automatic Annotation of the Events

After the beginning of the extension of current Web towards the semantics, the annotation starts to take a significant role, since it takes part to give the semantic aspect to the different types of documents. With the proliferation of news articles from thousands of different sources now available on the Web, summarization of such information is becoming increasingly important. We will define a methodological approach to extract the events from the news articles and to annotate them according to the principal events which they contain. Considering the large number of news source (for examples, BBC, Reuters, CNN...), every day, thousands of articles are produced in the entire world concerning a given event. This is why we have to think to automate the process of annotation of such articles.

[1]  Amit P. Sheth,et al.  Meteor-s web service annotation framework , 2004, WWW '04.

[2]  Saleem Abuleil,et al.  Events extraction and classification for Arabic information retrieval systems , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[3]  Nicholas Kushmerick,et al.  Event Extraction from Heterogeneous News Sources , 2006 .

[4]  Dragomir R. Radev A Common Theory of Information Fusion from Multiple Text Sources Step One: Cross-Document Structure , 2000, SIGDIAL Workshop.

[5]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[6]  Philippe Smets The transferable belief model and other interpretations of Dempster-Shafer's model , 1990, UAI.

[7]  Ghassan Mourad,et al.  Analyse informatique des signes typographiques pour la segmentation de textes et l'extraction automatique de citations : réalisation des applications informatiques : SegATex et CitaRE , 2001 .

[8]  I. Biskri,et al.  Hybrid approach for the assistance in the events extraction in great textual data bases , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[9]  Rim Faiz Identifying Relevant Sentences in News Articles for Event Information Extraction , 2006, Int. J. Comput. Process. Orient. Lang..

[10]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[11]  Jean-Pierre Desclés,et al.  EXCOM: An Automatic Annotation Engine for Semantic Information , 2006, FLAIRS.

[12]  Colin de la Higuera,et al.  Probabilistic DFA Inference using Kullback-Leibler Divergence and Minimality , 2000, ICML.

[13]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[14]  Tod S. Levitt,et al.  Uncertainty in artificial intelligence , 1988 .

[15]  Inderjeet Mani,et al.  Guidelines for Annotating Temporal Information , 2001, HLT.

[16]  Hongyuan Zha,et al.  Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering , 2002, SIGIR '02.

[17]  Xavier Tannier,et al.  Annotating and measuring temporal relations in texts , 2004, COLING.

[18]  Nicholas Kushmerick,et al.  Clustering Sentences for Discovering Events in News Articles , 2006, ECIR.

[19]  Marja-Riitta Koivunen,et al.  Annotea: an open RDF infrastructure for shared Web annotations , 2001, WWW '01.