ChronoSeeker: search engine for future and past events

In this paper, we propose an on-demand search engine called ChronoSeeker, which allows users to find past/future events based on their interest. Our goal is providing a search engine which can collect as many future/past events as possible relevant to user's query in obtaining various future scenarios considering both predictions and histories. Two technical issues are treated, (1) efficient search method for event information and (2) accurate filtering method for removing noises from search results. To search for event information effectively, our system expands a user query by some typical expressions related to event information such as year expressions, temporal modifiers and context terms. To remove noisy information, we selected five types of features for a machine learning technique to classify candidates into event information or not. Our experiment showed that filtering performance achieved an 85% F-measure, and that query expansion can collect dozens of times more CEs than those without expansion.

[1]  Jeffrey P. Bigham,et al.  Organizing and Searching the World Wide Web of Facts - Step One: The One-Million Fact Extraction Challenge , 2006, AAAI.

[2]  Munmun De Choudhury,et al.  Can blog communication dynamics be correlated with stock market activity? , 2008, Hypertext.

[3]  Xiaohui Yu,et al.  ARSA: a sentiment-aware model for predicting sales performance using blogs , 2007, SIGIR.

[4]  James Pustejovsky,et al.  Introduction to the special issue on temporal information processing , 2004, TALIP.

[5]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[6]  Keiji Yamada,et al.  Experiments on Personal Opinion Expression and Consensus Building using “Future Chronicle” , 2008, 2008 Second International Symposium on Universal Communication.

[7]  Johan Bollen,et al.  Between Conjecture and Memento: Shaping A Collective Emotional Perception of the Future , 2008, AAAI Spring Symposium: Emotion, Personality, and Social Behavior.

[8]  Ramanathan V. Guha,et al.  The predictive power of online chatter , 2005, KDD '05.

[9]  Gilad Mishne,et al.  Predicting Movie Sales from Blogger Sentiment , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[10]  R. Baeza-Yates Searching the Future , 2022 .

[11]  David E. Millard,et al.  Artequakt: Generating Tailored Biographies with Automatically Annotated Fragments from the Web , 2002, SAAKM@ECAI.

[12]  Kazuo Kunieda,et al.  ChronoSeeker: Future Opinion Extraction , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[13]  Jian Zhang,et al.  Daily Prediction of Major Stock Indices from Textual WWW Data , 1998, KDD.

[14]  Alexander Podelko Multiple Dimensions of Performance Requirements , 2007, Int. CMG Conference.

[15]  L. V. Williams,et al.  Prediction Markets , 2003 .

[16]  Iadh Ounis,et al.  ACM SIGIR workshop on mathematical/formal methods in information retrieval MF/IR 2005 , 2005, SIGF.

[17]  Adam Jatowt,et al.  Supporting analysis of future-related information in news archives and the web , 2009, JCDL '09.