Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi

In this paper, we put forward a strategy that supplements Hindi WordNet entries with information on the temporality of its word senses. Each synset of Hindi WordNet is automatically annotated to one of the five dimensions: past, present, future, neutral and atemporal. We use semi-supervised learning strategy to build temporal classifiers over the glosses of manually selected initial seed synsets. The classification process is iterated based on the repetitive confidence based expansion strategy of the initial seed list until cross-validation accuracy drops. The resource is unique in its nature as, to the best of our knowledge, still no such resource is available for Hindi.

[1]  Michael Wilson MRC Psycholinguistic Database , 2001 .

[2]  James Pustejovsky,et al.  SemEval-2007 Task 15: TempEval Temporal Relation Identification , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[3]  Miriam J. Metzger Making sense of credibility on the Web: Models for evaluating online information and recommendations for future research , 2007, J. Assoc. Inf. Sci. Technol..

[4]  Roi Blanco,et al.  Ranking related news predictions , 2011, SIGIR.

[5]  Srikanta J. Bedathur,et al.  Index maintenance for time-travel text search , 2012, SIGIR '12.

[6]  Yann Mathet,et al.  Propagation Strategies for Building Temporal Ontologies , 2014, EACL.

[7]  Yann Mathet,et al.  TempoWordNet for sentence time tagging , 2014, WWW '14 Companion.

[8]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[9]  Andrea Esuli,et al.  Determining the semantic orientation of terms through gloss analysis , 2005, CIKM 2005.

[10]  Susan T. Dumais,et al.  Understanding temporal query dynamics , 2011, WSDM '11.

[11]  Roi Blanco,et al.  Overview of NTCIR-11 Temporal Information Access (Temporalia) Task , 2014, NTCIR.

[12]  Gözde Özbal,et al.  A Comparison of Unsupervised Methods to Associate Colors with Words , 2011, ACII.

[13]  Andrea Esuli,et al.  Determining the semantic orientation of terms through gloss classification , 2005, CIKM '05.

[14]  Ricardo Campos,et al.  Survey of Temporal Information Retrieval and Related Applications , 2014, ACM Comput. Surv..

[15]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.