Learning-based pronoun resolution for Turkish with a comparative evaluation

The aim of this paper is twofold. On the one hand, it attempts to explore several machine learning models for pronoun resolution in Turkish, a language not sufficiently studied with respect to anaphora resolution and rarely being subjected to machine learning experiments. On the other hand, this paper offers an evaluation of the classification performances of the learning models in order to gain insight into the question of how to match a model to the task at hand. In addition to the expected observation that each model should be tuned to an optimum level of expressive power so as to avoid underfitting and overfitting, the results also suggest that non-linear models properly tuned to avoid overfitting outperform linear ones when applied to the data used in our experiments.

[1]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT.

[2]  D. Wolpert,et al.  No Free Lunch Theorems for Search , 1995 .

[3]  N. Cocchiarella,et al.  Situations and Attitudes. , 1986 .

[4]  Scott Weinstein,et al.  Providing a Unified Account of Definite Noun Phrases in Discourse , 1983, ACL.

[5]  T. Ho,et al.  Data Complexity in Pattern Recognition , 2006 .

[6]  Andrew Kehler,et al.  Coherence, reference, and the theory of grammar , 2002, CSLI lecture notes series.

[7]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[8]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[9]  Walter Daelemans,et al.  Combined Optimization of Feature Selection and Algorithm Parameters in Machine Learning of Language , 2003, ECML.

[10]  Eser Erguvanlı Taylan Pronominal versus Zero Representation of Anaphora in Turkish , 1986 .

[11]  Varol Akman,et al.  Situated Processing of Pronominal Anaphora , 1994 .

[12]  Ruslan Mitkov,et al.  Towards a more consistent and comprehensive evaluation of anaphora resolution algorithms and systems , 2001, Appl. Artif. Intell..

[13]  Ann Banfield,et al.  Unspeakable Sentences : Narration and Representation in the Language of Fiction , 1982 .

[14]  François Trouilleux A Rule-based Pronoun Resolution System for French , 2002 .

[15]  Breck Baldwin,et al.  CogNIAC: high precision coreference with limited knowledge and linguistic resources , 1997 .

[16]  Claire Gardent,et al.  Improving Machine Learning Approaches to Coreference Resolution , 2002, ACL.

[17]  Branimir Boguraev,et al.  Anaphora for Everyone: Pronominal Anaphora Resolution without a Parser , 1996, COLING.

[18]  Wendy G. Lehnert,et al.  Using Decision Trees for Coreference Resolution , 1995, IJCAI.

[19]  Erdem Uçar,et al.  Automatic Acquisition of Subcategorization Frames for Turkish with Purely Statistical Methods , 2007 .

[20]  R. Mitkov ANAPHORA RESOLUTION: THE STATE OF THE ART , 2007 .

[21]  David Fisher,et al.  Description of the UMass system as used for MUC-6 , 1995, MUC.

[22]  Walter Daelemans,et al.  Comparing Learning Approaches to Coreference Resolution. There is More to it Than 'Bias' , 2005, ICML 2005.

[23]  Yilmaz Kiliçaslan,et al.  Syntax of information structure in Turkish , 2004 .

[24]  Dan I. Slobin,et al.  Studies in Turkish Linguistics , 1986 .

[25]  Shalom Lappin Anaphora Processing: Linguistic, Cognitive, and Computational Modelling , 2005 .

[26]  Yilmaz Kiliçaslan,et al.  A Computational Model for Resolving Pronominal Anaphora in Turkish Using Hobbs' Naïve Algorithm , 2005, WEC.

[27]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[28]  Scott Bennett,et al.  Evaluating Automated and Manual Acquisition of Anaphora Resolution Strategies , 1995, ACL.

[29]  Jaime G. Carbonell,et al.  Anaphora Resolution: A Multi-Strategy Approach , 1988, COLING.

[30]  Jerry R. Hobbs Resolving pronoun references , 1986 .

[31]  Wendy G. Lehnert,et al.  A trainable approach to coreference resolution for information extraction , 1996 .

[32]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[33]  Shalom Lappin,et al.  An Algorithm for Pronominal Anaphora Resolution , 1994, CL.

[34]  Joe F. Zhou,et al.  Proceedings of the 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, : 21-22 June 1999, University of Maryland, College Park, MD, USA , 1999 .

[35]  Veronique Hoste,et al.  Optimization issues in machine learning of coreference resolution , 2005 .

[36]  Walter Daelemans,et al.  Evaluation of Machine Learning Methods for Natural Language Processing Tasks , 2002, LREC.

[37]  Walter Daelemans,et al.  GAMBL, genetic algorithm optimization of memory-based WSD , 2004, SENSEVAL@ACL.

[38]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[39]  Ruslan Mitkov,et al.  Pronoun resolution: The practical alternative , 2000 .

[40]  Walter Daelemans,et al.  Combined Optimization of Feature Selection and Algorithm Parameter Interaction in Machine Learning of Language , 2003 .

[41]  Keith Devlin,et al.  Jon Barwise's Papers on Natural Language Semantics , 2004, Bulletin of Symbolic Logic.

[42]  M. Mcluhan Understanding Media: The Extensions of Man , 1964 .

[43]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[44]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[45]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[46]  Cem Bozsahin,et al.  Contextually appropriate reference generation , 2002, Natural Language Engineering.

[47]  Savas Yildirim,et al.  A Computational Model for Anaphora Resolution in Turkish via Centering Theory: an Initial Approach , 2004, International Conference on Computational Intelligence.

[48]  Michael Strube,et al.  The Influence of Minimum Edit Distance on Reference Resolution , 2002, EMNLP.

[49]  Jian Su,et al.  Coreference Resolution Using Competition Learning Approach , 2003, ACL.

[50]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[51]  Claire Cardie,et al.  Noun Phrase Coreference as Clustering , 1999, EMNLP.

[52]  Mürvet Enç Topic Switching and Pronominal Subjects in Turkish , 1986 .

[53]  Noam Chomsky,et al.  Lectures on Government and Binding , 1981 .

[54]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[55]  Walter Daelemans,et al.  Parameter optimization for machine-learning of word sense disambiguation , 2002, Natural Language Engineering.

[56]  Graeme Hirst,et al.  Anaphora in Natural Language Understanding: A Survey , 1981, Lecture Notes in Computer Science.

[57]  Balthasar Bickel,et al.  Referential Density in Discourse and Syntactic Typology , 2003 .

[58]  Savas Yildirim,et al.  A Machine Learning Approach to Personal Pronoun Resolution in Turkish , 2007, FLAIRS.

[59]  Hwee Tou Ng,et al.  A Machine Learning Approach to Coreference Resolution of Noun Phrases , 2001, CL.

[60]  V. Vapnik Pattern recognition using generalized portrait method , 1963 .

[61]  U. Turan Null vs. Overt Subjects in Turkish Discourse: A Centering Analysis , 1995 .

[62]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .