Automatic Identification of Whole-Part Relations in Portuguese

In this paper, we improve the extraction of semantic relations between textual elements as it is currently performed by STRING, a hybrid statistical and rule-based Natural Language Processing chain for Portuguese, by targeting whole-part relations (meronymy), that is, a semantic relation between an entity that is perceived as a constituent part of another entity, or a member of a set. In this case, we focus on the type of meronymy involving human entities and body-part nouns. 1998 ACM Subject Classification I.2.7 Natural Language Processing

[1]  Tony Veale,et al.  An Intrinsic Information Content Metric for Semantic Similarity in WordNet , 2004, ECAI.

[2]  Ilia Markov,et al.  Integrating Verbal Idioms into an NLP System , 2014, PROPOR.

[3]  J. Hobbs,et al.  Using and Extending WordNet to Support Question- Answering , 2007 .

[4]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[5]  Jay Earley,et al.  An efficient context-free parsing algorithm , 1970, Commun. ACM.

[6]  Raquel Amaro,et al.  WordNet.PT global – Extending WordNet.PT to Portuguese varieties , 2011 .

[7]  Andrea Esuli,et al.  PageRanking WordNet Synsets: An Application to Opinion Mining , 2007, ACL.

[8]  Jorge Baptista ViPEr: A Lexicon-Grammar of European Portuguese Verbs , 2012 .

[9]  Hugo Gon,et al.  Onto.PT: Towards the Automatic Construction of a Lexical Ontology for Portuguese , 2012 .

[10]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[11]  Christiane Fellbaum,et al.  Using Wordnet for Text Retrieval , 1998 .

[12]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[13]  António Branco,et al.  LXGram: A Deep Linguistic Processing Grammar for Portuguese , 2010, PROPOR.

[14]  Lucy Vanderwende,et al.  MindNet: Acquiring and Structuring Semantic Information from Text , 1998, COLING-ACL.

[15]  C. Fellbaum An Electronic Lexical Database , 1998 .

[16]  Simone Pribbenow,et al.  Midwinters, end games, and body parts: a classification of part-whole relations , 1995, Int. J. Hum. Comput. Stud..

[17]  Peter Harremoës,et al.  Maximum Entropy Fundamentals , 2001, Entropy.

[18]  Christopher S. G. Khoo,et al.  Semantic relations in information science , 2006, Annu. Rev. Inf. Sci. Technol..

[19]  Patrick Pantel,et al.  Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations , 2006, ACL.

[20]  Jorge Baptista,et al.  Caracterização e Processamento de Expressões Temporais em Português , 2010, Linguamática.

[21]  Diana Santos,et al.  Desafios na avaliação conjunta do reconhecimento de entidades mencionadas: O Segundo HAREM , 2008 .

[22]  Ted Pedersen,et al.  An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet , 2002, CICLing.

[23]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[24]  Hugo Gonçalo Oliveira,et al.  Utilização do (analisador sintáctico) PEN para extracção de informação das definições de um dicionário , 2008 .

[25]  Sandra Collovini,et al.  A review on Relation Extraction with an eye on Portuguese , 2013, Journal of the Brazilian Computer Society.

[26]  Arul Menezes,et al.  MindNet: An Automatically-Created Lexical Resource , 2005, HLT.

[27]  Noam Chomsky,et al.  Remarks on Nominalization , 2020, Nominalization.

[28]  Palmira Marrafa,et al.  WordNet do português: uma base de dados de conhecimento linguístico , 2001 .

[29]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[30]  Paolo Rosso,et al.  Text Categorization and Information Retrieval Using WordNet Senses , 2004 .

[31]  Graeme Hirst,et al.  Ontology and the Lexicon , 2004, Handbook on Ontologies.

[32]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[33]  Andrea C. Schalley Chu-Ren Huang, Nicoletta Calzolari, Aldo Gangemi, Alessandro Lenci, Alessandro Oltramari, and Laurent Prévot (eds.): Ontology and the Lexicon: a natural language processing perspective. (Studies in Natural Language Processing.) , 2012, Lang. Resour. Evaluation.

[34]  Léxico-gramática das frases fixas do portugués europeo. Breve presentación , 2005 .

[35]  Abdellatif Rahmoun,et al.  Using WordNet for Text Categorization , 2008, Int. Arab J. Inf. Technol..

[36]  Suk Hwan Lim,et al.  Extracting and Ranking Product Features in Opinion Documents , 2010, COLING.

[37]  Laura Plaza,et al.  AUTOMATIC SUMMARIZATION OF NEWS USING WORDNET CONCEPT GRAPHS , 2010 .

[38]  Diogo Correia de Oliveira Extraction and Classification of Named Entities , 2010 .

[39]  Erick Galani Maziero,et al.  A base de dados lexical e a interface web do TeP 2.0: thesaurus eletrônico para o Português do Brasil , 2008, WebMedia.

[40]  Roberto Navigli,et al.  An analysis of ontology-based query expansion strategies , 2003 .

[41]  Nuno Seco,et al.  PAPEL: A Dictionary-Based Lexical Ontology for Portuguese , 2008, PROPOR.

[43]  Edith Bolling Anaphora Resolution , 2006 .

[44]  Eugene Charniak,et al.  Finding Parts in Very Large Corpora , 1999, ACL.

[45]  C. Maria Keet,et al.  Representing and reasoning over a taxonomy of part-whole relations , 2008, Appl. Ontology.

[46]  Nuno Seco,et al.  Noun Sense Disambiguation with WordNet for Software Design Retrieval , 2003, Canadian Conference on AI.

[47]  Gosse Bouma,et al.  On Learning Subtypes of the Part-Whole Relation: Do Not Mix Your Seeds , 2010, ACL.

[48]  Sanda M. Harabagiu,et al.  The Informative Role of WordNet in Open-Domain Question Answering , 2004, HLT-NAACL 2004.

[49]  W. Grove Statistical Methods for Rates and Proportions, 2nd ed , 1981 .

[50]  Pushpak Bhattacharyya,et al.  Generic Text Summarization Using WordNet , 2004, LREC.

[51]  Dan I. Moldovan,et al.  Automatic Discovery of Part-Whole Relations , 2006, CL.

[52]  António Branco,et al.  Out-of-the-Box Robust Parsing of Portuguese , 2010, PROPOR.

[53]  Willem Robert van Hage,et al.  A Method for Learning Part-Whole Relations , 2006, International Semantic Web Conference.

[54]  Sarabjot S. Anand,et al.  Predicting the Polarity Strength of Adjectives Using WordNet , 2009, ICWSM.

[55]  Piek Vossen,et al.  EuroWordNet: a multilingual database for information retrieval , 1997 .

[56]  Hugo Gonçalo Oliveira Onto.PT: Towards the Automatic Construction of a Lexical Ontology for Portuguese , 2013 .

[57]  Clement T. Yu,et al.  Semantic-Based Grouping of Search Engine Results Using WordNet , 2007, APWeb/WAIM.

[58]  Douglas Herrmann,et al.  A Taxonomy of Part-Whole Relations , 1987, Cogn. Sci..

[59]  Christian Leclère Sur une Restructuration Dative , 1995 .

[60]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[61]  Sébastien Paumier,et al.  De la reconnaissance des formes linguistiques à l'analyse syntaxique , 2003 .

[62]  M. Gross Les bases empiriques de la notion de predicat semantique (The Empirical Foundations of the Notion of Semantic Predicate). , 1981 .

[63]  Deen Freelon ReCal: Intercoder Reliability Calculation as a Web Service , 2010 .

[64]  Philip Resnik,et al.  Disambiguating Noun Groupings with Respect to Wordnet Senses , 1995, VLC@ACL.

[65]  Lucy Vanderwende Ambiguity in the Acquisition of Lexical Information , 1995, ArXiv.

[66]  Jorge Baptista,et al.  Frozen Sentences of Portuguese: Formal Descriptions for NLP , 2004 .

[67]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[68]  Bento Carlos Dias-da-Silva,et al.  A construção de um thesaurus eletrônico para o Português do Brasil , 2003 .

[69]  Dan I. Moldovan,et al.  Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations , 2003, NAACL.

[70]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[71]  António Branco,et al.  A Deep Linguistic Processing Grammar for Portuguese , 2009 .

[72]  Madelyn Anne Iris,et al.  Problems of the part-whole relation , 1989 .

[73]  Paulo Rocha,et al.  CETEMPúblico: Um corpus de grandes dimensões de linguagem jornalística portuguesa , 2000 .

[74]  George A. Miller,et al.  A Semantic Concordance , 1993, HLT.

[75]  Jorge Baptista,et al.  Portuguese Temporal Expressions Recognition: From TE Characterization to an Effective TER Module Implementation , 2009, 2009 Seventh Brazilian Symposium in Information and Human Language Technology.

[76]  A. Karimi,et al.  Master‟s thesis , 2011 .

[77]  Palmira Marrafa,et al.  Portuguese WordNet: general architecture and internal semantic relations , 2002 .

[78]  H. Toutenburg Fleiss, J. L.: Statistical Methods for Rates and Proportions. John Wiley & Sons, New York‐London‐Sydney‐Toronto 1973. XIII, 233 S. , 1974 .

[79]  Jean-Pierre Chanod,et al.  Robustness beyond shallowness: incremental deep parsing , 2002, Natural Language Engineering.

[80]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[81]  Eneko Agirre,et al.  A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches , 2009, NAACL.