Coreference for Learning to Extract Relations: Yes Virginia, Coreference Matters

As an alternative to requiring substantial supervised relation training data, many have explored bootstrapping relation extraction from a few seed examples. Most techniques assume that the examples are based on easily spotted anchors, e.g., names or dates. Sentences in a corpus that contain the anchors are then used to induce alternative ways of expressing the relation. We explore whether coreference can improve the learning process. That is, if the algorithm also considered examples such as "his sister", would accuracy improve? With coreference, we see on average a two-fold increase in F-score. Despite using potentially error-prone machine coreference, we see a significant increase in recall on all relations. Precision increases in four cases and decreases in six.
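The idea of inducing patterns from seed anchors, with and without coreference, can be illustrated with a minimal sketch. This is not the authors' system: the function name `induce_patterns`, the `<X>`/`<Y>` slot notation, the toy sentences, and the hand-written `coref` mapping (standing in for the output of a coreference resolver) are all assumptions made for illustration.

```python
import re
from collections import Counter

def induce_patterns(sentences, seeds, coref=None):
    """Count sentence templates in which both seed arguments appear.

    coref (hypothetical format): {sentence_index: {mention: entity_name}}.
    """
    patterns = Counter()
    for i, sent in enumerate(sentences):
        mentions = coref.get(i, {}) if coref else {}
        for x, y in seeds:
            slots = {x: "<X>", y: "<Y>"}
            # Replace exact name matches (the easily spotted anchors).
            text = sent.replace(x, "<X>").replace(y, "<Y>")
            # Replace coreferent mentions that resolve to a seed argument.
            for mention, entity in mentions.items():
                if entity in slots:
                    text = re.sub(r"\b" + re.escape(mention) + r"\b",
                                  slots[entity], text, count=1)
            # Keep the template only if both arguments were found.
            if "<X>" in text and "<Y>" in text:
                patterns[text] += 1
    return patterns

# Toy data, invented for this sketch.
sentences = [
    "Venus Williams is the sister of Serena Williams.",
    "She is the older sibling of Serena Williams.",
    "Serena Williams won the final.",
]
seeds = [("Venus Williams", "Serena Williams")]
coref = {1: {"She": "Venus Williams"}}  # assumed resolver output

names_only = induce_patterns(sentences, seeds)
with_coref = induce_patterns(sentences, seeds, coref)
print(sorted(names_only))
print(sorted(with_coref))
```

With names alone, only the first sentence yields a pattern. Once "She" is resolved to a seed argument, the second sentence contributes an additional way of expressing the relation, which is the kind of recall gain the abstract reports. A wrong resolution would just as easily add a spurious pattern, which is consistent with precision falling for some relations.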
