Paper Information - Combining Multiple Sources of Evidence in Web Information Extraction

Combining Multiple Sources of Evidence in Web Information Extraction

Extraction of meaningful content from collections of web pages with unknown structure is a challenging task, which can only be successfully accomplished by exploiting multiple heterogeneous resources. In the Ex information extraction tool, so-called extraction ontologies are used by human designers to specify the domain semantics, to manually provide extraction evidence, as well as to define extraction subtasks to be carried out via trainable classifiers. Elements of an extraction ontology can be endowed with probability estimates, which are used for selection and ranking of attribute and instance candidates to be extracted. At the same time, HTML formatting regularities are locally exploited.

Vojtech Svátek | Martin Labský
