Finding and Extracting Data Records from Web Pages
暂无分享,去创建一个
Fidel Cacheda | Alberto Pan | Manuel Álvarez | Juan Raposo | Fernando Bellas | Fidel Cacheda | M. Álvarez | F. Bellas | A. Pan | J. Raposo
[1] Bing Liu,et al. Extracting Web Data Using Instance-Based Learning , 2005, World Wide Web.
[2] Bing Liu,et al. Structured Data Extraction from the Web Based on Partial Tree Alignment , 2006, IEEE Transactions on Knowledge and Data Engineering.
[3] Chun-Nan Hsu,et al. Generating Finite-State Transducers for Semi-Structured Data Extraction from the Web , 1998, Inf. Syst..
[4] Georg Gottlob,et al. Visual Web Information Extraction with Lixto , 2001, VLDB.
[5] Valter Crescenzi,et al. Automatic annotation of data extracted from large Web sites , 2003, WebDB.
[6] Hector Garcia-Molina,et al. Semistructured Data: The Tsimmis Experience , 1997, ADBIS.
[7] Sriram Raghavan,et al. Crawling the Hidden Web , 2001, VLDB.
[8] Berthier A. Ribeiro-Neto,et al. A brief survey of web data extraction tools , 2002, SGMD.
[9] Victor Carneiro,et al. Crawling the Content Hidden Behind Web Forms , 2007, ICCSA.
[10] C. Notredame,et al. Recent progress in multiple sequence alignment: a survey. , 2002, Pharmacogenomics.
[11] Craig A. Knoblock,et al. Hierarchical Wrapper Induction for Semistructured Information Sources , 2004, Autonomous Agents and Multi-Agent Systems.
[12] Nicholas Kushmerick,et al. Wrapper Induction for Information Extraction , 1997, IJCAI.
[13] Soon Ae Chun,et al. Semantic deep web: automatic attribute extraction from the deep web data sources , 2007, SAC '07.
[14] Valter Crescenzi,et al. Clustering Web pages based on their structure , 2005, Data Knowl. Eng..
[15] Gaston H. Gonnet,et al. New Indices for Text: Pat Trees and Pat Arrays , 1992, Information Retrieval: Data Structures & Algorithms.
[16] Alberto Pan,et al. Automatically maintaining wrappers for semi-structured web sources , 2007, Data Knowl. Eng..
[17] Sourav S. Bhowmick,et al. HW-STALKER: A machine learning-based system for transforming QURE-Pagelets to XML , 2005, Data Knowl. Eng..
[18] Chia-Hui Chang,et al. IEPAD: information extraction based on pattern discovery , 2001, WWW '01.
[19] Vladimir I. Levenshtein,et al. Binary codes capable of correcting deletions, insertions, and reversals , 1965 .
[20] Frederick H. Lochovsky,et al. Data extraction and label assignment for web databases , 2003, WWW '03.
[21] David R. Karger,et al. Thresher: automating the unwrapping of semantic content from the World Wide Web , 2005, WWW '05.
[22] Tobias Dönz. Extracting Structured Data from Web Pages , 2003 .
[23] Valter Crescenzi,et al. RoadRunner: Towards Automatic Data Extraction from Large Web Sites , 2001, VLDB.
[24] Surithong Srisa‐ard,et al. Mining the Web: Discovering Knowledge from Hypertext Data , 2003 .
[25] Ángel Viña,et al. Semi-Automatic Wrapper Generation for Commercial Web Sources , 2002, Engineering Information Systems in the Internet Context.
[26] Arnaud Sahuguet,et al. Building intelligent Web applications using lightweight wrappers , 2001, Data Knowl. Eng..
[27] Alberto Pan,et al. Automatically maintaining wrappers for Web sources , 2005, 9th International Database Engineering & Application Symposium (IDEAS'05).
[28] Chaomei Chen,et al. Mining the Web: Discovering knowledge from hypertext data , 2004, J. Assoc. Inf. Sci. Technol..
[29] Thomas Kistler,et al. WebL - A Programming Language for the Web , 1998, Comput. Networks.
[30] David W. Embley,et al. On the Automatic Extraction of Data from the Hidden Web , 2001, ER.