Finding and Extracting Data Records from Web Pages
[1] Frederick H. Lochovsky, et al. Data extraction and label assignment for web databases, 2003, WWW '03.
[2] Sriram Raghavan, et al. Crawling the Hidden Web, 2001, VLDB.
[3] Chia-Hui Chang, et al. IEPAD: information extraction based on pattern discovery, 2001, WWW '01.
[4] Thomas Kistler, et al. WebL - A Programming Language for the Web, 1998, Comput. Networks.
[5] Hector Garcia-Molina, et al. Extracting structured data from Web pages, 2003, SIGMOD '03.
[6] Soumen Chakrabarti, et al. Mining the web - discovering knowledge from hypertext data, 2002.
[7] Vladimir I. Levenshtein, et al. Binary codes capable of correcting deletions, insertions, and reversals, 1965.
[8] Soon Ae Chun, et al. Semantic deep web: automatic attribute extraction from the deep web data sources, 2007, SAC '07.
[9] Valter Crescenzi, et al. RoadRunner: Towards Automatic Data Extraction from Large Web Sites, 2001, VLDB.
[10] Bing Liu, et al. Structured Data Extraction from the Web Based on Partial Tree Alignment, 2006, IEEE Transactions on Knowledge and Data Engineering.
[11] Ángel Viña, et al. Semi-Automatic Wrapper Generation for Commercial Web Sources, 2002, Engineering Information Systems in the Internet Context.
[12] Arnaud Sahuguet, et al. Building intelligent Web applications using lightweight wrappers, 2001, Data Knowl. Eng.
[13] Nicholas Kushmerick, et al. Wrapper Induction for Information Extraction, 1997, IJCAI.
[14] Georg Gottlob, et al. Visual Web Information Extraction with Lixto, 2001, VLDB.
[15] Valter Crescenzi, et al. Automatic annotation of data extracted from large Web sites, 2003, WebDB.
[16] Berthier A. Ribeiro-Neto, et al. A brief survey of web data extraction tools, 2002, SGMD.
[17] Victor Carneiro, et al. Crawling the Content Hidden Behind Web Forms, 2007, ICCSA.
[18] C. Notredame, et al. Recent progress in multiple sequence alignment: a survey, 2002, Pharmacogenomics.
[19] David R. Karger, et al. Thresher: automating the unwrapping of semantic content from the World Wide Web, 2005, WWW '05.
[20] Chun-Nan Hsu, et al. Generating Finite-State Transducers for Semi-Structured Data Extraction from the Web, 1998, Inf. Syst.
[21] Sourav S. Bhowmick, et al. HW-STALKER: A machine learning-based system for transforming QURE-Pagelets to XML, 2005, Data Knowl. Eng.
[22] Hector Garcia-Molina, et al. Semistructured Data: The Tsimmis Experience, 1997, ADBIS.
[23] David W. Embley, et al. On the Automatic Extraction of Data from the Hidden Web, 2001, ER.
[24] Alberto Pan, et al. Automatically maintaining wrappers for Web sources, 2005, 9th International Database Engineering & Application Symposium (IDEAS'05).
[25] Alberto Pan, et al. Automatically maintaining wrappers for semi-structured web sources, 2007, Data Knowl. Eng.
[26] Fidel Cacheda, et al. Finding and Extracting Data Records from Web Pages, 2007, EUC.
[27] Gaston H. Gonnet, et al. New Indices for Text: Pat Trees and Pat Arrays, 1992, Information Retrieval: Data Structures & Algorithms.
[28] Bing Liu, et al. Extracting Web Data Using Instance-Based Learning, 2005, World Wide Web.
[29] Valter Crescenzi, et al. Clustering Web pages based on their structure, 2005, Data Knowl. Eng.
[30] Craig A. Knoblock, et al. Hierarchical Wrapper Induction for Semistructured Information Sources, 2004, Autonomous Agents and Multi-Agent Systems.