Extracting Object-relevant Data from Websites
暂无分享,去创建一个
[1] Andrew Tomkins,et al. Mining and knowledge discovery from the Web , 2004, 7th International Symposium on Parallel Architectures, Algorithms and Networks, 2004. Proceedings..
[2] Alberto O. Mendelzon,et al. WebOQL: restructuring documents, databases, and webs , 1999 .
[3] Hasan Davulcu,et al. METEOR: metadata and instance extraction from object referral lists on the web , 2005, WWW '05.
[4] Wei-Ying Ma,et al. VIPS: a Vision-based Page Segmentation Algorithm , 2003 .
[5] William W. Cohen,et al. A flexible learning system for wrapping tables and lists in HTML documents , 2002, WWW.
[6] Hector Garcia-Molina,et al. Semistructured Data: The Tsimmis Experience , 1997, ADBIS.
[7] Berthier A. Ribeiro-Neto,et al. A brief survey of web data extraction tools , 2002, SGMD.
[8] Wen-Syan Li,et al. Constructing multi-granular and topic-focused web site maps , 2001, WWW '01.
[9] Divyakant Agrawal,et al. Retrieving and organizing web pages by “information unit” , 2001, WWW '01.
[10] Ming-Syan Chen,et al. Mining Web informative structures and contents based on entropy analysis , 2004, IEEE Transactions on Knowledge and Data Engineering.
[11] Wai Lam,et al. Adapting Web information extraction knowledge via mining site-invariant and site-dependent features , 2007, TOIT.
[12] Valter Crescenzi,et al. RoadRunner: Towards Automatic Data Extraction from Large Web Sites , 2001, VLDB.
[13] Nicholas Kushmerick,et al. Wrapper induction: Efficiency and expressiveness , 2000, Artif. Intell..
[14] Bing Liu,et al. Structured Data Extraction from the Web Based on Partial Tree Alignment , 2006, IEEE Transactions on Knowledge and Data Engineering.
[15] Chia-Hui Chang,et al. IEPAD: information extraction based on pattern discovery , 2001, WWW '01.
[16] Keishi Tajima,et al. Cut as a querying unit for WWW, Netnews, and E-mail , 1998, HYPERTEXT '98.
[17] Wei-Ying Ma,et al. Web object retrieval , 2007, WWW '07.
[18] Wei-Ying Ma,et al. Simultaneous record detection and attribute labeling in web data extraction , 2006, KDD '06.
[19] Denilson Barbosa,et al. Adaptive record extraction from web pages , 2007, WWW '07.
[20] Craig A. Knoblock,et al. Hierarchical Wrapper Induction for Semistructured Information Sources , 2004, Autonomous Agents and Multi-Agent Systems.
[21] Hector Garcia-Molina,et al. Extracting structured data from Web pages , 2003, SIGMOD '03.
[22] Robert L. Grossman,et al. Mining data records in Web pages , 2003, KDD '03.
[23] Sriram Raghavan,et al. Navigating the intranet with high precision , 2007, WWW '07.