Exploring structure and content on the web: extraction and integration of the semi-structured web
[1] Jiawei Han, et al. CETR: content extraction via tag ratios, 2010, WWW '10.
[2] Sunita Sarawagi, et al. Integrating Unstructured Data into Relational Databases, 2006, 22nd International Conference on Data Engineering (ICDE'06).
[3] Valter Crescenzi, et al. RoadRunner: Towards Automatic Data Extraction from Large Web Sites, 2001, VLDB.
[4] Daisy Zhe Wang, et al. WebTables: exploring the power of tables on the web, 2008, Proc. VLDB Endow.
[5] Lorenzo Blanco, et al. Flint: Google-basing the Web, 2008, EDBT '08.
[6] Donato Malerba, et al. HyLiEn: a hybrid approach to general list extraction on the web, 2011, WWW.
[7] Jiawei Han, et al. Document-topic hierarchies from document graphs, 2012, CIKM.
[8] Robert L. Grossman, et al. Mining data records in Web pages, 2003, KDD '03.
[9] Bing Liu, et al. Structured Data Extraction from the Web Based on Partial Tree Alignment, 2006, IEEE Transactions on Knowledge and Data Engineering.
[10] Rahul Gupta, et al. Answering Table Augmentation Queries from Unstructured Lists on the Web, 2009, Proc. VLDB Endow.
[11] Jiawei Han. Construction of Web-Based, Service-Oriented Information Networks: A Data Mining Perspective - (Abstract), 2012, WAIM.
[12] Jayant Madhavan, et al. Harvesting relational tables from lists on the web, 2009, The VLDB Journal.
[13] Jiawei Han, et al. Building enriched web page representations using link paths, 2012, HT '12.