Automatic extraction of top-k lists from the web
暂无分享,去创建一个
Haixun Wang | Hongsong Li | Zhixian Zhang | Kenny Q. Zhu | Haixun Wang | Hongsong Li | Zhixian Zhang
[1] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..
[2] Haixun Wang,et al. A system for extracting top-K lists from the web , 2012, KDD.
[3] Chia-Hui Chang,et al. IEPAD: information extraction based on pattern discovery , 2001, WWW '01.
[4] Haixun Wang,et al. Short Text Conceptualization Using a Probabilistic Knowledgebase , 2011, IJCAI.
[5] Daisy Zhe Wang,et al. WebTables: exploring the power of tables on the web , 2008, Proc. VLDB Endow..
[6] Donato Malerba,et al. Extracting general lists from web documents: a hybrid approach , 2011, IEA/AIE'11.
[7] Chao Liu,et al. FACTO: a fact lookup engine based on web tables , 2011, WWW.
[8] Haixun Wang,et al. Understanding Tables on the Web , 2012, ER.
[9] David Walker,et al. From dirt to shovels: fully automatic tool generation from ad hoc data , 2008, POPL '08.
[10] Louise E. Moser,et al. Extracting data records from the web using tag path clustering , 2009, WWW '09.
[11] Surajit Chaudhuri,et al. Ranking objects based on relationships and fixed associations , 2009, EDBT '09.
[12] Christopher D. Manning,et al. Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.
[13] Marco Porta,et al. Cascading Style Sheets - Level 2 , 2012 .
[14] Robert L. Grossman,et al. Mining data records in Web pages , 2003, KDD '03.
[15] Dan Klein,et al. Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.
[16] Wolfgang Gatterbauer,et al. Towards domain-independent information extraction from web tables , 2007, WWW '07.
[17] Sudipto Guha,et al. Ad-hoc aggregations of ranked lists in the presence of hierarchies , 2008, SIGMOD Conference.
[18] Donato Malerba,et al. Unexpected results in automatic list extraction on the web , 2011, SKDD.
[19] Beatrice Santorini,et al. Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd Revision) , 1990 .
[20] Christian S. Jensen,et al. Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives , 2012, TOIS.
[21] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[22] Sachio Hirokawa,et al. Testbed for information extraction from deep web , 2004, WWW Alt. '04.
[23] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.
[24] Haixun Wang,et al. Probase: a probabilistic taxonomy for text understanding , 2012, SIGMOD Conference.
[25] Evgeniy Gabrilovich,et al. Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge , 2006, AAAI.
[26] Moni Naor,et al. Optimal aggregation algorithms for middleware , 2001, PODS '01.
[27] José M. Vidal,et al. Cascading style sheets , 1997, World Wide Web J..