Information discovery from semi-structured record sets on the Web
暂无分享,去创建一个
[1] Joe Marini,et al. Document Object Model , 2002, Encyclopedia of GIS.
[2] William W. Cohen,et al. Language-Independent Set Expansion of Named Entities Using the Web , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).
[3] Rajeev Motwani,et al. The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.
[4] Jr. G. Forney,et al. Viterbi Algorithm , 1973, Encyclopedia of Machine Learning.
[5] Praveen Paritosh,et al. Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.
[6] Thomas Hofmann,et al. Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..
[7] Khaled Shaalan,et al. FiVaTech: Page-Level Web Data Extraction from Template Pages , 2007, IEEE Transactions on Knowledge and Data Engineering.
[8] Daniel S. Weld,et al. Open Information Extraction Using Wikipedia , 2010, ACL.
[9] Daniel S. Weld,et al. Learning 5000 Relational Extractors , 2010, ACL.
[10] Maria Ruiz-Casado,et al. Automatic Assignment of Wikipedia Encyclopedic Entries to WordNet Synsets , 2005, AWIC.
[11] Wolfgang Gatterbauer,et al. Towards domain-independent information extraction from web tables , 2007, WWW '07.
[12] Lidong Bing,et al. Wikipedia entity expansion and attribute extraction from the web using semi-supervised learning , 2013, WSDM.
[13] Eric Crestan,et al. Web-scale table census and classification , 2011, WSDM '11.
[14] Haixun Wang,et al. Understanding Tables on the Web , 2012, ER.
[15] Estevam R. Hruschka,et al. Coupled semi-supervised learning for information extraction , 2010, WSDM '10.
[16] Gerhard Weikum,et al. YAGO: A Large Ontology from Wikipedia and WordNet , 2008, J. Web Semant..
[17] Sachio Hirokawa,et al. Testbed for information extraction from deep web , 2004, WWW Alt. '04.
[18] ChengXiang Zhai,et al. Mining term association patterns from search logs for effective query reformulation , 2008, CIKM '08.
[19] Christus,et al. A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .
[20] Simone Paolo Ponzetto,et al. Deriving a Large-Scale Taxonomy from Wikipedia , 2007, AAAI.
[21] Simone Paolo Ponzetto,et al. Large-Scale Taxonomy Mapping for Restructuring and Integrating Wikipedia , 2009, IJCAI.
[22] Qiang Hao,et al. From one tree to a forest: a unified solution for structured web data extraction , 2011, SIGIR.
[23] Maria T. Pazienza,et al. Information Extraction , 2002, Lecture Notes in Computer Science.
[24] Sunita Sarawagi,et al. Annotating and searching web tables using entities, types and relationships , 2010, Proc. VLDB Endow..
[25] Lidong Bing,et al. Normalizing web product attributes and discovering domain ontology with minimal effort , 2011, WSDM '11.
[26] David W. Embley,et al. Record-boundary discovery in Web documents , 1999, SIGMOD '99.
[27] Marius Pasca,et al. Organizing and searching the world wide web of facts -- step two: harnessing the wisdom of the crowds , 2007, WWW '07.
[28] Bing Liu,et al. Web data extraction based on partial tree alignment , 2005, WWW '05.
[29] Calton Pu,et al. XWRAP: an XML-enabled wrapper construction system for Web information sources , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).
[30] Chia-Hui Chang,et al. IEPAD: information extraction based on pattern discovery , 2001, WWW '01.
[31] Ji-Rong Wen,et al. Efficient record-level wrapper induction , 2009, CIKM.
[32] Tobias Dönz. Extracting Structured Data from Web Pages , 2003 .
[33] Gholamreza Haffari,et al. A Rate Distortion Approach for Semi-Supervised Conditional Random Fields , 2009, NIPS.
[34] Daniel S. Weld,et al. Autonomously semantifying wikipedia , 2007, CIKM '07.
[35] Doug Downey,et al. Web-scale information extraction in knowitall: (preliminary results) , 2004, WWW '04.
[36] Nicholas Kushmerick,et al. Wrapper induction: Efficiency and expressiveness , 2000, Artif. Intell..
[37] Nenghai Yu,et al. BioSnowball: automated population of Wikis , 2010, KDD '10.
[38] Vijay V. Raghavan,et al. Fully automatic wrapper generation for search engines , 2005, WWW '05.
[39] Daniel S. Weld,et al. Automatically refining the wikipedia infobox ontology , 2008, WWW.
[40] Louise E. Moser,et al. Extracting data records from the web using tag path clustering , 2009, WWW '09.
[41] Jens Lehmann,et al. DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.
[42] Wai Lam,et al. Collaborative Information Extraction and Mining from Multiple Web Documents , 2006, SDM.
[43] Wei Liu,et al. ViDE: A Vision-Based Approach for Deep Web Data Extraction , 2010, IEEE Transactions on Knowledge and Data Engineering.
[44] William W. Cohen,et al. Iterative Set Expansion of Named Entities Using the Web , 2008, 2008 Eighth IEEE International Conference on Data Mining.
[45] Gerhard Weikum,et al. SOFIE: a self-organizing framework for information extraction , 2009, WWW '09.
[46] Bing Liu,et al. Structured Data Extraction from the Web Based on Partial Tree Alignment , 2006, IEEE Transactions on Knowledge and Data Engineering.
[47] Benjamin Rey,et al. Generating query substitutions , 2006, WWW '06.
[48] Jian Hu,et al. Cross lingual text classification by mining multilingual topics from wikipedia , 2011, WSDM '11.
[49] Calton Pu,et al. A fully automated object extraction system for the World Wide Web , 2001, Proceedings 21st International Conference on Distributed Computing Systems.
[50] Frederick H. Lochovsky,et al. Data extraction and label assignment for web databases , 2003, WWW '03.
[51] Valter Crescenzi,et al. RoadRunner: Towards Automatic Data Extraction from Large Web Sites , 2001, VLDB.
[52] Yoshua Bengio,et al. Semi-supervised Learning by Entropy Minimization , 2004, CAP.
[53] Bin Zhao,et al. Max margin learning on domain-independent web information extraction , 2011, CIKM '11.
[54] Slav Petrov,et al. Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models , 2010, EMNLP.
[55] Marius Pasca,et al. Weakly-supervised discovery of named entities using web search queries , 2007, CIKM '07.
[56] See-Kiong Ng,et al. Distributional Similarity vs. PU Learning for Entity Set Expansion , 2010, ACL.
[57] Lidong Bing,et al. Towards a unified solution: data record region detection and segmentation , 2011, CIKM '11.
[58] Doug Downey,et al. KnowItNow: Fast, Scalable Information Extraction from the Web , 2005, HLT.
[59] Eric Crestan,et al. Web-Scale Distributional Similarity and Entity Set Expansion , 2009, EMNLP.
[60] David R. Karger,et al. Thresher: automating the unwrapping of semantic content from the World Wide Web , 2005, WWW '05.
[61] Robert L. Grossman,et al. Mining data records in Web pages , 2003, KDD '03.
[62] Georg Lausen,et al. ViPER: augmenting automatic information extraction with visual perceptions , 2005, CIKM '05.
[63] Andrew J. Viterbi,et al. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.
[64] Benjamin Van Durme,et al. Weakly-Supervised Acquisition of Open-Domain Classes and Class Attributes from Web Documents and Query Logs , 2008, ACL.
[65] Hua Li,et al. Enhancing text clustering by leveraging Wikipedia semantics , 2008, SIGIR '08.
[66] Weifeng Su,et al. ODE: Ontology-assisted data extraction , 2009, TODS.
[67] Ruihua Song,et al. Joint optimization of wrapper generation and template detection , 2007, KDD '07.
[68] Oren Etzioni,et al. Open Information Extraction from the Web , 2007, CACM.
[69] Lidong Bing,et al. Robust detection of semi-structured web records using a DOM structure-knowledge-driven model , 2013, TWEB.
[70] S da SilvaAltigran,et al. DEByE - Date extraction by example , 2002 .
[71] Rahul Gupta,et al. Answering Table Augmentation Queries from Unstructured Lists on the Web , 2009, Proc. VLDB Endow..
[72] Jayant Madhavan,et al. Recovering Semantics of Tables on the Web , 2011, Proc. VLDB Endow..
[73] Jens Stoye,et al. Linear time algorithms for finding and representing all the tandem repeats in a string , 2004, J. Comput. Syst. Sci..
[74] Yan Zhang,et al. Ontology enhancement and concept granularity learning: keeping yourself current and adaptive , 2011, KDD.
[75] Jorge Nocedal,et al. On the limited memory BFGS method for large scale optimization , 1989, Math. Program..
[76] Dale Schuurmans,et al. Semi-Supervised Conditional Random Fields for Improved Sequence Segmentation and Labeling , 2006, ACL.
[77] Satoshi Sekine,et al. A survey of named entity recognition and classification , 2007 .
[78] Regina Barzilay,et al. Automatically Generating Wikipedia Articles: A Structure-Aware Approach , 2009, ACL.
[79] Khaled Shaalan,et al. A Survey of Web Information Extraction Systems , 2006, IEEE Transactions on Knowledge and Data Engineering.
[80] Lidong Bing,et al. Using query log and social tagging to refine queries based on latent topics , 2011, CIKM '11.
[81] ZhaiYanhong,et al. Extracting Web Data Using Instance-Based Learning , 2007 .
[82] Alberto O. Mendelzon,et al. WebOQL: restructuring documents, databases and Webs , 1998, Proceedings 14th International Conference on Data Engineering.
[83] Chun-Nan Hsu,et al. Generating Finite-State Transducers for Semi-Structured Data Extraction from the Web , 1998, Inf. Syst..
[84] Carlotta Domeniconi,et al. Building semantic kernels for text classification using wikipedia , 2008, KDD.
[85] William W. Cohen,et al. Character-level Analysis of Semi-Structured Documents for Set Expansion , 2009, EMNLP.
[86] David W. Embley,et al. Conceptual-Model-Based Data Extraction from Multiple-Record Web Pages , 1999, Data Knowl. Eng..
[87] Patrick Pantel,et al. Entity Extraction via Ensemble Semantics , 2009, EMNLP.
[88] Yan Zhang,et al. Learning ontology resolution for document representation and its applications in text mining , 2010, CIKM '10.
[89] Thorsten Joachims,et al. Optimizing search engines using clickthrough data , 2002, KDD.
[90] Gerhard Weikum,et al. WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .
[91] Filip Radlinski,et al. Query chains: learning to rank from implicit feedback , 2005, KDD '05.
[92] Oren Etzioni,et al. Open Information Extraction: The Second Generation , 2011, IJCAI.
[93] William W. Cohen,et al. Semi-Markov Conditional Random Fields for Information Extraction , 2004, NIPS.
[94] Craig A. Knoblock,et al. Hierarchical Wrapper Induction for Semistructured Information Sources , 2004, Autonomous Agents and Multi-Agent Systems.