Entity-Oriented Search

dbo:abstract The first lines of the Wikipedia article Categories dc:subject Wikipedia categories assigned to the article Disambiguation dbo:wikiPageDisambiguates Disambiguation links External links dbo:wikiPageExternalLink Links to external web pages Geo-coordinates georss:point Geographical coordinates Homepage foaf:homepage Link to the official homepage of an instance Image foaf:depiction Link to the first image on the Wikipedia page Label rdfs:label The page title of the Wikipedia article Page links dbo:wikiPageWikiLink Links to other Wikipedia articles Redirect dbo:wikiPageRedirects Wikipedia page to redirect to See Table 2.4 for the URI prefixes them deviate further from the regular extractors in that they aggregate data from all Wikipedia pages as opposed to operating on a single article. The resulting datasets include grammatical gender (for entities of type person), lexicalizations (alternative names for entities and concepts), topic signatures (strongest related terms), and thematic concepts (the main subject entities/concepts for Wikipedia categories). 2.3.2.3 Datasets and Resources The output of each DBpedia extractor, for each language, is made available as a separate dataset. All datasets are provided in two serializations: as Turtle (N-triples) and as Turtle quads (N-Quads, which include context). The datasets can be divided into the following categories: • DBpedia Ontology: The latest version of the ontology that was used while extracting all datasets. • Core datasets: All infobox-based and specific feature extractors (including the ones listed in Table 2.3) belong here. • Links to other datasets: DBpedia is interlinked with a large number of knowledge bases. The datasets in this group provide links to external resources both on the instance level (owl:sameAs), e.g., to Freebase and YAGO, and on the schema level (owl:equivalentClass and owl:equivalentProperty), most notably to schema.org. • NLP datasets: This last group corresponds to the output of the statistical extractors. Namespaces and Internationalization The generic DBpedia URI namespaces are listed in the upper block of Table 2.4. As part of the internationalization efforts, some datasets are available both in localized and in canonicalized version.

[1]  Ramanathan V. Guha,et al.  Semantic search , 2003, WWW '03.

[2]  Surajit Chaudhuri,et al.  InfoGather: entity augmentation and attribute discovery by holistic matching with web tables , 2012, SIGMOD Conference.

[3]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[4]  Andrew Trotman,et al.  Overview of the INEX 2010 Link the Wiki Track , 2010, INEX.

[5]  Xiaolong Wang,et al.  Modeling Mention, Context and Entity with Neural Networks for Entity Disambiguation , 2015, IJCAI.

[6]  Gianluca Demartini,et al.  Overview of the INEX 2008 Entity Ranking Track , 2009, INEX.

[7]  Krisztian Balog,et al.  Anticipating Information Needs Based on Check-in Activity , 2017, WSDM.

[8]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[9]  G. Prasad LEARNING TO LINK ENTITIES WITH KNOWLEDGE BASE , 2016 .

[10]  Krisztian Balog,et al.  Exploiting Entity Linking in Queries for Entity Retrieval , 2016, ICTIR.

[11]  Gianluca DemartiniClaudiu Why finding entities in Wikipedia is difficult, sometimes , 2010 .

[12]  Peter Mika,et al.  Entity Search Evaluation over Structured Web Data , 2011 .

[13]  Vincent Ng,et al.  Supervised Noun Phrase Coreference Research: The First Fifteen Years , 2010, ACL.

[14]  Mounia Lalmas,et al.  Overview of the INEX 2007 Entity Ranking Track , 2008, INEX.

[15]  Lise Getoor,et al.  Entity resolution in geospatial data integration , 2006, GIS '06.

[16]  Ganesh Ramakrishnan,et al.  Compressed data structures for annotated web search , 2012, WWW.

[17]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[18]  M. de Rijke,et al.  Learning to Explain Entity Relationships in Knowledge Graphs , 2015, ACL.

[19]  Stefan Dietze,et al.  Improving Entity Retrieval on Structured Data , 2015, SEMWEB.

[20]  Satoshi Sekine,et al.  Extended Named Entity Ontology with Attribute Information , 2008, LREC.

[21]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[22]  M. de Rijke,et al.  Example Based Entity Search in the Web of Data , 2013, ECIR.

[23]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[24]  Gerhard Weikum,et al.  HYENA: Hierarchical Type Classification for Entity Names , 2012, COLING.

[25]  M. de Rijke,et al.  Finding similar experts , 2007, SIGIR.

[26]  Doug Downey,et al.  Local and Global Algorithms for Disambiguation to Wikipedia , 2011, ACL.

[27]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[28]  Yeye He,et al.  Keyword++ , 2010, Proc. VLDB Endow..

[29]  Mark Johnson,et al.  How the Statistical Revolution Changes (Computational) Linguistics , 2009 .

[30]  Jiawei Han,et al.  Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions , 2015, IEEE Transactions on Knowledge and Data Engineering.

[31]  Krisztian Balog,et al.  Semistructured Data Search , 2013, PROMISE Winter School.

[32]  Ian H. Witten,et al.  An effective, low-cost measure of semantic relatedness obtained from Wikipedia links , 2008 .

[33]  Ralph Grishman,et al.  Extracting Relations with Integrated Information Using Kernel Methods , 2005, ACL.

[34]  Hannah Bast,et al.  Semantic full-text search with broccoli , 2014, SIGIR.

[35]  Satoshi Sekine,et al.  Definition, Dictionaries and Tagger for Extended Named Entity Hierarchy , 2004, LREC.

[36]  Ahmed K. Elmagarmid,et al.  Duplicate Record Detection: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.

[37]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[38]  Daniel S. Weld,et al.  Autonomously semantifying wikipedia , 2007, CIKM '07.

[39]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[40]  Yeye He,et al.  Concept Expansion Using Web Tables , 2015, WWW.

[41]  Ihab F. Ilyas,et al.  Interpreting keyword queries over web knowledge bases , 2012, CIKM '12.

[42]  Kevin Chen-Chuan Chang,et al.  Towards rich query interpretation: walking back and forth for mining query templates , 2010, WWW '10.

[43]  Luo Si,et al.  Related entity finding by unified probabilistic models , 2013, World Wide Web.

[44]  William W. Cohen,et al.  A flexible learning system for wrapping tables and lists in HTML documents , 2002, WWW.

[45]  Simone Paolo Ponzetto,et al.  Ranking Entities for Web Queries Through Text and Knowledge , 2015, CIKM.

[46]  Daniel S. Weld,et al.  Open Information Extraction Using Wikipedia , 2010, ACL.

[47]  Slav Petrov,et al.  Syntactic Annotations for the Google Books NGram Corpus , 2012, ACL.

[48]  Oren Kurland,et al.  A ranking framework for entity oriented search using Markov random fields , 2012, JIWES '12.

[49]  W. Bruce Croft,et al.  A Markov random field model for term dependencies , 2005, SIGIR '05.

[50]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[51]  Aidan Hogan,et al.  ReConRank: A Scalable Ranking Method for Semantic Web Data with Context , 2006 .

[52]  Jian Su,et al.  Named Entity Recognition using an HMM-based Chunk Tagger , 2002, ACL.

[53]  Jens Lehmann,et al.  Quality assessment for Linked Data: A Survey , 2015, Semantic Web.

[54]  Rahul Gupta,et al.  Knowledge base completion via search-based question answering , 2014, WWW.

[55]  Simone Paolo Ponzetto,et al.  Knowledge-based graph document modeling , 2014, WSDM.

[56]  Christos Faloutsos,et al.  Fast Random Walk with Restart and Its Applications , 2006, Sixth International Conference on Data Mining (ICDM'06).

[57]  Wei-Ying Ma,et al.  Object-level ranking: bringing order to Web objects , 2005, WWW '05.

[58]  Paul Thomas,et al.  Overview of the TREC 2009 Entity Track , 2009, TREC.

[59]  Robinson Piramuthu,et al.  Is a picture really worth a thousand words?: - on the role of images in e-commerce , 2014, WSDM.

[60]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[61]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[62]  Jeffrey Xu Yu,et al.  Keyword Search in Relational Databases: A Survey , 2010, IEEE Data Eng. Bull..

[63]  W. Bruce Croft,et al.  Table extraction using conditional random fields , 2003, DG.O.

[64]  Gerhard Weikum,et al.  KORE: keyphrase overlap relatedness for entity disambiguation , 2012, CIKM.

[65]  Robert Parker,et al.  Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population , 2010, LREC.

[66]  Stefan Decker,et al.  Hierarchical Link Analysis for Ranking Web Data , 2010, ESWC.

[67]  Valentin Jijkoun,et al.  "More like these": growing entity classes from seeds , 2007, CIKM '07.

[68]  Peter Mika,et al.  Ad-hoc object retrieval in the web of data , 2010, WWW '10.

[69]  Andrew McCallum,et al.  Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models , 2011, ACL.

[70]  Dunja Mladenic,et al.  Query-Independent Learning to Rank for RDF Entity Search , 2012, ESWC.

[71]  Krisztian Balog,et al.  Overview of the TREC 2011 Entity Track , 2011, TREC.

[72]  Gianluca Demartini,et al.  Overview of the INEX 2009 Entity Ranking Track , 2009, INEX.

[73]  Anjan Goswami,et al.  A study on the impact of product images on user clicks for online shopping , 2011, WWW.

[74]  Jaap Kamps,et al.  Entity ranking using Wikipedia as a pivot , 2010, CIKM.

[75]  M. de Rijke,et al.  Ranking related entities: components and analyses , 2010, CIKM.

[76]  Jian Su,et al.  Exploring Various Knowledge in Relation Extraction , 2005, ACL.

[77]  M. de Rijke,et al.  Query modeling for entity search based on terms, categories, and examples , 2011, TOIS.

[78]  Yeye He,et al.  SEISA: set expansion by iterative similarity aggregation , 2011, WWW.

[79]  Maarten de Rijke,et al.  Contextual factors for finding similar experts , 2010, J. Assoc. Inf. Sci. Technol..

[80]  Roi Blanco,et al.  Effective and Efficient Entity Search in RDF Data , 2011, SEMWEB.

[81]  Rada Mihalcea,et al.  Wikify!: linking documents to encyclopedic knowledge , 2007, CIKM '07.

[82]  Peter Christen,et al.  A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication , 2012, IEEE Transactions on Knowledge and Data Engineering.

[83]  Raphaël Troncy,et al.  GERBIL: General Entity Annotator Benchmarking Framework , 2015, WWW.

[84]  Marcin Sydow,et al.  QBEES: query-by-example entity search in semantic knowledge graphs based on maximal aspects, diversity-awareness and relaxation , 2017, Journal of Intelligent Information Systems.

[85]  Ladislav Hluchý,et al.  The SemSets model for ad-hoc semantic list search , 2012, WWW.

[86]  Mark Sanderson,et al.  Test Collection Based Evaluation of Information Retrieval Systems , 2010, Found. Trends Inf. Retr..

[87]  Kevin Chen-Chuan Chang,et al.  EntityRank: Searching Entities Directly and Holistically , 2007, VLDB.

[88]  Wei Shen,et al.  LIEGE:: link entities in web lists with knowledge base , 2012, KDD.

[89]  Jaap Kamps,et al.  Exploiting the category structure of Wikipedia for entity ranking , 2013, Artif. Intell..

[90]  Valentin I. Spitkovsky,et al.  A Cross-Lingual Dictionary for English Wikipedia Concepts , 2012, LREC.

[91]  Heng Ji,et al.  RPI-BLENDER TAC-KBP2013 Knowledge Base Population System , 2013, TAC.

[92]  Ian H. Witten,et al.  An open-source toolkit for mining Wikipedia , 2013, Artif. Intell..

[93]  Sougata Mukherjea,et al.  Utilizing Resource Importance for Ranking Semantic Web Query Results , 2004, SWDB.

[94]  Cong Yu,et al.  EntityEngine: answering entity-relationship queries using shallow semantics , 2010, CIKM '10.

[95]  Jane Greenberg,et al.  Using BM25F for semantic search , 2010, SEMSEARCH '10.

[96]  Gene H. Golub,et al.  Extrapolation methods for accelerating PageRank computations , 2003, WWW '03.

[97]  Satoshi Sekine,et al.  Extended Named Entity Hierarchy , 2002, LREC.

[98]  Steffen Staab,et al.  TripleRank: Ranking Semantic Web Data by Tensor Decomposition , 2009, SEMWEB.

[99]  Kun Bai,et al.  TableSeer: automatic table metadata extraction and searching in digital libraries , 2007, JCDL '07.

[100]  Giovanni Tummarello,et al.  Effective Retrieval Model for Entity with Multi-valued Attributes: BM25MF and Beyond , 2012, EKAW.

[101]  Gianluca Demartini,et al.  Combining inverted indices and structured search for ad-hoc object retrieval , 2012, SIGIR '12.

[102]  Vagelis Hristidis,et al.  ObjectRank: Authority-Based Keyword Search in Databases , 2004, VLDB.

[103]  William W. Cohen,et al.  Improving graph-walk-based similarity with reranking: Case studies for personal information management , 2010, TOIS.

[104]  James A. Thom,et al.  Entity ranking in Wikipedia: utilising categories, links and topic difficulty prediction , 2009, Information Retrieval.

[105]  Björn Buchhold,et al.  Semantic Search on Text and Knowledge Bases , 2016, Found. Trends Inf. Retr..

[106]  Jaap Kamps,et al.  Result Diversity and Entity Ranking Experiments: Anchors, Links, Text and Wikipedia , 2009, TREC.

[107]  Leif Azzopardi,et al.  Assessing multivariate Bernoulli models for information retrieval , 2008, TOIS.

[108]  Kevin Chen-Chuan Chang,et al.  Beyond pages: supporting efficient, scalable entity search with dual-inversion index , 2010, EDBT '10.

[109]  Jian Su,et al.  Entity Linking with Effective Acronym Expansion, Instance Selection, and Topic Modeling , 2011, IJCAI.

[110]  Gerhard Paass,et al.  From names to entities using thematic context distance , 2011, CIKM '11.

[111]  Beth M. Sundheim,et al.  Overview of Results of the MUC-6 Evaluation , 1995, MUC.

[112]  Vasudeva Varma,et al.  IIIT Hyderabad at TAC 2009 , 2008, TAC.