Semantic labeling for numerical attributes is a process of matching numerical attributes in tabular resources to properties and classes in knowledge bases. It can be used in many applications such as table search, table extension, and knowledge augmentation. One of the challenges of this tasks is to distinguish numerical attributes expressed in various scales or units of measurement. Indeed, how to distinguish the similar attributes of “human height centimeters” and “human height feet” and the dissimilar attribute “population million”. Previous studies assume the similar attributes expressed in the same scale. In fact, the similar attributes could be expressed differently since the data resource is constructed by different people in different background and context. In this paper, we propose a novel method to improve the performance of semantic labeling for numerical attributes in various scales. We use an external knowledge about unit conversion taken from Wikidata to generate more data resources for the numerical background knowledge bases (WBKB). Our empirical experiments show that using the WBKB can improve the performance of semantic labeling expressed in various scales.
[1]
Dominique Ritze,et al.
Matching HTML Tables to DBpedia
,
2015,
WIMS.
[2]
Jürgen Umbrich,et al.
Multi-level Semantic Labelling of Numerical Values
,
2016,
SEMWEB.
[3]
Meihui Zhang,et al.
InfoGather+: semantic matching and annotation of numeric and time-varying attributes in web tables
,
2013,
SIGMOD '13.
[4]
Ollivier Haemmerlé,et al.
An Ontology-Driven Annotation of Data Tables
,
2007,
WISE Workshops.
[5]
Liliana Ibanescu,et al.
Fuzzy Web Data Tables Integration Guided by an Ontological and Terminological Resource
,
2013,
IEEE Transactions on Knowledge and Data Engineering.
[6]
Gerhard Weikum,et al.
Making Sense of Entities and Quantities in Web Tables
,
2016,
CIKM.
[7]
Craig A. Knoblock,et al.
Semantic Labeling: A Domain-Independent Approach
,
2016,
SEMWEB.
[8]
Sunita Sarawagi,et al.
Open-domain quantity queries on web tables: annotation, response, and consensus models
,
2014,
KDD.
[9]
Craig A. Knoblock,et al.
Assigning Semantic Labels to Data Sources
,
2015,
ESWC.