It has been almost 4 years now since the world’s leading search engine operators, Bing, Google, Yahoo! and Yandex, decided to start working on an initiative to enrich web pages with structured data, known as schema.org. Since then, many web masters and those responsible for web pages started adapting this technology to enrich websites with semantic information. This paper analyzes parts of the structured data in the largest available open to the public web crawl, the Common Crawl, to find out how the hotel branch is using schema.org. On the use case of schema.org/Hotel, this paper studies who uses it, how it is applied and whether or not the classes and properties of the vocabulary are used in the syntactically and semantically correct way. Further, this paper will compare the usage based on numbers of 2013 and 2014 to find out whether or not an increase in usage can be noted. We observe a wide and growing distribution of schema.org, but also a large variety of erroneous and restricted usage of schema.org within the data set, which makes the data hard to use for real-life applications. When it comes to geographical comparison, the outcome shows that the United States are far in the lead with annotation of hotels with schema.org and Europe still has work to do to catch up.
[1]
Dieter Fensel,et al.
Improving the Online Visibility of Touristic Service Providers by Using Semantic Annotations
,
2014,
ESWC.
[2]
Hannes Werthner,et al.
Harmonise: A Step Toward an Interoperable E-Tourism Marketplace
,
2005,
Int. J. Electron. Commer..
[3]
Markus Zanker,et al.
An Automated Approach for Deriving Semantic Annotations of Tourism Products based on Geospatial Information
,
2009,
ENTER.
[4]
Nalin Sharda,et al.
Connecting Destinations with an Ontology-Based e-Tourism Planner
,
2007,
ENTER.
[5]
Christian Bizer,et al.
The WebDataCommons Microdata, RDFa and Microformat Dataset Series
,
2014,
International Semantic Web Conference.
[6]
Heiko Paulheim,et al.
Heuristics for Fixing Common Errors in Deployed schema.org Microdata
,
2015,
ESWC.
[7]
Dieter Fensel,et al.
Hotel Websites, Web 2.0, Web 3.0 and Online Direct Marketing: The Case of Austria
,
2014,
ENTER.
[8]
Ali Khalili,et al.
WYSIWYM Authoring of Structured Content Based on Schema.org
,
2013,
WISE.