A provenance-based approach to semantic web service description and discovery

Web services have become common, if not essential, in the areas of business-to-business integration, distributed computing, and enterprise application integration. Yet the XML-based standards for web service descriptions encode only a syntactic representation of the service input and output. The actual meaning of these terms, their formal definitions, and their relationships to other concepts are not represented. This poses challenges for leveraging web services in the development of software capabilities. As the number of services grows and the specificity of users' needs increases, the ability to find an appropriate service for a specific application is strained. In order to overcome this challenge, semantic web services were proposed. For the discovery of web services, semantic web services use ontologies to find matches between user requirements and service capabilities. The computational reasoning afforded by ontologies enables users to find categorizations that weren't explicitly defined. However, there are a number of methodological variants on semantic web service discovery. Based on e-Science, an analog to e-Business, one methodology advocates deep and detailed semantic description of a web service's inputs and outputs. Yet, this methodology predates recent advances in semantic web and provenance research, and it is unclear the extent to which it applies outside of e-Science. We explore this question through a within-subjects experiment and we extend this methodology with current research in provenance, semantic web, and web service standards, developing and empirically evaluating an integrated approach to web service description and discovery. Implications for more advanced web service discovery algorithms and user interfaces are also presented. We address limitations in semantic web service discovery.Our approach is grounded in semantic web standards and W3C provenance ontology.Our user study indicates the results extend beyond e-Science.Our user study provides insights for web service discovery applications.We have created a new publicly available provenance ontology for service discovery.

[1]  Sarah Callaghan,et al.  Opening Up Climate Research: A Linked Data Approach to Publishing Data Provenance , 2012, Int. J. Digit. Curation.

[2]  Deborah L. McGuinness,et al.  System Transparency, or How I Learned to Worry about Meaning and Love Provenance! , 2010, IPAW.

[3]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[4]  Michael Stollberg Scalable Semantic Web Service Discovery , 2009 .

[5]  Deborah L. McGuinness,et al.  PROV-O: The PROV Ontology , 2013 .

[6]  Norma Stoltz Chinchilla Distinctions , 1999 .

[7]  Elena Paslaru Bontas Simperl,et al.  Semantic web service offer discovery for e-commerce , 2008, ICEC.

[8]  Ron Weber,et al.  An Ontological Model of an Information System , 1990, IEEE Trans. Software Eng..

[9]  Susan J. Harrington,et al.  Telecommuting: a test of trust, competing values, and relative advantage , 1999 .

[10]  R. Ledesma,et al.  Cliff's Delta Calculator: A non-parametric effect size program for two groups of observations , 2010 .

[11]  Adriane Chapman,et al.  Understanding provenance black boxes , 2010, Distributed and Parallel Databases.

[12]  Deborah L. McGuinness,et al.  Linked provenance data: A semantic Web-based approach to interoperable workflow traces , 2011, Future Gener. Comput. Syst..

[13]  Deborah L. McGuinness,et al.  Semantic Provenance for Science Data Products: Application to Image Data Processing , 2009, SWPM.

[14]  Liming Chen,et al.  Supporting Provenance in Service-oriented Computing Using the Semantic Web Technologies , 2006, IEEE Intell. Informatics Bull..

[15]  Margo I. Seltzer,et al.  Provenance: a future history , 2009, OOPSLA Companion.

[16]  Daniel Deutch,et al.  Putting Lipstick on Pig: Enabling Database-style Workflow Provenance , 2011, Proc. VLDB Endow..

[17]  Carole D. Hafner,et al.  The State of the Art in Ontology Design: A Survey and Comparative Review , 1997, AI Mag..

[18]  Eleni Stroulia,et al.  Examining Usage Protocols for Service Discovery , 2006, ICSOC.

[19]  Carole A. Goble,et al.  The impact of workflow tools on data-centric research , 2009, The Fourth Paradigm.

[20]  M. Bunge Treatise on basic philosophy , 1974 .

[21]  Dave Robertson,et al.  Discovery and Uncertainty in Semantic Web Services , 2008, URSW.

[22]  E. Rogers Diffusion of Innovations , 1962 .

[23]  James A. Thom,et al.  Requirements-oriented methodology for evaluating ontologies , 2009, Inf. Syst..

[24]  Pinar Yolum,et al.  Service matchmaking revisited: An approach based on model checking , 2010, J. Web Semant..

[25]  S. Sawilowsky,et al.  Analysis of Likert scale data in disability and medical rehabilitation research. , 1998 .

[26]  Jeannette M. Wing,et al.  Specification matching of software components , 1997 .

[27]  Edgar Erdfelder,et al.  G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences , 2007, Behavior research methods.

[28]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[29]  Steffen Staab,et al.  Knowledge Processes and Ontologies , 2001, IEEE Intell. Syst..

[30]  Amit P. Sheth,et al.  Extending Semantic Provenance into the Web of Data , 2011, IEEE Internet Computing.

[31]  N. Cliff Dominance statistics: Ordinal analyses to answer ordinal questions. , 1993 .

[32]  Ron Weber,et al.  On the deep structure of information systems , 1995, Inf. Syst. J..

[33]  John E. Hunter,et al.  Methods of Meta-Analysis , 1989 .

[34]  Deborah L. McGuinness,et al.  Explaining answers from the Semantic Web: the Inference Web approach , 2004, J. Web Semant..

[35]  Carole A. Goble,et al.  A Suite of Daml+Oil Ontologies to Describe Bioinformatics Web Services and Data , 2003, Int. J. Cooperative Inf. Syst..

[36]  Mark Klein,et al.  Massachusetts Institute of Technology Abraham Bernstein University of Zurich Toward High-Precision Service Retrieval , 2022 .

[37]  John Domingue,et al.  On the Integration of Services with the Web of Data , 2010 .

[38]  James A. Hendler,et al.  Agents and the Semantic Web , 2001, IEEE Intell. Syst..

[39]  Quynh Pham Thi,et al.  A Complexity Measure for Web Service , 2009, 2009 International Conference on Knowledge and Systems Engineering.

[40]  John E. Hunter,et al.  Methods of Meta-Analysis: Correcting Error and Bias in Research Findings , 1991 .

[41]  Deborah L. McGuinness,et al.  Provenance-Based Strategies to Develop Trust in Semantic Web Applications , 2010, IPAW.

[42]  Anthony J. G. Hey,et al.  Jim Gray on eScience: a transformed scientific method , 2009, The Fourth Paradigm.

[43]  Darren George,et al.  SPSS for Windows Step by Step: A Simple Guide and Reference , 1998 .

[44]  Hsinchun Chen,et al.  Intelligence and Security Informatics , 2006, Lecture Notes in Computer Science.

[45]  Tony Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery , 2009 .

[46]  Sudha Ram,et al.  Understanding the Semantics of Data Provenance to Support Active Conceptual Modeling , 2006, Active Conceptual Modeling of Learning.

[47]  Charles R. Schwenk,et al.  Conjectures on Cognitive Simplification in Acquisition and Divestment Decision Making , 1985 .

[48]  Dov Dori,et al.  Humans, semantic services and similarity: A user study of semantic Web services matching and composition , 2011, J. Web Semant..

[49]  Paul T. Groth,et al.  The requirements of recording and using provenance in e- Science experiments , 2005 .

[50]  L. Cronbach Coefficient alpha and the internal structure of tests , 1951 .

[51]  Janet A. Sniezek,et al.  Groups under uncertainty: An examination of confidence in group decision making☆ , 1992 .

[52]  S. Chatterjee,et al.  Design Science Research in Information Systems , 2010 .

[53]  V. Vianu,et al.  Edinburgh Why and Where: A Characterization of Data Provenance , 2017 .

[54]  George M. Kasper,et al.  A Theory of Decision Support System Design for User Calibration , 1996, Inf. Syst. Res..

[55]  Matthias Klusch,et al.  Automated semantic web service discovery with OWLS-MX , 2006, AAMAS '06.

[56]  Marco Luca Sbodio,et al.  Discovering Semantic Web services using SPARQL and intelligent agents , 2010, J. Web Semant..

[57]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[58]  Erich J. Neuhold,et al.  Matchmaking for business processes based on conjunctive finite state automata , 2005, Int. J. Bus. Process. Integr. Manag..

[60]  H. B. Mann,et al.  On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other , 1947 .

[61]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[62]  N. Cliff Answering Ordinal Questions with Ordinal Data Using Ordinal Statistics. , 1996, Multivariate behavioral research.

[63]  Alan R. Hevner,et al.  Design Science in Information Systems Research , 2004, MIS Q..