Proceedings of the 17th International Conference on Information Quality, IQ 2012, Paris, France, November 16-17, 2012

Master data management (MDM) provides access to consistent views of an organization's most important data, also referred to as master data. In addition to technical issues, there are many organizational issues related to MDM and its implementation. However, current academic literature lacks empirical studies on the organizational challenges influencing MDM initiatives. Consequently, this paper studies organizational issues in establishing a master data management function in an organization. Data collection is conducted by participatory observations of a year-long MDM project. Reflecting the findings against the literature shows that several new issues have emerged. These indicate that the implementation of MDM is also affected by the organization's ability to identify data owners and associate them with appropriate roles and responsibilities, and to create a unified understanding of the key terms and concepts regarding MDM. The importance of communication is also emphasized.


[280]  Shai Ben-David,et al.  Modeling and Querying Possible Repairs in Duplicate Detection , 2009, Proc. VLDB Endow..

[281]  John A. Hoxmeier Typology of database quality factors , 1998, Software Quality Journal.

[282]  James M. Lang The Benefits of Making It Harder to Learn. , 2012 .

[283]  Peter Fankhauser,et al.  A Precise Blocking Method for Record Linkage , 2005, DaWaK.

[284]  Volker Gruhn,et al.  Complexity Metrics for business Process Models , 2006, BIS.

[285]  Harri Haapasalo,et al.  Managing One Master Data - Challenges and Preconditions , 2010, Ind. Manag. Data Syst..

[286]  Knut Hinkelmann,et al.  C2ST: A QUALITY FRAMEWORK TO EVALUATE E-GOVERNMENT SERVICE DELIVERY , 2009 .

[287]  Carlo Batini,et al.  Methodologies for data quality assessment and improvement , 2009, CSUR.

[288]  Christer Carlsson,et al.  Past, present, and future of decision support technology , 2002, Decis. Support Syst..

[289]  David J. Hand,et al.  How to lie with bad data , 2005 .

[290]  Graeme Shanks,et al.  A Semiotic Information Quality Framework , 2004 .

[291]  Barbara D. Klein User Perceptions of Data Quality: Internet and Traditional Text Sources , 2001, J. Comput. Inf. Syst..

[292]  Hendrik Decker Causes of the Violation of Integrity Constraints for Supporting the Quality of Databases , 2011, ICCSA.

[293]  Wei Zhang,et al.  A Framework for Corporate Householding , 2002, ICIQ.

[294]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[295]  Diane M. Strong,et al.  Teaching Information Quality in Information Systems Undergraduate Education , 1999, Informing Sci. Int. J. an Emerg. Transdiscipl..

[296]  Anette Weisbecker,et al.  Master Data Management: Products and Research , 2009, ICIQ.

[297]  Ruth Sara Aguilar-Savén,et al.  Business process modelling: Review and framework , 2004 .

[298]  Enrico Motta,et al.  A framework for evaluating semantic metadata , 2007, K-CAP '07.

[299]  Fred D. Davis,et al.  User Acceptance of Computer Technology: A Comparison of Two Theoretical Models , 1989 .

[300]  William J. Doll,et al.  The measurement of end-user computing satisfaction: theoretical and methodological issues , 1991 .

[301]  Fabio Stella,et al.  Dependency Discovery in Data Quality , 2010, CAiSE.

[302]  Felix Naumann,et al.  Assessment Methods for Information Quality Criteria , 2000, IQ.

[303]  Benedetto Intrigila,et al.  UPMurphi: A Tool for Universal Planning on PDDL+ Problems , 2009, ICAPS.

[304]  Peter Christen,et al.  Towards Scalable Real-Time Entity Resolution using a Similarity-Aware Inverted Index Approach , 2008, AusDM.

[305]  Jeffrey Parsons,et al.  Effects of Local Versus Global Schema Diagrams on Verification and Communication in Conceptual Data Modeling , 2002, J. Manag. Inf. Syst..

[306]  Alistair Miles,et al.  SKOS: Simple Knowledge Organisation for the Web , 2007 .

[307]  Ying Su,et al.  A Methodology For Information Quality Assessment In The Designing And Manufacturing Processes Of Mechanical Products , 2004, ICIQ.

[308]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[309]  Edward Curry,et al.  Leveraging matching dependencies for guided user feedback in linked data applications , 2012, IIWeb '12.

[310]  Marsha Ann Tate,et al.  Web Wisdom: How To Evaluate and Create Information Quality on the Web , 1999 .

[311]  Josep-Lluís Larriba-Pey,et al.  On the Use of Semantic Blocking Techniques for Data Cleansing and Integration , 2007, 11th International Database Engineering and Applications Symposium (IDEAS 2007).

[312]  Andreas Thor,et al.  Multi-pass sorted neighborhood blocking with MapReduce , 2012, Computer Science - Research and Development.

[313]  David Loshin,et al.  The Practitioner's Guide to Data Quality Improvement , 2010 .

[314]  M. Do Fast approximation of Kullback-Leibler distance for dependence trees and hidden Markov models , 2003, IEEE Signal Processing Letters.

[315]  Barry W. Boehm,et al.  Understanding and Controlling Software Costs , 1988, IEEE Trans. Software Eng..

[316]  Andy Koronios,et al.  Towards a Capability Maturity Model for Information Quality Management: A TDQM Approach , 2006, ICIQ.

[317]  Peter Christen,et al.  A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication , 2012, IEEE Transactions on Knowledge and Data Engineering.

[318]  Ephraim R. McLean,et al.  Information Systems Success: The Quest for the Independent Variables , 1992, J. Manag. Inf. Syst..

[319]  Felix Naumann,et al.  An Introduction to Duplicate Detection , 2010, An Introduction to Duplicate Detection.

[320]  Paul C. Tang,et al.  Position Paper: AMIA Advocates National Health Information System in Fight Against National Health Threats , 2002, J. Am. Medical Informatics Assoc..

[321]  Elena Console,et al.  Data Fusion , 2009, Encyclopedia of Database Systems.

[322]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[323]  Anders Haug,et al.  Barriers to master data quality , 2011, J. Enterp. Inf. Manag..

[324]  S. Kemmis,et al.  Participatory Action Research: Communicative Action and the Public Sphere. , 2005 .

[325]  James D. McKeen,et al.  Developments in Practice XXX: Master Data Management: Salvation Or Snake Oil? , 2008, Commun. Assoc. Inf. Syst..

[326]  Steve Tuck Is MDM the route to the Holy Grail? , 2008 .

[327]  Susan Gauch,et al.  Incorporating quality metrics in centralized/distributed information retrieval on the World Wide Web , 2000, SIGIR '00.

[328]  Vassilis Moustakis,et al.  Website Quality Assessment Criteria , 2004, ICIQ.

[329]  Jennifer Widom,et al.  Lineage tracing in data warehouses , 2001 .

[330]  Audun Jøsang,et al.  A survey of trust and reputation systems for online service provision , 2007, Decis. Support Syst..