A framework for information quality assessment

One of the main components in information quality (IQ) assurance is an IQ measurement model design and operationalization. One cannot manage IQ without first being able to measure it meaningfully and establishing a causal connection between the source of IQ change, the IQ problem types, the types of activities affected, and their implications. A better understanding is needed of the roots of IQ change through the development of a systematic, predictive, reusable IQ assessment framework. The framework should enable effective IQ reasoning through the disambiguation of IQ problem sources, and through the rapid and inexpensive development of context-specific IQ measurement models. Here we propose a general IQ assessment framework. In contrast to context-specific IQ assessment models, which usually focus on a few variables determined by local needs, our framework consists of comprehensive typologies of IQ problems, related activities, and a taxonomy of IQ dimensions organized in a systematic way based on sound theories and practices. The framework can be used as a knowledge resource and as a guide for developing IQ measurement models for many different settings. The framework was validated and refined by developing specific IQ measurement models for two large-scale collections of two large classes of information objects: Simple Dublin Core records and online encyclopedia articles. Some of the results of the collection-specific operationalization of the framework are reported.

[1]  Konrad Saur,et al.  IFLA Study Group on the Functional Requirements for Bibliographic Records , 1998 .

[2]  H. Garfinkel Studies in Ethnomethodology , 1968 .

[3]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[4]  Les Gasser,et al.  Assessing Information Quality of a Community-Based Encyclopedia , 2005, ICIQ.

[5]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[6]  A. Mccook Is Peer Review Broken , 2006 .

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  Linda C. Smith,et al.  INFORMATION QUALITY IN A COMMUNITY-BASED ENCYCLOPEDIA , 2005 .

[9]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[10]  J. Michell An introduction to the logic of psychological measurement , 1995 .

[11]  William B. Frakes,et al.  Information and misinformation: An investigation of the notions of information, misinformation, informing, and misinforming , 1986, Journal of the American Society for Information Science.

[12]  John M. Carroll,et al.  Making Use: Scenario-Based Design of Human-Computer Interactions , 2000 .

[13]  Sarah Currier,et al.  Building Quality Assurance into Metadata Creation: An Analysis based on the Learning Objects and e-Prints Communities of Practice , 2003, Dublin Core Conference.

[14]  Margaret Higgins,et al.  Meta-Information, and Time: Factors in Human Decision Making , 1999, J. Am. Soc. Inf. Sci..

[15]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[16]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[17]  Les Gasser,et al.  Metadata Quality For Federated Collections , 2004, ICIQ.

[18]  P. George Benson,et al.  A first course in business statistics , 1981 .

[19]  N. Roberts,et al.  Value-added processes in information systems , 1986 .

[20]  Diane I. Hillmann,et al.  The Continuum of Metadata Quality: Defining, Expressing, Exploiting , 2004 .

[21]  Arlene G. Taylor The Organization of Information. Library and Information Science Text Series. , 1999 .

[22]  F. B. Vernadat,et al.  Decisions with Multiple Objectives: Preferences and Value Tradeoffs , 1994 .

[23]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[24]  David A. Wallace,et al.  Archives and the Public Good: Accountability and Records in Modern Society , 2002 .

[25]  Robert S. Taylor,et al.  Information use environments. , 1991 .

[26]  Diane Hillmann,et al.  Analyzing Metadata for Effective Use and Re-Use , 2003, Dublin Core Conference.

[27]  Diane M. Strong,et al.  Data quality in context , 1997, CACM.

[28]  Jean Tague-Sutcliffe,et al.  Measuring information : an information services perspective , 1995 .

[29]  Igor Aleksander,et al.  The Study of Information: Interdisciplinary Messages , 1986, J. Inf. Technol..

[30]  A. G. Oettinger,et al.  Language and information , 1968 .

[31]  H Austin,et al.  The economics of information systems. , 1986, Computers in healthcare.

[32]  B. C. Brookes ROBERT FAIRTHORNE AND THE SCOPE OF INFORMATION SCIENCE , 1974 .

[33]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[34]  Michael K. Buckland,et al.  Information as thing , 1991, J. Am. Soc. Inf. Sci..

[35]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[36]  James R. Evans,et al.  The management and control of quality , 1989 .

[37]  Thomas C. Redman,et al.  Data Quality Management and Technology , 1992 .