Quality-Driven Information Filtering- In the Context of Web-Based Information Systems

Web-based information systems, such as search engines, news portals, electronic markets and community sites, provide access to information originating from numerous information providers. The quality of provided information varies as information providers have different levels of knowledge and different intentions. Users of web-based systems are therefore confronted with the increasingly difficult task to select high quality information from the vast amount of Web-accessible information. How can information systems support users to distinguish high quality from low quality information? Which filtering mechanisms can be applied? How can filtering decisions be explained to the user? The book gives an overview about information quality assessment in context of web-based systems. Afterwards, a quality-driven information filtering framework is developed. The framework allows information consumers to apply a wide range of different filtering policies. In order to facilitate the information consumers' understanding of filtering decisions, the framework generates explanations why information satisfies a specific policy. The book targets Web developers who need to handle information quality problems within their applications as well as researchers working on the topic.

[1]  Denis A. Nicole,et al.  Named Graphs as a Mechanism for Reasoning About Provenance , 2006, APWeb.

[2]  Axel Funk,et al.  Die GNU General Public License, Version 3 , 2007 .

[3]  Barbara Pernici,et al.  Data Quality in Web Information Systems , 2003, J. Data Semant..

[4]  Marsha Ann Tate,et al.  Web Wisdom: How To Evaluate and Create Information Quality on the Web , 1999 .

[5]  Raphael Volz,et al.  A Comparison of RDF Query Languages , 2004, SEMWEB.

[6]  Richard Y. Wang,et al.  Data Quality , 2000, Advances in Database Systems.

[7]  Frank van Harmelen,et al.  A semantic web primer , 2004 .

[8]  Jennifer Golbeck,et al.  Computing and Applying Trust in Web-based Social Networks , 2005 .

[9]  Chrysanthos Dellarocas,et al.  Immunizing online reputation reporting systems against unfair ratings and discriminatory behavior , 2000, EC '00.

[10]  Pablo Castells,et al.  An Ontology-Based Information Retrieval Model , 2005, ESWC.

[11]  Tiziana Catarci,et al.  Trusting Data Quality in Cooperative Information Systems , 2002, OTM.

[12]  Jeremy J. Carroll,et al.  Signing RDF Graphs , 2003, SEMWEB.

[13]  Thomas Redman,et al.  The impact of poor data quality on the typical enterprise , 1998, CACM.

[14]  Diane M. Strong,et al.  Data quality in context , 1997, CACM.

[15]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[16]  Giri Kumar Tayi,et al.  Enhancing data quality in data warehouse environments , 1999, CACM.

[17]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[18]  Jeremy J. Carroll,et al.  Named graphs, provenance and trust , 2005, WWW '05.

[19]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[20]  Jeremy J. Carroll,et al.  Named graphs , 2005, J. Web Semant..

[21]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[22]  Raph Levien,et al.  Attack-Resistant Trust Metrics , 2009, Computing with Social Trust.

[23]  Hector Garcia-Molina,et al.  Web Spam Taxonomy , 2005, AIRWeb.

[24]  Georg Lausen,et al.  Spreading activation models for trust propagation , 2004, IEEE International Conference on e-Technology, e-Commerce and e-Service, 2004. EEE '04. 2004.

[25]  Alan H. Karp,et al.  Computing the digest of an RDF graph , 2004 .

[26]  Julian L. Simon,et al.  Basic research methods in social science : the art of empirical investigation , 1970 .

[27]  Robert M. MacGregor,et al.  Representing Contextualized Data using Semantic Web Tools , 2003, PSSS.

[28]  Ching-Lai Hwang,et al.  Multiple attribute decision making : an introduction , 1995 .

[29]  Steven J. DeRose,et al.  XML Path Language (XPath) Version 1.0 , 1999 .

[30]  Deborah L. McGuinness,et al.  Infrastructure for Web Explanations , 2003, SEMWEB.

[31]  Felix Naumann,et al.  Assessment Methods for Information Quality Criteria , 2000, IQ.

[32]  Ophir Frieder,et al.  Information Retrieval: Algorithms and Heuristics , 1998 .

[33]  Yolanda Gil,et al.  Trusting Information Sources One Citizen at a Time , 2002, SEMWEB.

[34]  Richard Y. Wang,et al.  Data quality assessment , 2002, CACM.

[35]  Jennifer Widom,et al.  Practical lineage tracing in data warehouses , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[36]  Giri Kumar Tayi,et al.  Examining data quality , 1998, CACM.

[37]  Leo Sauermann,et al.  Gnowsis Adapter Framework: Treating Structured Data Sources as Virtual RDF Graphs , 2005, SEMWEB.

[38]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[39]  Martin J. Eppler,et al.  Conceptualizing Information Quality: A Review of Information Quality Frameworks from the Last Ten Years , 2000, IQ.

[40]  J. Carroll,et al.  Jena: implementing the semantic web recommendations , 2004, WWW Alt. '04.

[41]  Stefan Decker,et al.  Semantic web methods for knowledge management , 2002 .

[42]  Jaswinder Pal Singh,et al.  Computing and using reputations for internet ratings , 2001, EC '01.

[43]  Thomas C. Redman,et al.  Data Quality: The Field Guide , 2001 .

[44]  Jeff Heflin,et al.  A Model Theoretic Semantics for Ontology Versioning , 2004, SEMWEB.

[45]  E. Friedman,et al.  The Social Cost of Cheap Pseudonyms , 2001 .

[46]  Karsten Tolle,et al.  Trust and context using the RDF-Source related Storage System (RDF-S3) and easy RQL (eRQL) , 2004, Berliner XML Tage.

[47]  Audun Jøsang,et al.  A survey of trust and reputation systems for online service provision , 2007, Decis. Support Syst..

[48]  Andy Oram,et al.  Peer-to-Peer: Harnessing the Power of Disruptive Technologies , 2001 .

[49]  Matthias Jarke,et al.  Dwq : Esprit Long Term Research Project, No 22469 Data Warehouse Quality: a Review of the Dwq Project , 2022 .

[50]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[51]  Felix Naumann,et al.  Quality-Driven Query Answering for Integrated Information Systems , 2002, Lecture Notes in Computer Science.

[52]  Richard Y. Wang,et al.  Modeling Information Manufacturing Systems to Determine Information Product Quality Management Scien , 1998 .

[53]  William M. Daley,et al.  Digital Signature Standard (DSS) , 2000 .

[54]  Giorgos Zacharia,et al.  Collaborative reputation mechanisms in electronic marketplaces , 1999, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[55]  Evangelos Triantaphyllou,et al.  Multi-criteria Decision Making Methods: A Comparative Study , 2000 .

[56]  Jennifer Golbeck,et al.  Using Trust and Provenance for Content Filtering on the Semantic Web , 2006, MTW.

[57]  David R. Karger,et al.  Piggy Bank: Experience the Semantic Web Inside Your Web Browser , 2005, International Semantic Web Conference.

[58]  Diane M. Strong,et al.  Information quality benchmarks: product and service performance , 2002, CACM.

[59]  Lydia B. Chilton,et al.  Tabulator: Exploring and Analyzing linked data on the Semantic Web , 2006 .

[60]  Alan R. Hevner,et al.  Design Science in Information Systems Research , 2004, MIS Q..

[61]  Marc Najork,et al.  Detecting spam web pages through content analysis , 2006, WWW '06.

[62]  John Beckford Quality: A Critical Introduction , 1998 .

[63]  Ernesto Damiani,et al.  Choosing reputable servents in a P2P network , 2002, WWW.

[64]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[65]  Cai-Nicolas Ziegler,et al.  Towards decentralized recommender systems , 2005 .

[66]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[67]  Deborah L. McGuinness,et al.  Explaining reasoning in description logics , 1996 .

[68]  Richard Cyganiak,et al.  NG4J - Named Graphs API for Jena , 2005 .

[69]  Ying Chen,et al.  Query processing with quality control in the World Wide Web , 1998, World Wide Web.

[70]  Michael Gertz,et al.  Managing Data Quality and Integrity in Federated Databases , 1998, IICIS.

[71]  Raymond T. Ng,et al.  Distance-based outliers: algorithms and applications , 2000, The VLDB Journal.

[72]  Martin J. Eppler,et al.  Measuring Information Quality in the Web Context: A Survey of State-of-the-Art Instruments and an Application Methodology , 2002, ICIQ.

[73]  David A. Hull Stemming algorithms: a case study for detailed evaluation , 1996 .

[74]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[75]  Wendy Hall,et al.  The Semantic Web Revisited , 2006, IEEE Intelligent Systems.

[76]  Felix Naumann From Databases to Information Systems - Information Quality Makes the Difference , 2001, IQ.