Quality-Driven Query Answering for Integrated Information Systems

Querying the Web.- Integrating Autonomous Information Sources.- Information Quality.- Information Quality Criteria.- Quality Ranking Methods.- Quality-Driven Query Answering.- Quality-Driven Query Planning.- Query Planning Revisited.- Completeness of Data.- Completeness-Driven Query Optimization.- Discussion.- Conclusion.

[1]  Danièle Gardy,et al.  On the effect of join operations on relation sizes , 1989, TODS.

[2]  Michael R. Genesereth,et al.  Answering recursive queries using views , 1997, PODS '97.

[3]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[4]  Mary Roth,et al.  Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources , 1997, VLDB.

[5]  Ying Chen,et al.  Query processing with quality control in the World Wide Web , 1998, World Wide Web.

[6]  Luis Gravano,et al.  GlOSS: text-source discovery over the Internet , 1999, TODS.

[7]  David Maier,et al.  On the foundations of the universal relation model , 1984, TODS.

[8]  Matthias Jarke,et al.  Design and Analysis of Quality Information for Data Warehouses , 1998, ER.

[9]  Dennis Tsichritzis,et al.  The ANSI/X3/SPARC DBMS Framework Report of the Study Group on Dabatase Management Systems , 1978, Inf. Syst..

[10]  C. J. Date Relational Database - Selected Writings , 1986 .

[11]  Felix Naumann Data Fusion and Data Quality , 1998 .

[12]  Clement T. Yu,et al.  Priniples of Database Query Processing for Advanced Applications , 1997 .

[13]  Ulf Leser,et al.  Query planning in mediator based information systems , 2000 .

[14]  Beng Chin Ooi,et al.  On getting some answers quickly, and perhaps more later , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[15]  Arnaud Sahuguet,et al.  Building Light-Weight Wrappers for Legacy Web Data-Sources Using W4F , 1999, VLDB.

[16]  Jo-Mei Chang A Heuristic Approach to Distributed Query Processing , 1982, VLDB.

[17]  Hans-Joachim Lenz,et al.  European Conference on Information Systems ( ECIS ) 2000 Data Integration by Means of Object Identification in Information Systems , 2017 .

[18]  Felix Naumann,et al.  Do Metadata Models meet IQ Requirements? , 1999, IQ.

[19]  Diane M. Strong,et al.  An Information Quality Assessment Methodology: Extended Abstract , 1999, IQ.

[20]  Hongjun Lu,et al.  Cleansing Data for Mining and Warehousing , 1999, DEXA.

[21]  Felix Naumann,et al.  Cooperative Query Answering with Density Scores , 2000 .

[22]  King-Lup Liu,et al.  Estimating the usefulness of search engines , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[23]  Daniel S. Weld,et al.  Planning to Gather Information , 1996, AAAI/IAAI, Vol. 1.

[24]  Arnon Rosenthal Note on the expected size of a join , 1981, SGMD.

[25]  Hector Garcia-Molina,et al.  Finding replicated Web collections , 2000, SIGMOD '00.

[26]  Alon Y. Halevy,et al.  MiniCon: A scalable algorithm for answering queries using views , 2000, The VLDB Journal.

[27]  Stavros Christodoulakis,et al.  Implications of certain assumptions in database performance evauation , 1984, TODS.

[28]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[29]  Gerhard Weikum,et al.  Towards Guaranteed Quality and Dependability of Information Services , 1999, BTW.

[30]  Mary Jane Willshire,et al.  Data Bryte: A Data Warehouse Cleansing Framework , 1999, IQ.

[31]  Agha Iqbal Ali,et al.  Streamlined computation for data envelopment analysis , 1993 .

[32]  Thomas Redman,et al.  The impact of poor data quality on the typical enterprise , 1998, CACM.

[33]  Ramez Elmasri,et al.  Fundamentals of Database Systems , 1989 .

[34]  Luis Gravano,et al.  STARTS: Stanford proposal for Internet meta-searching , 1997, SIGMOD '97.

[35]  Jeffrey D. Ullman,et al.  MedMaker: a mediation system based on declarative specifications , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[36]  Richard Hull,et al.  Managing semantic heterogeneity in databases: a theoretical prospective , 1997, PODS.

[37]  Robert M. Pirsig Man and Machine. (Book Reviews: Zen and the Art of Motorcycle Maintenance. An Inquiry into Values) , 1974 .

[38]  Chad Carson,et al.  Optimizing queries over multimedia repositories , 1996, SIGMOD '96.

[39]  George B. Dantzig,et al.  Linear programming and extensions , 1965 .

[40]  Monica Bobrowski,et al.  A Homogeneous Framework to Measure Data Quality , 1999, IQ.

[41]  Salvatore J. Stolfo,et al.  Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem , 1998, Data Mining and Knowledge Discovery.

[42]  Vladimir Zadorozhny,et al.  Learning response time for WebSources using query feedback and application in query optimization , 2000, The VLDB Journal.

[43]  Abraham Charnes,et al.  Measuring the efficiency of decision making units , 1978 .

[44]  Divesh Srivastava,et al.  Data model and query evaluation in global information systems , 1995, Journal of Intelligent Information Systems.

[45]  Doron Rotem,et al.  Random Sampling from Database Files: A Survey , 1990, SSDBM.

[46]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[47]  Ulf Leser,et al.  Designing a Global Information Resource for Molecular Biology , 1999, BTW.

[48]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.

[49]  Charles Schroeder,et al.  DataBryte: A Proposed Data Warehouse Cleansing Framework , 1998, IQ.

[50]  Yannis Papakonstantinou,et al.  Object Fusion in Mediator Systems , 1996, VLDB.

[51]  David S. Johnson,et al.  Computers and In stractability: A Guide to the Theory of NP-Completeness. W. H Freeman, San Fran , 1979 .

[52]  Xiaolei Qian,et al.  Query folding , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[53]  A. Charnes,et al.  Data Envelopment Analysis Theory, Methodology and Applications , 1995 .

[54]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[55]  Craig A. Knoblock,et al.  Query processing in the SIMS information mediator , 1997 .

[56]  Elizabeth M. Pierce Enumerating Data Errors: A Survey of the Counting Literature , 1998, IQ.

[57]  Ulf Leser,et al.  Combining Heterogeneous Data Sources through Query Correspondence Assertions , 1998, Workshop on Web Information and Data Management.

[58]  Sumit Ganguly,et al.  Query optimization for parallel execution , 1992, SIGMOD '92.

[59]  Alon Y. Halevy,et al.  Using Probabilistic Information in Data Integration , 1997, VLDB.

[60]  William Kent,et al.  The breakdown of the information model in multi-database systems , 1991, SGMD.

[61]  Ana Maria de Carvalho Moura,et al.  A survey on metadata for describing and retrieving Internet resources , 1998, World Wide Web.

[62]  Hector Garcia-Molina,et al.  SCAM: A Copy Detection Mechanism for Digital Documents , 1995, DL.

[63]  Arnon Rosenthal,et al.  Outerjoin simplification and reordering for query optimization , 1997, TODS.

[64]  Ken Orr,et al.  Data quality and systems theory , 1998, CACM.

[65]  Luis Gravano,et al.  Evaluating Top-k Selection Queries , 1999, VLDB.

[66]  Abraham Charnes,et al.  Programming with linear fractional functionals , 1962 .

[67]  Donald D. Chamberlin,et al.  Access Path Selection in a Relational Database Management System , 1989 .

[68]  Ashok K. Chandra,et al.  Optimal implementation of conjunctive queries in relational data bases , 1977, STOC '77.

[69]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[70]  Amihai Motro,et al.  Estimating the Quality of Databases , 1998, FQAS.

[71]  Wojciech Ziarko,et al.  Discovery through rough set theory , 1999, Commun. ACM.

[72]  Matthias Jarke,et al.  Dwq : Esprit Long Term Research Project, No 22469 Data Warehouse Quality: a Review of the Dwq Project , 2022 .

[73]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[74]  Amihai Motro,et al.  Completeness Information and Its Application to Query Processing , 1986, VLDB.

[75]  Felix Naumann,et al.  Quality-driven Integration of Heterogenous Information Systems , 1999, VLDB.

[76]  Felix Naumann,et al.  Density Scores for Cooperative Query Answering , 1999, Föderierte Datenbanken.

[77]  T. Saaty,et al.  The Analytic Hierarchy Process , 1985 .

[78]  Felix Naumann,et al.  Query Planning with Information Quality Bounds , 2000, FQAS.

[79]  Ephraim R. McLean,et al.  Information Systems Success: The Quest for the Dependent Variable , 1992, Inf. Syst. Res..

[80]  Giri Kumar Tayi,et al.  Examining data quality , 1998, CACM.

[81]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[82]  Ian Horrocks,et al.  Knowledge Representation on the Web , 2000, Description Logics.

[83]  Hubert Naacke,et al.  Leveraging mediator cost models with heterogeneous data sources , 1998, Proceedings 14th International Conference on Data Engineering.

[84]  Rajiv M. Dewan,et al.  Internet service providers, proprietary content, and the battle for users' dollars , 1998, CACM.

[85]  Maria-Esther Vidal,et al.  Using Quality of Data Metadata for Source Selection and Ranking , 2000, WebDB.

[86]  YerneniStanford,et al.  Maximizing Coverage of Mediated Web QueriesRamana , 2000 .

[87]  Joann J. Ordille,et al.  Query-Answering Algorithms for Information Agents , 1996, AAAI/IAAI, Vol. 1.

[88]  Tok Wang Ling,et al.  A Data Model for Semistructured Data with Partial and Inconsistent Information , 2000, EDBT.

[89]  Andrei Z. Broder,et al.  A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines , 1998, Comput. Networks.

[90]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[91]  Michael V. Mannino,et al.  Statistical profile estimation in database systems , 1988, CSUR.

[92]  Alain Pirotte,et al.  Generalized joins , 1976, SGMD.

[93]  Jeffrey D. Ullman,et al.  Principles of Database and Knowledge-Base Systems, Volume II , 1988, Principles of computer science series.

[94]  Luis Gravano,et al.  The Effectiveness of GlOSS for the Text Database Discovery Problem , 1994, SIGMOD Conference.

[95]  Theo Härder,et al.  The intrinsic problems of structural heterogeneity and an approach to their solution , 1999, The VLDB Journal.

[96]  L. G. H. Cijan A polynomial algorithm in linear programming , 1979 .

[97]  Felix Naumann,et al.  Assessment Methods for Information Quality Criteria , 2000, IQ.

[98]  Arun N. Swami,et al.  On the Estimation of Join Result Sizes , 1994, EDBT.

[99]  Dennis Shasha,et al.  An extensible Framework for Data Cleaning , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[100]  Richard Bellman,et al.  Dynamic Programming and Stochastic Control Processes , 1958, Inf. Control..

[101]  Felix Naumann,et al.  Quality Driven Source Selection Using Data Envelope Analysis , 1998, IQ.

[102]  Felix Naumann,et al.  Completeness of Information Sources , 2000 .

[103]  R. Weiner Lecture Notes in Economics and Mathematical Systems , 1985 .

[104]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.