Using Planning for Query Decomposition in Bioinformatics

Domains like bioinformatics are complexdata integration domains because data from remote sources and specialized applications need to be combined to answer queries. An important characteristic of such a domain is that actions may be mutually exclusive or causally related. Moreover, there is (partially complete) domain specific knowledge about how queries should be answered since an average user is a sophisticated domain expert. Query plans in these domains should not only answer the query but also respect any user intent or domain guidance provided to improve the perceived quality of the result and query execution time. Previously, methods like rule inversion and view unfolding have been found to be more effective than AI planning in obtaining access sequences for sources without interactions, a case when sources are just repositories of data. We present a solution in SHQPlanner, a hierarchical temporal planner for query planning and execution monitoring in complex domains based on previous theoretical work on HTN planning with partial domain knowledge on one hand and temporal reasoning for query cost on the other. SHQPlanner is a sound, complete, and efficient domain-independent query planner that can incorporate partial query decomposition, source preferences, data and application interaction, and temporal constraints.

[1]  Carole A. Goble,et al.  Transparent access to multiple bioinformatics information sources , 2001, IBM Syst. J..

[2]  Laura M. Haas,et al.  DiscoveryLink: A system for integrated access to life sciences data sources , 2001, IBM Syst. J..

[3]  Oliver M. Duschka Query Optimization Using Local Completeness , 1997, AAAI/IAAI.

[4]  Andreas D. Baxevanis,et al.  The Molecular Biology Database Collection: 2002 update , 2002, Nucleic Acids Res..

[5]  Val Tannen,et al.  K2/Kleisli and GUS: Experiments in integrated access to genomic data sources , 2001, IBM Syst. J..

[6]  S. Kambhampati,et al.  Sapa: A Domain-Independent Heuristic Metric Temporal Planner , 2014 .

[7]  HalevyAlon,et al.  MiniCon: A scalable algorithm for answering queries using views , 2001, VLDB 2001.

[8]  Daniel S. Weld,et al.  Planning to Gather Information , 1996, AAAI/IAAI, Vol. 1.

[9]  Thure Etzold,et al.  SRS - an indexing and retrieval tool for flat file data libraries , 1993, Comput. Appl. Biosci..

[10]  Kutluhan Erol,et al.  Hierarchical task network planning: formalization, analysis, and implementation , 1996 .

[11]  Asunción Gómez-Pérez,et al.  Knowledge Management through Ontologies , 1998, PAKM.

[12]  Subbarao Kambhampati,et al.  Hybrid Planning for Partially Hierarchical Domains , 1998, AAAI/IAAI.

[13]  Peter D. Karp,et al.  A Strategy for Database Interoperation , 1995, J. Comput. Biol..

[14]  Anthony Barrett,et al.  Task-Decomposition via Plan Parsing , 1994, AAAI.

[15]  Jennifer Widom,et al.  Research problems in data warehousing , 1995, CIKM '95.

[16]  Biplav Srivastava,et al.  A Common Data Representation for Organizing and Managing Annotations of Biochip Expression Data , 2002 .

[17]  M. Kanehisa,et al.  DBGET/LinkDB: an integrated database retrieval system. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[18]  Terence Critchlow,et al.  Automatic Generation of Warehouse Mediators Using an Ontology Engine , 1998, KRDB.