bcnQL: A Query Language for Biochemical Network

This paper proposes a graph data model that can represent the Biochemical Network. In the data model, the Node class and extended Node subclasses are used to represent the Biochemical Entities and Interactions, and the Edge class is used to describe the relationships between nodes. Furthermore, the Path and Hyper-Path classes are proposed to represent various processes and pathways. The study presented in this paper also proposes a Query Language bcnQL that empowers users to query entities, interactions, processes and pathways with arbitrary conditions. In addition, new graphs can be composed with hyper-paths using supported graph functions. The language employs node formula, edge formula, path formula, hyper-path formula and graph formula to construct a collection of node objects, edge objects, path objects, hyper-path objects and graph objects respectively. Some query examples are presented to demonstrate the applicability of the language for this specific domain. Finally, we provide a prototype implementation for the query language.

[1]  Ulf Leser,et al.  A query language for biological networks , 2005, ECCB/JBI.

[2]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[3]  S. Wodak,et al.  Representing and Analysing Molecular and Cellular Function Using the Computer , 2000, Biological chemistry.

[4]  Gary D Bader,et al.  BIND--The Biomolecular Interaction Network Database. , 2001, Nucleic acids research.

[5]  Michel Scholl,et al.  Gram: a graph data model and query languages , 1992, ECHT '92.

[6]  Ralf Hartmut Güting,et al.  GraphDB: Modeling and Querying Graphs in Databases , 1994, VLDB.

[7]  Gultekin Özsoyoglu,et al.  A graph query language and its query processing , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[8]  Peter D. Karp,et al.  An ontology for biological function based on molecular interactions , 2000, Bioinform..

[9]  Tsuguchika Kaminuma,et al.  A Database for Cell Signaling Networks , 1998, J. Comput. Biol..

[10]  Gultekin Özsoyoglu,et al.  Pathways Database System: An Integrated System for Biological Pathways , 2003, Bioinform..

[11]  Alberto O. Mendelzon,et al.  GraphLog: a visual formalism for real life recursion , 1990, PODS '90.

[12]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[13]  Amos Bairoch,et al.  The ENZYME database in 2000 , 2000, Nucleic Acids Res..