Integrating fragmented software applications into holistic solutions: focus on drug discovery

Introduction: Current advances in software development and global molecular profiling technologies allow the development of holistic software solutions for drug discovery. Such solutions must streamline in silico drug and therapy development by integrating all types of data into one knowledge base and also by enabling continuous analysis workflows uninterrupted by manual restructuring of inputs and outputs from workflow components. They must provide a collaborative environment for data sharing between multiple users and allow importing of all types of experimental data for subsequent analysis. Areas covered: The reader is provided with a review of disparate software applications currently used in drug development and a discussion of existing organizational challenges for development of holistic software solutions. The reader is also provided with a proposed conceptual framework for integration of software components and some details for its implementation are suggested. Expert opinion: Holistic solutions can undoubtedly affect the speed, quality and cost of drug development and personalized therapy. However, it must be constantly evolved to rapidly adopt new experimental and statistical methods, incorporate advances in software technologies and allow perpetual optimization of its components. Perpetual improvements in data structure, data quality, statistical algorithms and other mathematical approaches for computer modeling can gradually shift financial and cultural emphasis in the pharmaceutical industry away from traditional experimental approaches and towards computational approaches.

[1]  Péter Csermely,et al.  The efficiency of multi-target drugs: the network approach might help drug design. , 2004, Trends in pharmacological sciences.

[2]  Xiaohui Wu,et al.  Predictive modeling of plant messenger RNA polyadenylation sites , 2007, BMC Bioinformatics.

[3]  Weida Tong,et al.  Assessing QSAR Limitations - A Regulatory Perspective , 2005 .

[4]  Hiroaki Kitano,et al.  Biological robustness , 2008, Nature Reviews Genetics.

[5]  S. Salzberg,et al.  Fast algorithms for large-scale genome alignment and comparison. , 2002, Nucleic acids research.

[6]  D. Eisenberg,et al.  A method to identify protein sequences that fold into a known three-dimensional structure. , 1991, Science.

[7]  Anton Yuryev,et al.  Computational Approaches for Drug Repositioning and Combination Therapy Design , 2010, J. Bioinform. Comput. Biol..

[8]  M. Zhan,et al.  Exploring pathways from gene co-expression to network dynamics. , 2009, Methods in molecular biology.

[9]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[10]  Mickael Guedj,et al.  Should We Abandon the t-Test in the Analysis of Gene Expression Microarray Data: A Comparison of Variance Modeling Strategies , 2010, PloS one.

[11]  N. Lee,et al.  Computational and experimental approaches for modeling gene regulatory networks. , 2007, Current pharmaceutical design.

[12]  Y. Wang,et al.  Inferring regulatory networks. , 2008, Frontiers in bioscience : a journal and virtual library.

[13]  Yang Zhang,et al.  The protein structure prediction problem could be solved using the current PDB library. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[14]  A. Hopkins Network pharmacology: the next paradigm in drug discovery. , 2008, Nature chemical biology.

[15]  Rafael C. Jimenez,et al.  The IntAct molecular interaction database in 2012 , 2011, Nucleic Acids Res..

[16]  Sergei Egorov,et al.  MedScan, a natural language processing engine for MEDLINE abstracts , 2003, Bioinform..

[17]  Melissa Paczkowski,et al.  Cataloging the relationships between proteins , 2006, Molecular biotechnology.

[18]  Hao Sun,et al.  Structure‐based Drug Metabolism Predictions for Drug Design , 2010, Chemical biology & drug design.

[19]  Peter Bühlmann,et al.  Analyzing gene expression data in terms of gene sets: methodological issues , 2007, Bioinform..

[20]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[21]  D. Higgins,et al.  Multiple sequence alignments. , 2005, Current opinion in structural biology.

[22]  István A. Kovács,et al.  Drug-therapy networks and the prediction of novel drug targets , 2008, Journal of biology.

[23]  Illés J. Farkas,et al.  Uniformly curated signaling pathways reveal tissue-specific cross-talks and support drug target discovery , 2010, Bioinform..

[24]  Anton Yuryev,et al.  Molecular Networks in microarray Analysis , 2007, J. Bioinform. Comput. Biol..

[25]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Anton Yuryev,et al.  Ariadne's ChemEffect and Pathway Studio knowledge base , 2009, Expert opinion on drug discovery.

[27]  T. Ideker A systems approach to discovering signaling and regulatory pathways--or, how to digest large interaction networks into relevant pieces. , 2004, Advances in experimental medicine and biology.

[28]  John C. Dearden,et al.  In silico prediction of drug toxicity , 2003, J. Comput. Aided Mol. Des..

[29]  Qi Liu,et al.  BMC Bioinformatics BioMed Central Methodology article Comparative evaluation of gene-set analysis methods , 2007 .

[30]  Qi Liu,et al.  Improving gene set analysis of microarray data by SAM-GS , 2007, BMC Bioinformatics.

[31]  R. Stephens,et al.  SLEPR: A Sample-Level Enrichment-Based Pathway Ranking Method — Seeking Biological Themes through Pathway-Level Consistency , 2008, PloS one.

[32]  Julie D Thompson,et al.  Multiple Sequence Alignment Using ClustalW and ClustalX , 2003, Current protocols in bioinformatics.

[33]  J. SantaLucia,et al.  A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Sean Ekins,et al.  Pathway mapping tools for analysis of high content data. , 2007, Methods in molecular biology.

[35]  Ernesto Picardi,et al.  Computational methods for ab initio and comparative gene finding. , 2010, Methods in molecular biology.

[36]  Per Broberg,et al.  A comparative review of estimates of the proportion unchanged genes and the false discovery rate , 2005, BMC Bioinformatics.

[37]  Danail Bonchev,et al.  From Molecular to Biological Structure and Back , 2007, J. Chem. Inf. Model..