Development of an Informatics Platform for Therapeutic Protein and Peptide Analytics

The momentum gained by research on biologics has not been met yet with equal thrust on the informatics side. There is a noticeable lack of software for data management that empowers the bench scientists working on the development of biologic therapeutics. SARvision|Biologics is a tool to analyze data associated with biopolymers, including peptides, antibodies, and protein therapeutics programs. The program brings under a single user interface tools to filter, mine, and visualize data as well as those algorithms needed to organize sequences. As part of the data-analysis tools, we introduce two new concepts: mutation cliffs and invariant maps. Invariant maps show the variability of properties when a monomer is maintained constant in a position of the biopolymer. Mutation cliff maps draw attention to pairs of sequences where a single or limited number of point mutations elicit a large change in a property of interest. We illustrate the program and its applications using a peptide data set collected from the literature.

[1]  Jan H. Jensen,et al.  Building a BioChemformatics Database , 2008, J. Chem. Inf. Model..

[2]  Jürgen Bajorath,et al.  Exploring activity cliffs in medicinal chemistry. , 2012, Journal of medicinal chemistry.

[3]  Louis M. Weiner,et al.  Monoclonal antibodies: versatile platforms for cancer immunotherapy , 2010, Nature Reviews Immunology.

[4]  Andrew C. Chan,et al.  Therapeutic antibodies for autoimmunity and inflammation , 2010, Nature Reviews Immunology.

[5]  Paul C Nagle,et al.  Characteristics of and trends in the late-stage biopharmaceutical pipeline. , 2008, The American journal of managed care.

[6]  Hongli Li,et al.  HELM: A Hierarchical Notation Language for Complex Biomolecule Structure Representation , 2012, J. Chem. Inf. Model..

[7]  Costas D Maranas,et al.  Recent advances in computational protein design. , 2011, Current opinion in structural biology.

[8]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[9]  Edgar P Herrero,et al.  Polymer-based oral peptide nanomedicines. , 2012, Therapeutic delivery.

[10]  B. Zinman,et al.  Insulins: past, present, and future. , 2012, Endocrinology and metabolism clinics of North America.

[11]  David E. Golan,et al.  Protein therapeutics: a summary and pharmacological classification , 2008, Nature Reviews Drug Discovery.

[12]  William Lingran Chen,et al.  Self-Contained Sequence Representation: Bridging the Gap between Bioinformatics and Cheminformatics , 2011, J. Chem. Inf. Model..

[13]  Gerald M. Maggiora,et al.  On Outliers and Activity Cliffs-Why QSAR Often Disappoints , 2006, J. Chem. Inf. Model..

[14]  Marie L Ary,et al.  Data management solutions for protein therapeutic research and development. , 2005, Drug discovery today.

[15]  H. Sharma,et al.  Monoclonal antibodies as novel neurotherapeutic agents in CNS injury and repair. , 2012, International review of neurobiology.

[16]  J. Burnett,et al.  RNA-based therapeutics: current progress and future prospects. , 2012, Chemistry & biology.

[17]  David Weininger,et al.  CHUCKLES: A method for representing and searching peptide and peptoid sequences on both monomer and atomic levels , 1994, J. Chem. Inf. Comput. Sci..

[18]  S. Whyte,et al.  Granulocyte colony-stimulating factors for febrile neutropenia prophylaxis following chemotherapy: systematic review and meta-analysis , 2011, BMC Cancer.