On predicting transport proteins and their substrates for the reconstruction of metabolic networks

The reconstruction of a metabolic network is a key step in systems biology and synthetic biology. A genome-scale network reconstruction builds a map to represent the gene-protein-reaction (GPR) association between the genes, the proteins which are the gene products, to the reaction that is carried out by the protein. For metabolism, the reaction is a metabolic reaction, and the protein is an enzyme catalysing the reaction. The transport of substrates across membranes is modeled by transport reactions. For higher organisms, the cellular compartments are important, so models include the extracellular space, the cytosol, the mitochondrion, and sometimes the peroxisome. The prediction of transport proteins for the GPR association, in particular, the specific substrate, or substrates, the protein transports across the membrane, and which membrane, is not as developed as the prediction of the GPR association for enzymes. In this paper, we illustrate the deficiencies in the state-of-the-art, and present our initial work to improve the prediction of GPR associations for transmembrane transporters.

[1]  Minoru Kanehisa,et al.  Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs , 2003, Bioinform..

[2]  Peter D. Karp,et al.  Annotation-based inference of transporter function , 2008, ISMB.

[3]  W. K. Purves Life: The Science of Biology , 1985 .

[4]  M. Gromiha,et al.  Classification of transporters using efficient radial basis function networks with position‐specific scoring matrices and biochemical properties , 2010, Proteins.

[5]  Milton H. Saier,et al.  Reliability of Nine Programs of Topological Predictions and Their Application to Integral Membrane Channel and Carrier Proteins , 2014, Journal of Molecular Microbiology and Biotechnology.

[6]  Ian T. Paulsen,et al.  TransportDB: a comprehensive database resource for cytoplasmic membrane transport systems and outer membrane channels , 2006, Nucleic Acids Res..

[7]  Volkhard Helms,et al.  Classifying Substrate Specificities of Membrane Transporters from Arabidopsis thaliana , 2010, J. Chem. Inf. Model..

[8]  Ian T. Paulsen,et al.  Comparative Analyses of Fundamental Differences in Membrane Transport Capabilities in Prokaryotes and Eukaryotes , 2005, PLoS Comput. Biol..

[9]  V. Helms,et al.  Functional classification of membrane transporters and channels based on filtered TM/non‐TM amino acid composition , 2012, Biopolymers.

[10]  S. White,et al.  Membrane protein folding and stability: physical principles. , 1999, Annual review of biophysics and biomolecular structure.

[11]  Yungki Park,et al.  Prediction of the burial status of transmembrane residues of helical membrane proteins , 2007, BMC Bioinformatics.

[12]  Patrick X. Zhao,et al.  Prediction of Membrane Transport Proteins and Their Substrate Specificities Using Primary Sequence Information , 2014, PloS one.

[13]  Robert H. Michell Cell membranes , 1976, Nature.

[14]  Charles Elkan,et al.  The Transporter Classification Database: recent advances , 2008, Nucleic Acids Res..

[15]  Milton H. Saier,et al.  The IUBMB-endorsed transporter classification system , 2004, Methods in molecular biology.

[16]  M H Saier,et al.  A web-based program for the prediction of average hydropathy, average amphipathicity and average similarity of multiply aligned homologous proteins. , 2001, Journal of molecular microbiology and biotechnology.

[17]  István Simon,et al.  The HMMTOP transmembrane topology prediction server , 2001, Bioinform..

[18]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[19]  Patrick Xuechun Zhao,et al.  A nearest neighbor approach for automated transporter prediction and categorization from protein sequences , 2008, Bioinform..

[20]  Georg E Schulz Transmembrane beta-barrel proteins. , 2003, Advances in protein chemistry.

[21]  Yu-Yen Ou,et al.  Prediction of transporter targets using efficient RBF networks with PSSM profiles and biochemical properties , 2011, Bioinform..

[22]  Robert Sabatier,et al.  IMGT standardized criteria for statistical analysis of immunoglobulin V‐REGION amino acid properties , 2004, Journal of molecular recognition : JMR.

[23]  S. Munro,et al.  A Comprehensive Comparison of Transmembrane Domains Reveals Organelle-Specific Properties , 2010, Cell.

[24]  Stefan Bruder,et al.  Engineering of yeast hexose transporters to transport d-xylose without inhibition by d-glucose , 2014, Proceedings of the National Academy of Sciences.

[25]  Ian T. Paulsen,et al.  TransportDB: a relational database of cellular membrane transport systems , 2004, Nucleic Acids Res..

[26]  Thomas G. Dietterich,et al.  Bioinformatics The Machine Learning Approach 2nd ed. , 2001 .

[27]  Milton H Saier,et al.  Bioinformatic analyses of integral membrane transport proteins encoded within the genome of the planctomycetes species, Rhodopirellula baltica. , 2014, Biochimica et biophysica acta.

[28]  Lubert Stryer,et al.  Protein structure and function , 2005, Experientia.

[29]  Y. Z. Chen,et al.  Prediction of transporter family from protein sequence by support vector machine approach , 2005, Proteins.

[30]  Christina A Cuomo,et al.  Approaches to Fungal Genome Annotation , 2011, Mycology.

[31]  Patrick Xuechun Zhao,et al.  TransportTP: A two-phase classification approach for membrane transporter prediction and characterization , 2009, BMC Bioinformatics.

[32]  M. Michael Gromiha,et al.  Functional discrimination of membrane proteins using machine learning techniques , 2008, BMC Bioinformatics.

[33]  Robert D. Finn,et al.  The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam , 2015, Briefings Bioinform..

[34]  David T. Jones,et al.  Predicting Transmembrane Helix Packing Arrangements using Residue Contacts and a Force-Directed Algorithm , 2010, PLoS Comput. Biol..

[35]  Georg E. Schulz,et al.  Transmembrane β-barrel proteins , 2003 .

[36]  Yu-Yen Ou,et al.  Bioinformatics approaches for functional annotation of membrane proteins , 2014, Briefings Bioinform..

[37]  Burkhard Rost,et al.  LocTree3 prediction of localization , 2014, Nucleic Acids Res..

[38]  J. Nielsen,et al.  Metabolic model integration of the bibliome, genome, metabolome and reactome of Aspergillus niger , 2008, Molecular systems biology.

[39]  K. Chou,et al.  Prediction of protein subcellular locations by incorporating quasi-sequence-order effect. , 2000, Biochemical and biophysical research communications.

[40]  Milton H. Saier,et al.  The Transporter Classification Database , 2013, Nucleic Acids Res..

[41]  Milton H Saier,et al.  BioV Suite – a collection of programs for the study of transport protein evolution , 2012, The FEBS journal.

[42]  Volkhard Helms,et al.  Transferring functional annotations of membrane transporters on the basis of sequence similarity and sequence motifs , 2013, BMC Bioinformatics.

[43]  Marcin J. Skwark,et al.  Sequence analysis SPOCTOPUS: a combined predictor of signal peptides and membrane protein topology , 2008 .

[44]  Gregory R. Grant,et al.  Bioinformatics - The Machine Learning Approach , 2000, Comput. Chem..