The Solute Carrier Families Have a Remarkably Long Evolutionary History with the Majority of the Human Families Present before Divergence of Bilaterian Species

The Solute Carriers (SLCs) are membrane proteins that regulate transport of many types of substances over the cell membrane. The SLCs are found in at least 46 gene families in the human genome. Here, we performed the first evolutionary analysis of the entire SLC family based on whole genome sequences. We systematically mined and analyzed the genomes of 17 species to identify SLC genes. In all, we identified 4,813 SLC sequences in these genomes, and we delineated the evolutionary history of each of the subgroups. Moreover, we also identified ten new human sequences not previously classified as SLCs, which most likely belong to the SLC family. We found that 43 of the 46 SLC families found in Homo sapiens were also found in Caenorhabditis elegans, whereas 42 of them were also found in insects. Mammals have a higher number of SLC genes in most families, perhaps reflecting important roles for these in central nervous system functions. This study provides a systematic analysis of the evolutionary history of the SLC families in Eukaryotes showing that the SLC superfamily is ancient with multiple branches that were present before early divergence of Bilateria. The results provide foundation for overall classification of SLC genes and are valuable for annotation and prediction of substrates for the many SLCs that have not been tested in experimental transport assays.

[1]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[2]  Robert Fredriksson,et al.  Mapping the human membrane proteome : a majority of the human membrane proteins can be classified according to function and evolutionary origin , 2015 .

[3]  L. Hug,et al.  Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic “supergroups” , 2009, Proceedings of the National Academy of Sciences.

[4]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[5]  H. Schiöth,et al.  The solute carrier (SLC) complement of the human genome: Phylogenetic classification reveals four major families , 2008, FEBS letters.

[6]  Da-Neng Wang,et al.  Ins and outs of major facilitator superfamily antiporters. , 2008, Annual review of microbiology.

[7]  Nicholas H. Putnam,et al.  The Trichoplax genome and the nature of placozoans , 2008, Nature.

[8]  G. Bánhegyi,et al.  Dehydroascorbate and glucose are taken up into Arabidopsis thaliana cell cultures by two distinct mechanisms , 2008, FEBS letters.

[9]  K. Inui,et al.  Physiological and pharmacokinetic roles of H+/organic cation antiporters (MATE/SLC47A). , 2008, Biochemical pharmacology.

[10]  H. Schiöth,et al.  The obesity gene, FTO, is of ancient origin, up-regulated during food deprivation and expressed in neurons of feeding-related nuclei of the brain. , 2008, Endocrinology.

[11]  B. Sundberg,et al.  The Evolutionary History and Tissue Mapping of Amino Acid Transporters Belonging to Solute Carrier Families SLC32, SLC36, and SLC38 , 2008, Journal of Molecular Neuroscience.

[12]  Y. Van de Peer,et al.  The genome of Laccaria bicolor provides insights into mycorrhizal symbiosis , 2008, Nature.

[13]  Nicholas H. Putnam,et al.  The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans , 2008, Nature.

[14]  Andreas Prlic,et al.  Ensembl 2008 , 2007, Nucleic Acids Res..

[15]  Robert Fredriksson,et al.  Identification of six putative human transporters with structural similarity to the drug transporter SLC22 family. , 2007, Genomics.

[16]  T. Kuroiwa,et al.  A 100%-complete sequence reveals unusually simple genomic features in the hot-spring red alga Cyanidioschyzon merolae , 2007, BMC Biology.

[17]  Nicholas H. Putnam,et al.  Sea Anemone Genome Reveals Ancestral Eumetazoan Gene Repertoire and Genomic Organization , 2007, Science.

[18]  Nicholas H. Putnam,et al.  The tiny eukaryote Ostreococcus provides genomic insights into the paradox of plankton speciation , 2007, Proceedings of the National Academy of Sciences.

[19]  Michael J. Haydon,et al.  A Novel Major Facilitator Superfamily Protein at the Tonoplast Influences Zinc Tolerance and Accumulation in Arabidopsis1[C][W][OA] , 2007, Plant Physiology.

[20]  M. J. Lemieux Eukaryotic major facilitator superfamily transporter modeling based on the prokaryotic GlpT crystal structure (Review) , 2007, Molecular membrane biology.

[21]  Scott Cain,et al.  ParameciumDB: a community resource that integrates the Paramecium tetraurelia genome sequence with genetic data , 2006, Nucleic Acids Res..

[22]  Robert D. Finn,et al.  Pfam: clans, web tools and services , 2005, Nucleic Acids Res..

[23]  R. Fredriksson,et al.  The repertoire of solute carriers of family 6: identification of new human and rodent genes. , 2005, Biochemical and biophysical research communications.

[24]  B. Haas,et al.  The Genome Sequence of Trypanosoma cruzi, Etiologic Agent of Chagas Disease , 2005, Science.

[25]  David L. Steffen,et al.  The genome of the social amoeba Dictyostelium discoideum , 2005, Nature.

[26]  W. Catterall,et al.  The VGL-Chanome: A Protein Superfamily Specialized for Electrical Signaling and Ionic Homeostasis , 2004, Science's STKE.

[27]  Andreas Rolfs,et al.  The ABCs of solute carriers: physiological, pathological and therapeutic implications of human membrane transport proteins , 2004, Pflügers Archiv.

[28]  Paul W Sternberg,et al.  The draft genome sequence of the nematode Caenorhabditis briggsae, a companion to C. elegans , 2003, Genome Biology.

[29]  H. Schiöth,et al.  The G-protein-coupled receptors in the human genome form five main families. Phylogenetic analysis, paralogon groups, and fingerprints. , 2003, Molecular pharmacology.

[30]  Ian Dunham,et al.  Reevaluating human gene annotation: a second-generation analysis of chromosome 22. , 2003, Genome research.

[31]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[32]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[33]  Jian Wang,et al.  The Genome Sequence of the Malaria Mosquito Anopheles gambiae , 2002, Science.

[34]  M. Madera,et al.  A comparison of profile hidden Markov model procedures for remote homology detection. , 2002, Nucleic acids research.

[35]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[36]  B. Barrell,et al.  The genome sequence of Schizosaccharomyces pombe , 2002, Nature.

[37]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[38]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[39]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[40]  M H Saier,et al.  The amino acid/polyamine/organocation (APC) superfamily of transporters specific for amino acids, polyamines and organocations. , 2000, Microbiology.

[41]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[42]  Andrew Smith Genome sequence of the nematode C-elegans: A platform for investigating biology , 1998 .

[43]  I. Paulsen,et al.  Major Facilitator Superfamily , 1998, Microbiology and Molecular Biology Reviews.

[44]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[45]  J. Berg Genome sequence of the nematode C. elegans: a platform for investigating biology. , 1998, Science.

[46]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[47]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[48]  D. Mccormick Sequence the Human Genome , 1986, Bio/Technology.