Genomic analysis of the domestication and post-Spanish conquest evolution of the llama and alpaca

Background: Despite their regional economic importance and being increasingly reared globally, the origins and evolution of the llama and alpaca remain poorly understood. Here we report reference genomes for the llama, and for the guanaco and vicuña (their putative wild progenitors), compare these with the published alpaca genome, and resequence seven individuals of all four species to better understand domestication and introgression between the llama and alpaca.

Results: Phylogenomic analysis confirms that the llama was domesticated from the guanaco and the alpaca from the vicuña. Introgression was much higher in the alpaca genome (36%) than in the llama genome (5%) and could be dated close to the time of the Spanish conquest, approximately 500 years ago. Introgression patterns are at their most variable on the X chromosome of the alpaca, featuring 53 genes known to have deleterious X-linked phenotypes in humans. Strong genome-wide introgression signatures include olfactory receptor complexes into both species, hypertension resistance into the alpaca, and fleece/fiber traits into the llama. Genomic signatures of domestication in the llama include male reproductive traits, while those in the alpaca include fleece characteristics and olfaction-related and hypoxia adaptation traits. Expression analysis of the introgressed region that is syntenic to human HSA4q21, a gene cluster previously associated with hypertension in humans under hypoxic conditions, shows a previously undocumented role for PRDM8 downregulation as a potential transcriptional regulation mechanism, analogous to that previously reported at high altitude for hypoxia-inducible factor 1α.

Conclusions: The unprecedented introgression signatures within both domestic camelid genomes may reflect post-conquest changes in agriculture and the breakdown of traditional management practices.
