Comparative genomic analysis identifies great-ape-specific structural variants and their evolutionary significance

During the origin of great apes about 14 million years ago, a series of phenotypic innovations emerged, such as the increased body size, the enlarged brain volume, the improved cognitive skill and the diversified diet. Yet, the genomic basis of these evolutionary changes remains unclear. Utilizing the high-quality genome assemblies of great apes (including human), gibbon and macaque, we conducted comparative genome analyses, and identified 15,885 great-ape-specific structural variants (GSSVs), including 8 coding GSSVs resulting in the creation of novel proteins (e.g. ACAN and CMYA5). Functional annotations of the GSSV-related genes revealed the enrichment of genes involved development and morphogenesis, especially neurogenesis and neural network formation, suggesting the potential role of GSSVs in shaping the great-ape-shared traits. Further dissection of the brain-related GSSVs show great-ape-specific changes of enhancer activities and gene expression in the brain, involving a group of GSSV-regulated genes (such as NOL3) that potentially contribute to the altered brain development and function in great apes. The presented data highlights the evolutionary role of structural variants in the phenotypic innovations during the origin of the great lineage.

[1]  C. Andersen,et al.  Transposon-activated POU5F1B promotes colorectal cancer growth and metastasis , 2022, Nature Communications.

[2]  K. Kutsche,et al.  Cardiofacioneurodevelopmental syndrome: Report of a novel patient and expansion of the phenotype , 2022, American journal of medical genetics. Part A.

[3]  Zhuxi Li,et al.  Identification of novel ACAN mutations in two Chinese families and genotype–phenotype correlation in patients with 74 pathogenic ACAN variations , 2021, Molecular genetics & genomic medicine.

[4]  Xiaochen Bo,et al.  clusterProfiler 4.0: A universal enrichment tool for interpreting omics data , 2021, Innovation.

[5]  Ian T. Fiddes,et al.  Sequence diversity analyses of an improved rhesus macaque genome enhance its biomedical utility , 2020, Science.

[6]  M. Suntsova,et al.  Differences between human and chimpanzee genomes and their implications in gene expression, protein functions and biochemical properties of the two species , 2020, BMC Genomics.

[7]  Heng Li,et al.  Higher rates of processed pseudogene acquisition in humans and three great apes revealed by long read assemblies , 2020, bioRxiv.

[8]  Eric S. Lander,et al.  Mapping and characterization of structural variation in 17,795 human genomes , 2020, Nature.

[9]  R. Pfundt,et al.  De novo CLTC variants are associated with a variable phenotype from mild to severe intellectual disability, microcephaly, hypoplasia of the corpus callosum, and epilepsy , 2019, Genetics in Medicine.

[10]  J. Infante,et al.  POLR3A-related spastic ataxia: new mutations and a look into the phenotype , 2019, Journal of Neurology.

[11]  S. Scherer,et al.  Pathogenic WDFY3 variants cause neurodevelopmental disorders and opposing effects on brain size. , 2019, Brain : a journal of neurology.

[12]  Zev N. Kronenberg,et al.  Long-read assembly of the Chinese rhesus macaque genome and identification of ape-specific structural variants , 2019, Nature Communications.

[13]  Yuichiro Watanabe,et al.  Updated meta‐analysis of CMYA5 rs3828611 and rs4704591 with schizophrenia in Asian populations , 2018, Early intervention in psychiatry.

[14]  S. Mundlos,et al.  Structural variation in the 3D genome , 2018, Nature Reviews Genetics.

[15]  David Haussler,et al.  High-resolution comparative analysis of great ape genomes , 2018, Science.

[16]  G. Drin,et al.  Identification of MOSPD2, a novel scaffold for endoplasmic reticulum membrane contact sites , 2018, EMBO reports.

[17]  Lei Zhan,et al.  POU5F1B promotes hepatocellular carcinoma proliferation by activating AKT. , 2018, Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie.

[18]  Heli Liu,et al.  Structural basis of SALM5-induced PTPδ dimerization for synaptic differentiation , 2018, Nature Communications.

[19]  S. Fukai,et al.  Structural basis of trans-synaptic interactions between PTPδ and SALMs for inducing synapse formation , 2018, Nature Communications.

[20]  M. Gerstein,et al.  Molecular and cellular reorganization of neural circuits in the human lineage , 2017, Science.

[21]  Kyungjae Myung,et al.  Ring finger protein 126 (RNF126) suppresses ionizing radiation–induced p53-binding protein 1 (53BP1) focus formation , 2017, The Journal of Biological Chemistry.

[22]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[23]  Yiping Shen,et al.  Novel pathogenic ACAN variants in non-syndromic short stature patients. , 2017, Clinica chimica acta; international journal of clinical chemistry.

[24]  Yanmei Zhang,et al.  Ring finger protein 43 associates with gastric cancer progression and attenuates the stemness of gastric cancer stem-like cells via the Wnt-β/catenin signaling pathway , 2017, Stem Cell Research & Therapy.

[25]  Xin Li,et al.  The impact of structural variation on human gene expression , 2016, Nature Genetics.

[26]  C. Mroske,et al.  CLTC as a clinically novel gene associated with multiple malformations and developmental delay , 2016, American journal of medical genetics. Part A.

[27]  David Haussler,et al.  Long-read sequence assembly of the gorilla genome , 2016, Science.

[28]  M. Creyghton,et al.  Epigenomic annotation of gene regulatory alterations during evolution of the primate brain , 2016, Nature Neuroscience.

[29]  A. Zihlman,et al.  Body composition in Pan paniscus compared with Homo sapiens has implications for changes during human evolution , 2015, Proceedings of the National Academy of Sciences.

[30]  Mark J. P. Chaisson,et al.  Resolving the complexity of the human genome using single-molecule sequencing , 2014, Nature.

[31]  R. Barton,et al.  Rapid Evolution of the Cerebellum in Humans and Other Great Apes , 2014, Current Biology.

[32]  Andrew S. Burrell,et al.  Primate phylogenetic relationships and divergence dates inferred from complete mitochondrial genomes. , 2014, Molecular phylogenetics and evolution.

[33]  Paolo Vineis,et al.  Genome-wide association study yields variants at 20p12.2 that associate with urinary bladder cancer. , 2014, Human molecular genetics.

[34]  Vineet Bafna,et al.  Amplification and thrifty single-molecule sequencing of recurrent somatic structural variations , 2014, Genome research.

[35]  M. Rosenfeld,et al.  Zebrafish Ciliopathy Screen Plus Human Mutational Analysis Identifies C21orf59 and CCDC65 Defects as Causing Primary Ciliary Dyskinesia. , 2013, American journal of human genetics.

[36]  D. Astuti,et al.  Perlman Syndrome: Overgrowth, Wilms Tumor Predisposition and DIS3L2 , 2013, American journal of medical genetics. Part C, Seminars in medical genetics.

[37]  Jan O. Korbel,et al.  Phenotypic impact of genomic structural variation: insights from and for human disease , 2013, Nature Reviews Genetics.

[38]  D. Geschwind,et al.  Familial cortical myoclonus with a mutation in NOL3 , 2012, Annals of neurology.

[39]  T. Arendt,et al.  Aggrecan: Beyond cartilage and into the brain. , 2012, The international journal of biochemistry & cell biology.

[40]  D. Rujescu,et al.  GWA study data mining and independent replication identify cardiomyopathy-associated 5 (CMYA5) as a risk gene for schizophrenia , 2011, Molecular Psychiatry.

[41]  William Stafford Noble,et al.  FIMO: scanning for occurrences of a given motif , 2011, Bioinform..

[42]  P. Stankiewicz,et al.  Structural variation in the human genome and its role in disease. , 2010, Annual review of medicine.

[43]  Steven Gallinger,et al.  Genome-wide association scan identifies a colorectal cancer susceptibility locus on chromosome 8q24 , 2007, Nature Genetics.

[44]  Jin Han,et al.  Oct4 pseudogenes are transcribed in cancers. , 2005, Biochemical and biophysical research communications.

[45]  Karl Zilles,et al.  Expansion of the neocerebellum in Hominoidea. , 2003, Journal of human evolution.

[46]  A. Zihlman,et al.  Body mass in lowland gorillas: a quantitative analysis. , 2000, American journal of physical anthropology.

[47]  H. Watanabe,et al.  Roles of aggrecan, a large chondroitin sulfate proteoglycan, in cartilage structure and function. , 1998, Journal of biochemistry.

[48]  D. Chivers Measuring food intake in wild animals: primates , 1998, Proceedings of the Nutrition Society.

[49]  R. J. Smith,et al.  Body mass in comparative primatology. , 1997, Journal of human evolution.

[50]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[51]  Gerhard Roth,et al.  Evolution of the brain and intelligence in primates. , 2012, Progress in brain research.

[52]  D. Alba Cognitive inferences in fossil apes (Primates, Hominoidea): does encephalization reflect intelligence? , 2010, Journal of anthropological sciences = Rivista di antropologia : JASS.

[53]  J. Rilling,et al.  Evolution of the Brain, in Humans – Paleoneurology , 2009 .

[54]  F. Mertens,et al.  The POU5F1P1 pseudogene encodes a putative protein similar to POU5F1 isoform 1. , 2008, Oncology reports.

[55]  S. Ward,et al.  Origin of the hominidae : the record of African large hominoid evolution between 14 my and 4 my , 1988 .

[56]  B. P. Wheatley The evolution of large body size in orangutans: A model for hominoid divergence , 1987, American journal of primatology.