Comprehensive phylogeny of ray-finned fishes (Actinopterygii) based on transcriptomic and genomic data

Significance

Ray-finned fishes form the largest and most diverse group of vertebrates. Establishing their phylogenetic relationships is a critical step toward explaining their diversity. We compiled the largest comparative genomic database of fishes, which provides genome-scale support for previous phylogenetic results, and used it to further resolve some contentious relationships in fish phylogeny. A vetted set of exon markers identified in this study is a promising resource for current sequencing approaches to significantly increase genetic and taxonomic coverage and so resolve the tree of life for all fishes. Our time-calibrated analysis suggests that most lineages of living fishes were already established in the Mesozoic Era, more than 65 million years ago.

Abstract

Our understanding of phylogenetic relationships among bony fishes has been transformed by analysis of a small number of genes, but uncertainty remains around critical nodes. Genome-scale inferences so far have sampled a limited number of taxa and genes. Here we leverage 144 genomes and 159 transcriptomes to investigate fish evolution with an unparalleled scale of data: >0.5 Mb from 1,105 orthologous exon sequences from 303 species, representing 66 out of 72 ray-finned fish orders. We apply phylogenetic tests designed to trace the effect of whole-genome duplication events on gene trees and use a bioinformatics approach to find paralogy-free loci. Genome-wide data support the structure of the fish phylogeny. Hypothesis-testing procedures appropriate for phylogenomic datasets, based on explicit gene genealogy interrogation, settle some long-standing uncertainties, such as the branching order at the base of the teleosts and among early euteleosts, and the sister lineage to the acanthomorph and percomorph radiations. Comprehensive fossil calibrations date the origin of all major fish lineages before the end of the Cretaceous.
