NRE: a tool for exploring neutral loci in the human genome

BackgroundAnalyzing regions of the genome where genetic variation is free from the confounding effects of natural selection is essential for many population genetic studies. Several recent studies in humans have stressed the large effect of natural selection at linked neutral sites and have shown that the choice of putatively neutral regions can have a marked effect on estimates of demographic history.ResultsNRE (Neutral Region Explorer) provides a mechanism for the easy extraction and analysis of nearly neutral regions from the human genome. It can combine many genomic filters, including filters for selection, recombination rate, genetic distance to the nearest gene, percent overlap with annotated regions, and user-provided loci. The program implements a two-step filtering process for greater versatility, allowing users to compile a basic set of neutrality criteria, explore their effect, and use this knowledge to refine filtering. Results can be instantly downloaded in standard formats, along with summary and ranking statistics, or exported to genome browsers such as those from the 1000 Genomes and UCSC. The applicability and value of NRE are demonstrated through an example in the estimation of the ratio of chromosome X-to-autosomal effective population size using different strategies for the selection of neutral regions.ConclusionsThe combined features of NRE make possible the sort of flexible, rigorous mining and analysis of neutral loci increasingly demanded by population genetic studies. NRE is available at http://nre.cb.bscb.cornell.edu.

[1]  D. Haussler,et al.  Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[2]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[3]  H. Munro,et al.  Mammalian protein metabolism , 1964 .

[4]  G. Benson,et al.  Tandem repeats finder: a program to analyze DNA sequences. , 1999, Nucleic acids research.

[5]  M. Nachman,et al.  Gene density and human nucleotide polymorphism. , 2002, Molecular biology and evolution.

[6]  David L. Wheeler,et al.  GenBank: update , 2004, Nucleic Acids Res..

[7]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[8]  H. Ellegren The different levels of genetic diversity in sex chromosomes and autosomes. , 2009, Trends in genetics : TIG.

[9]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[10]  E. Heyer,et al.  Sex‐specific demographic behaviours that shape human genomic variation , 2012, Molecular ecology.

[11]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[12]  Francesca Chiaromonte,et al.  Scoring Pairwise Genomic Sequence Alignments , 2001, Pacific Symposium on Biocomputing.

[13]  D. Haussler,et al.  Human-mouse alignments with BLASTZ. , 2003, Genome research.

[14]  D. Reich,et al.  Human Population Differentiation Is Strongly Correlated with Local Recombination Rate , 2010, PLoS genetics.

[15]  August E. Woerner,et al.  A novel DNA sequence database for analyzing human demographic history. , 2008, Genome research.

[16]  B. Trask,et al.  Segmental duplications: organization and impact within the current human genome project assembly. , 2001, Genome research.

[17]  J. Mullikin,et al.  Nature Genetics: doi:10.1038/ng.303Supplementary Methods , 2022 .

[18]  Xiaofeng Zhu,et al.  The landscape of recombination in African Americans , 2011, Nature.

[19]  L. Feuk,et al.  Detection of large-scale variation in the human genome , 2004, Nature Genetics.

[20]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[21]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[22]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[23]  J. Felsenstein,et al.  Estimators of the human effective sex ratio detect sex biases on different timescales. , 2010, American journal of human genetics.

[24]  A. Gylfason,et al.  Fine-scale recombination rate differences between sexes, populations and individuals , 2010, Nature.

[25]  Ryan D. Hernandez,et al.  Simultaneous inference of selection and population growth from patterns of variation in the human genome , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[26]  S. Jeffery Evolution of Protein Molecules , 1979 .

[27]  Deborah A Nickerson,et al.  Population History and Natural Selection Shape Patterns of Genetic Variation in 132 Genes , 2004, PLoS biology.

[28]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[29]  David Haussler,et al.  The UCSC Known Genes , 2006, Bioinform..

[30]  August E. Woerner,et al.  The ratio of human X chromosome to autosome diversity is positively correlated with genetic distance from genes , 2010, Nature Genetics.

[31]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[32]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[33]  M. Adams,et al.  Recent Segmental Duplications in the Human Genome , 2002, Science.

[34]  P. Green,et al.  Widespread Genomic Signatures of Natural Selection in Hominid Evolution , 2009, PLoS genetics.

[35]  B. Charlesworth,et al.  The effect of recombination on background selection. , 1996, Genetical research.

[36]  D. Haussler,et al.  Aligning multiple genomic sequences with the threaded blockset aligner. , 2004, Genome research.

[37]  L. Feuk,et al.  Development of bioinformatics resources for display and analysis of copy number and other structural variants in the human genome , 2006, Cytogenetic and Genome Research.

[38]  Ryan D. Hernandez,et al.  Classic Selective Sweeps Were Rare in Recent Human Evolution , 2011, Science.