Fur: Find unique genomic regions for diagnostic PCR

Abstract Motivation Unique marker sequences are highly sought after in molecular diagnostics. Nevertheless, there are only few programs available to search for marker sequences, compared to the many programs for similarity search. We therefore wrote the program Fur for Finding Unique genomic Regions. Results Fur takes as input a sample of target sequences and a sample of closely related neighbors. It returns the regions present in all targets and absent from all neighbors. The recently published program genmap can also be used for this purpose and we compared it to fur. When analyzing a sample of 33 genomes representing the major phylogroups of E.coli, fur was 40 times faster than genmap but used three times more memory. On the other hand, genmap yielded three times more markers, but they were less accurate when tested in silico on a sample of 237 E.coli genomes. We also designed phylogroup-specific PCR primers based on the markers proposed by genmap and fur, and tested them by analyzing their virtual amplicons in GenBank. Finally, we used fur to design primers specific to a Lactobacillus species, and found excellent sensitivity and specificity in vitro. Availability and implementation Fur sources and documentation are available from https://github.com/evolbioinf/fur. The compiled software is posted as a docker container at https://hub.docker.com/r/haubold/fox. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  E. Denamur,et al.  ClermonTyping: an easy-to-use and accurate in silico method for Escherichia genus strain phylotyping , 2018, Microbial genomics.

[2]  Bernhard Haubold,et al.  High-complexity regions in mammalian genomes are enriched for developmental genes , 2018, Bioinform..

[3]  Bernhard Haubold,et al.  Phylonium: fast estimation of evolutionary distances from large samples of similar genomes , 2019, Bioinform..

[4]  B. Faircloth,et al.  Primer3—new capabilities and interfaces , 2012, Nucleic acids research.

[5]  Richard R. Hudson,et al.  Generating samples under a Wright-Fisher neutral model of genetic variation , 2002, Bioinform..

[6]  C. Fraser,et al.  Temporal Variability of Escherichia coli Diversity in the Gastrointestinal Tracts of Tanzanian Children with and without Exposure to Antibiotics , 2018, mSphere.

[7]  G. Reid,et al.  Vaginal Microbiota and the Use of Probiotics , 2009, Interdisciplinary perspectives on infectious diseases.

[8]  K. Lemuth,et al.  DNA Microarray for Genotyping Antibiotic Resistance Determinants in Acinetobacter baumannii Clinical Isolates , 2013, Antimicrobial Agents and Chemotherapy.

[9]  Jean M. Macklaim,et al.  At the crossroads of vaginal health and disease, the genome sequence of Lactobacillus iners AB-1 , 2010, Proceedings of the National Academy of Sciences.

[10]  O. Clermont,et al.  Rapid and Simple Determination of theEscherichia coli Phylogenetic Group , 2000, Applied and Environmental Microbiology.

[11]  Knut Reinert,et al.  GenMap: ultra-fast computation of genome mappability , 2020, Bioinform..