RUCS: rapid identification of PCR primers for unique core sequences

Motivation: Designing PCR primers to target a specific selection of whole genome sequenced strains can be a long, arduous and sometimes impractical task. Such tasks would benefit greatly from an automated tool to both identify unique targets, and to validate the vast number of potential primer pairs for the targets in silico. Results: Here we present RUCS, a program that will find PCR primer pairs and probes for the unique core sequences of a positive genome dataset complement to a negative genome dataset. The resulting primer pairs and probes are in addition to simple selection also validated through a complex in silico PCR simulation. We compared our method, which identifies the unique core sequences, against an existing tool called ssGeneFinder, and found that our method was 6.5‐20 times more sensitive. We used RUCS to design primer pairs that would target a set of genomes known to contain the mcr‐1 colistin resistance gene. Three of the predicted pairs were chosen for experimental validation using PCR and gel electrophoresis. All three pairs successfully produced an amplicon with the target length for the samples containing mcr‐1 and no amplification products were produced for the negative samples. The novel methods presented in this manuscript can reduce the time needed to identify target sequences, and provide a quick virtual PCR validation to eliminate time wasted on ambiguously binding primers. Availability and implementation: Source code is freely available on https://bitbucket.org/genomicepidemiology/rucs. Web service is freely available on https://cge.cbs.dtu.dk/services/RUCS. Contact: mcft@cbs.dtu.dk Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[2]  A. Saha,et al.  Emergence of influenza A (H1N1)pdm09 genogroup 6B and drug resistant virus, India, January to May 2015. , 2016, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[3]  H. Hasman,et al.  WGS-based surveillance of third-generation cephalosporin-resistant Escherichia coli from bloodstream infections in Denmark , 2017, The Journal of antimicrobial chemotherapy.

[4]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[5]  Christina Backes,et al.  An integer linear programming approach for finding deregulated subgraphs in regulatory networks , 2011, Nucleic acids research.

[6]  B. Faircloth,et al.  Primer3—new capabilities and interfaces , 2012, Nucleic acids research.

[7]  Jian Ye,et al.  Primer-BLAST: A tool to design target-specific primers for polymerase chain reaction , 2012, BMC Bioinformatics.

[8]  R. Kaas,et al.  Detection of mcr-1 encoding plasmid-mediated colistin-resistant Escherichia coli isolates from human bloodstream infection and imported chicken meat, Denmark 2015. , 2015, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[9]  K. Yuen,et al.  Automated Pangenomic Analysis in Target Selection for PCR Detection and Identification of Bacteria by Use of ssGeneFinder Webserver and Its Application to Salmonella enterica Serovar Typhi , 2012, Journal of Clinical Microbiology.

[10]  Herman Goossens,et al.  Identification of a novel plasmid-mediated colistin-resistance gene, mcr-2, in Escherichia coli, Belgium, June 2016. , 2016, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[11]  Evan Bolton,et al.  Database resources of the National Center for Biotechnology Information , 2017, Nucleic Acids Res..

[12]  O. Samuilova,et al.  FastPCR: An in silico tool for fast primer and probe design and advanced sequence analysis. , 2017, Genomics.

[13]  Ning Ma,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.