Simera: Modelling the PCR Process to Simulate Realistic Chimera Formation

Polymerase Chain Reaction (PCR) is the principal method of amplifying target DNA regions and, as such, is of great importance when performing microbial diversity studies. An unfortunate side effect of PCR is the formation of unwanted byproducts such as chimeras. The main goal of the work covered in this article is the development of an algorithm that simulates realistic chimeras for use in the evaluation of chimera detection software and for investigations into the accuracy of community structure analyses. Experimental data has helped to identify factors which may cause the formation of chimeras and has provided evidence of how influential these factors can be. This article makes use of some of this evidence in order to build a model with which to simulate the PCR process. This model helps to better explain the formation of chimeras and is therefore able to provide aid to future studies that intend to use PCR.

[1]  Marc‐David Cohen Pseudo‐Random Number Generators , 2006 .

[2]  E. Rubin,et al.  A mathematical model and a computerized simulation of PCR using complex templates. , 1996, Nucleic acids research.

[3]  John W. Emerson,et al.  Nonparametric Goodness-of-Fit Tests for Discrete Null Distributions , 2011, R J..

[4]  Christian L. Lauber,et al.  PrimerProspector: de novo design and taxonomic analysis of barcoded polymerase chain reaction primers , 2011, Bioinform..

[5]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[6]  C. Quince,et al.  Sample richness and genetic diversity as drivers of chimera formation in nSSU metagenetic analyses , 2012, Nucleic acids research.

[7]  Florent E. Angly,et al.  Grinder: a versatile amplicon and shotgun sequence simulator , 2012, Nucleic acids research.

[8]  D. C. Sullivan,et al.  PCR , 1989, Cell.

[9]  F. Massey The Kolmogorov-Smirnov Test for Goodness of Fit , 1951 .

[10]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[11]  Russell J. Davenport,et al.  Removing Noise From Pyrosequenced Amplicons , 2011, BMC Bioinformatics.

[12]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[13]  François Pompanon,et al.  An In silico approach for the evaluation of DNA barcodes , 2010, BMC Genomics.

[14]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[15]  Rob Knight,et al.  UCHIME improves sensitivity and speed of chimera detection , 2011, Bioinform..