Biostatistics and Bioinformatics in Clinical Trials

Abstract Biostatistics and bioinformatics pertain to the acquisition and interpretation of quantitative information in medical research. Both disciplines involve data analysis and experimental design. No sharp delineation exists between the two, but bioinformatics tends to deal with data in many dimensions, so-called “big data,” such as in genomics. Both disciplines are concerned with conducting inferences and measuring evidence on the basis of observed data, and thus use similar tools and methodology. Relative to bioinformatics, biostatistics is an old discipline. However, the biostatistical aspects of this chapter are modern and describe responses to challenges arising from recent advances in cancer biology. The goal of the chapter is to convey an understanding of new approaches in clinical trial design and analysis.

[1]  Donald A. Berry,et al.  Statistical Innovations in Cancer Research , 2003 .

[2]  Shuguang Huang,et al.  Comparison of false discovery rate methods in identifying genes with differential expression. , 2005, Genomics.

[3]  Soonmyung Paik,et al.  Use of archived specimens in evaluation of prognostic and predictive biomarkers. , 2009, Journal of the National Cancer Institute.

[4]  D. Berry,et al.  Cancer and Leukemia Group B Pathology Committee guidelines for tissue microarray construction representing multicenter prospective clinical trial tissues. , 2011, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[5]  Richard Simon,et al.  The Use of Genomics in Clinical Trial Design , 2008, Clinical Cancer Research.

[6]  Scott M. Berry A Statistician Reads the Sports Pages , 2003 .

[7]  K. Coombes,et al.  Deriving chemosensitivity from cell lines: Forensic bioinformatics and reproducible research in high-throughput biology , 2009, 1010.1092.

[8]  Christian A. Rees,et al.  Distinctive gene expression patterns in human mammary epithelial cells and breast cancers. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[9]  D. Rennie,et al.  The STARD statement for reporting studies of diagnostic accuracy: explanation and elaboration. , 2003, Annals of internal medicine.

[10]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[11]  S. P. Fodor,et al.  Multiplexed biochemical assays with biological chips , 1993, Nature.

[12]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[13]  M S Pepe,et al.  Phases of biomarker development for early detection of cancer. , 2001, Journal of the National Cancer Institute.

[14]  Thomas D. Schmittgen,et al.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. , 2001, Methods.

[15]  E. Petricoin,et al.  Use of proteomic patterns in serum to identify ovarian cancer , 2002, The Lancet.

[16]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[17]  L. Penland,et al.  Use of a cDNA microarray to analyse gene expression patterns in human cancer , 1996, Nature Genetics.

[18]  D. Berry Bayesian approaches for comparative effectiveness research , 2012, Clinical trials.

[19]  Kevin Coombes,et al.  Prognostic Role of a Multigene Reverse Transcriptase-PCR Assay in Patients with Node-Negative Breast Cancer Not Receiving Adjuvant Systemic Therapy , 2005, Clinical Cancer Research.

[20]  R. Peng Reproducible Research in Computational Science , 2011, Science.

[21]  Jeffrey S. Morris,et al.  Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments , 2004, Bioinform..

[22]  K. Coombes,et al.  What information should be required to support clinical "omics" publications? , 2011, Clinical chemistry.

[23]  D. Berry,et al.  Reports from the 2010 Clinical and Translational Cancer Research Think Tank Meeting: Design Strategies for Personalized Therapy Trials , 2012, Clinical Cancer Research.

[24]  J. Mesirov,et al.  GenePattern 2.0 , 2006, Nature Genetics.

[25]  James O. Berger,et al.  Statistical Analysis and the Illusion of Objectivity , 1988 .

[26]  V. Ambros,et al.  The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14 , 1993, Cell.

[27]  G. Omenn,et al.  Evolution of Translational Omics: Lessons Learned and the Path Forward , 2013 .

[28]  Stan Pounds,et al.  Estimating the Occurrence of False Positives and False Negatives in Microarray Studies by Approximating and Partitioning the Empirical Distribution of P-values , 2003, Bioinform..

[29]  Min Zhan,et al.  A data review and re-assessment of ovarian cancer serum proteomic profiling , 2003, BMC Bioinformatics.

[30]  Chris F. Taylor,et al.  Further Steps in Standardisation Report of the Second Annual Proteomics Standards Initiative Spring Workshop (Siena, Italy 17–20th April 2005) , 2005, Proteomics.

[31]  Jannik N. Andersen,et al.  Cancer genomics: from discovery science to personalized medicine , 2011, Nature Medicine.

[32]  Diane D. Liu,et al.  Bayesian clinical trials at the University of Texas M. D. Anderson Cancer Center , 2009, Clinical trials.

[33]  David L. Donoho,et al.  WaveLab and Reproducible Research , 1995 .

[34]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[35]  Edward S. Kim,et al.  Bayesian adaptive design for targeted therapy development in lung cancer — a step toward personalized medicine , 2008, Clinical trials.

[36]  S. Stigler Gauss and the Invention of Least Squares , 1981 .

[37]  G. S. Johnson,et al.  An Information-Intensive Approach to the Molecular Pharmacology of Cancer , 1997, Science.

[38]  Robert Gentleman,et al.  Statistical Applications in Genetics and Molecular Biology , 2005 .

[39]  M. Cronin,et al.  Gene expression and benefit of chemotherapy in women with node-negative, estrogen receptor-positive breast cancer. , 2006, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[40]  Ana Kozomara,et al.  miRBase: integrating microRNA annotation and deep-sequencing data , 2010, Nucleic Acids Res..

[41]  David M. Simcha,et al.  Tackling the widespread and critical impact of batch effects in high-throughput data , 2010, Nature Reviews Genetics.

[42]  Stijn van Dongen,et al.  miRBase: microRNA sequences, targets and gene nomenclature , 2005, Nucleic Acids Res..

[43]  D. Berry Multiplicities in cancer research: ubiquitous and necessary evils. , 2012, Journal of the National Cancer Institute.

[44]  Chunyu Liu,et al.  Removing Batch Effects in Analysis of Expression Microarray Data: An Evaluation of Six Batch Adjustment Methods , 2011, PloS one.

[45]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[46]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Edward S. Kim,et al.  The BATTLE trial: personalizing therapy for lung cancer. , 2011, Cancer discovery.

[48]  L. Lim,et al.  An Abundant Class of Tiny RNAs with Probable Regulatory Roles in Caenorhabditis elegans , 2001, Science.

[49]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[50]  D. Ransohoff,et al.  Sources of bias in specimens for research about molecular markers for cancer. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[51]  D. Berry Adaptive clinical trials: the promise and the caution. , 2011, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[52]  A. Olshen,et al.  Differential exoprotease activities confer tumor-specific serum peptidome patterns. , 2005, The Journal of clinical investigation.

[53]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[54]  R. Simon,et al.  Evaluating the Efficiency of Targeted Designs for Randomized Clinical Trials , 2004, Clinical Cancer Research.

[55]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[56]  M. Cronin,et al.  A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. , 2004, The New England journal of medicine.

[57]  M. Bittner,et al.  Gene expression profiling of alveolar rhabdomyosarcoma with cDNA microarrays. , 1998, Cancer research.

[58]  D. Berry,et al.  I‐SPY 2: An Adaptive Breast Cancer Trial Design in the Setting of Neoadjuvant Chemotherapy , 2009, Clinical pharmacology and therapeutics.

[59]  C. Croce,et al.  Frequent deletions and down-regulation of micro- RNA genes miR15 and miR16 at 13q14 in chronic lymphocytic leukemia , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[60]  D. Berry Adaptive clinical trials in oncology , 2012, Nature Reviews Clinical Oncology.

[61]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[62]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[63]  W. Sauerbrei,et al.  Reporting recommendations for tumor marker prognostic studies (REMARK). , 2005, Journal of the National Cancer Institute.

[64]  R. Tibshirani,et al.  Empirical bayes methods and false discovery rates for microarrays , 2002, Genetic epidemiology.

[65]  Daniel J Sargent,et al.  Clinical trial designs for predictive biomarker validation: theoretical considerations and practical challenges. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[66]  Boris Freidlin,et al.  Randomized clinical trials with biomarkers: design issues. , 2010, Journal of the National Cancer Institute.

[67]  Cheng Cheng,et al.  Improving false discovery rate estimation , 2004, Bioinform..

[68]  Use of randomisation in the Medical Research Council's clinical trial of streptomycin in pulmonary tuberculosis in the 1940s , 1998, BMJ.

[69]  D Moher,et al.  The CONSORT statement: revised recommendations for improving the quality of reports of parallel-group randomized trials. , 2001, Annals of internal medicine.

[70]  V. Ambros,et al.  An Extensive Class of Small RNAs in Caenorhabditis elegans , 2001, Science.

[71]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[72]  L. Chin,et al.  Making sense of cancer genomic data. , 2011, Genes & development.

[73]  T. Tuschl,et al.  Identification of Novel Genes Coding for Small Expressed RNAs , 2001, Science.

[74]  Hugues Bersini,et al.  Batch effect removal methods for microarray gene expression data integration: a survey , 2013, Briefings Bioinform..

[75]  Rafael A Irizarry,et al.  Frozen robust multiarray analysis (fRMA). , 2010, Biostatistics.

[76]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[77]  Douglas G. Altman,et al.  Reporting Recommendations for Tumor Marker Prognostic Studies (REMARK): Explanation and Elaboration , 2012, PLoS medicine.

[78]  D. Bernoulli Exposition of a New Theory on the Measurement of Risk , 1954 .

[79]  K. Baggerly Disclose all data in publications. , 2010, Nature.

[80]  Sarah R. Edmonson,et al.  High-resolution serum proteomic patterns for ovarian cancer detection. , 2004, Endocrine-related cancer.

[81]  D. Berry,et al.  From drug discovery to biomarker-driven clinical trials in lymphoma , 2012, Nature Reviews Clinical Oncology.

[82]  J. Matthews,et al.  Quantification and the Quest for Medical Certainty , 1995 .

[83]  J Jack Lee,et al.  Bayesian adaptive randomization designs for targeted agent development , 2008, Clinical trials.