NanoStringBioNet: Integrated R framework for bioscience knowledge discovery from NanoString nCounter data

The NanoString nCounter Analysis System is a medium-throughput gene expression quantification technique that is becoming increasingly popular in the fields of immunology and oncology due to its ease of use and sensitivity, particularly in the analysis of formalin-fixed paraffin embedded samples. Despite the growing interest in NanoString, systematic analysis frameworks for the reproducible analysis of nCounter data remain limited. NanoStringBioNet is a pair of R packages that form a semi-automatic, open source framework for integrative and systematic knowledge discovery from nCounter datasets. Using the NSData module, NanoStringBioNet preprocesses a raw NanoString dataset and stores it in Biobase format for sharability. Subsequently, the NSFunc module performs downstream analyses such as enrichment and network inference of stable differentially expressed gene clusters by leveraging existing data analysis tools and custom script.

[1]  Nader Pourmand,et al.  NanoStriDE: normalization and differential expression analysis of NanoString nCounter data , 2011, BMC Bioinformatics.

[2]  Jennifer L. Osborn,et al.  Direct multiplexed measurement of gene expression with color-coded probe pairs , 2008, Nature Biotechnology.

[3]  Etienne Z. Gnimpieba,et al.  Therapeutic Targeting of PTK7 is Cytotoxic in Atypical Teratoid Rhabdoid Tumors , 2017, Molecular Cancer Research.

[4]  Wolfgang Huber,et al.  analysis of count data { the DESeq2 package , 2015 .

[5]  Mia Hubert,et al.  Clustering in an object-oriented environment , 1997 .

[6]  H. Tesch,et al.  The West German Study Group Breast Cancer Intrinsic Subtype study: a prospective multicenter decision impact study utilizing the Prosigna assay for adjuvant treatment decision-making in estrogen-receptor-positive, HER2-negative early-stage breast cancer , 2016, Current medical research and opinion.

[7]  Yan Li,et al.  DEApp: an interactive web interface for differential expression analysis of next generation sequence data , 2017, Source Code for Biology and Medicine.

[8]  A. Stromberg,et al.  NanoStringDiff: a novel statistical method for differential expression analysis based on NanoString nCounter data , 2016, Nucleic acids research.

[9]  Takahiro Kanagawa,et al.  Bias and artifacts in multitemplate polymerase chain reactions (PCR). , 2003, Journal of bioscience and bioengineering.

[10]  Roland Eils,et al.  Complex heatmaps reveal patterns and correlations in multidimensional genomic data , 2016, Bioinform..

[11]  C. Hennig,et al.  Dissolution point and isolation robustness: Robustness criteria for general cluster analysis methods , 2008 .

[12]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[13]  Michael I. Love,et al.  Differential analysis of count data – the DESeq2 package , 2013 .

[14]  L. Waldron,et al.  mRNA transcript quantification in archival samples using multiplexed, color-coded probes , 2011, BMC biotechnology.

[15]  Ricardo J. G. B. Campello,et al.  On the selection of appropriate distances for gene expression data clustering , 2014, BMC Bioinformatics.

[16]  Peter J. Rousseeuw,et al.  Clustering Large Applications (Program CLARA) , 2008 .

[17]  Carol Lushbough,et al.  Bio-TDS: bioscience query tool discovery system , 2016, Nucleic Acids Res..

[18]  Christian Hennig,et al.  Cluster-wise assessment of cluster stability , 2007, Comput. Stat. Data Anal..

[19]  Charlotte Soneson,et al.  compcodeR - an R package for benchmarking differential expression methods for RNA-seq data , 2014, Bioinform..

[20]  Malika Charrad,et al.  NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set , 2014 .

[21]  Damian Szklarczyk,et al.  STITCH 5: augmenting protein–chemical interaction networks with tissue and affinity data , 2015, Nucleic Acids Res..

[22]  Paul C. Boutros,et al.  NanoStringNorm: an extensible R package for the pre-processing of NanoString mRNA and miRNA data , 2012, Bioinform..

[23]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .