Enriching Nanomaterials Omics Data: An Integration Technique to Generate Biological Descriptors

The interest toward omics data is growing in the field of toxicology owing to the diverse knowledge they generate, which can improve prediction and dosage profiling for more accurate safety assessment. An integration methodology is presented where high-throughput omics data are enriched with biological-pathway information to produce a novel set of biological (BIO) descriptors by decomposing omics data to meaningful clusters in terms of both their mechanistic interpretation and correlation affinity. A generalized simulated annealing algorithm is employed to estimate the optimal partition of the enriched data and accordingly produce novel descriptors based on gene content similarity. BIO descriptors are characterized by the pathway information fused to the data; thereby, they refer to groups of genes with similar biological implications rather than specific genes, which could vary across studies. The methodology is applied to an extensive proteomics data set and demonstrates that BIO descriptors are beneficial for modeling prediction, outperforming the prediction accuracy of the original omics data, and offering a readily available biological interpretation of the findings.

[1]  Krister Wennerberg,et al.  A transcriptomics data-driven gene space accurately predicts liver cytopathology and drug-induced liver injury , 2017, Nature Communications.

[2]  Yang Xiang,et al.  Generalized Simulated Annealing for Global Optimization: The GenSA Package , 2013, R J..

[3]  Morteza Mahmoudi,et al.  Protein-Nanoparticle Interactions , 2013 .

[4]  Reinhard Kreiling,et al.  A decision-making framework for the grouping and testing of nanomaterials (DF4nanoGrouping). , 2015, Regulatory toxicology and pharmacology : RTP.

[5]  Timothy E H Allen,et al.  Defining molecular initiating events in the adverse outcome pathway framework for risk assessment. , 2014, Chemical research in toxicology.

[6]  P. Dixon VEGAN, a package of R functions for community ecology , 2003 .

[7]  W. Chan,et al.  Prediction of nanoparticles-cell association based on corona proteins and physicochemical properties. , 2015, Nanoscale.

[8]  Ola Spjuth,et al.  Toward the Replacement of Animal Experiments through the Bioinformatics-driven Analysis of ‘Omics’ Data from Human Cell Cultures , 2015, Alternatives to laboratory animals : ATLA.

[9]  Hans C van Houwelingen,et al.  Integration of gene ontology pathways with North American Rheumatoid Arthritis Consortium genome-wide association data via linear modeling , 2009, BMC proceedings.

[10]  Shu-Dong Zhang,et al.  Application of connectivity mapping in predictive toxicology based on gene-expression similarity. , 2010, Toxicology.

[11]  N. Araki,et al.  An Integrated Approach of Differential Mass Spectrometry and Gene Ontology Analysis Identified Novel Proteins Regulating Neuronal Differentiation and Survival* , 2009, Molecular & Cellular Proteomics.

[12]  Ola Spjuth,et al.  Cancer biology, toxicology and alternative methods development go hand-in-hand. , 2014, Basic & clinical pharmacology & toxicology.

[13]  H. Bickeböller,et al.  Inclusion of a priori information in genome‐wide association analysis , 2009, Genetic epidemiology.

[14]  Blanca Suarez-Merino,et al.  High throughput toxicity screening and intracellular detection of nanomaterials , 2016, Wiley interdisciplinary reviews. Nanomedicine and nanobiotechnology.

[15]  Stefan Tenzer,et al.  Protein corona of nanoparticles: distinct proteins regulate the cellular uptake. , 2015, Biomacromolecules.

[16]  Egon L. Willighagen,et al.  RRegrs: an R package for computer-aided model selection with multiple regression models , 2015, Journal of Cheminformatics.

[17]  Sharon Munn,et al.  The Adverse Outcome Pathway approach in nanotoxicology , 2017 .

[18]  Kevin C. Dorff,et al.  The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models , 2010, Nature Biotechnology.

[19]  Pornpimol Charoentong,et al.  ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks , 2009, Bioinform..

[20]  G. Nienhaus,et al.  Toward a molecular understanding of nanoparticle–protein interactions , 2012, Biophysical Reviews.

[21]  N. Demaurex,et al.  The role of calcium signaling in phagocytosis , 2010, Journal of leukocyte biology.

[22]  Andrew Emili,et al.  Protein corona fingerprinting predicts the cellular interaction of gold and silver nanoparticles. , 2014, ACS nano.

[23]  Candace C. Fleischer,et al.  Nanoparticle–Cell Interactions: Molecular Structure of the Protein Corona and Cellular Outcomes , 2014, Accounts of chemical research.

[24]  M. Daha,et al.  Mini‐review: A pivotal role for innate immunity in the clearance of apoptotic cells , 2004, European journal of immunology.

[25]  Susan C Connor,et al.  Integration of metabolomics and transcriptomics data to aid biomarker discovery in type 2 diabetes. , 2010, Molecular bioSystems.

[26]  M. Gevrey,et al.  Review and comparison of methods to study the contribution of variables in artificial neural network models , 2003 .

[27]  J. Mesirov,et al.  The Molecular Signatures Database Hallmark Gene Set Collection , 2015 .

[28]  Thomas Hartung,et al.  Green toxicology. , 2014, ALTEX.

[29]  Melvin E. Andersen,et al.  Incorporating New Technologies Into Toxicity Testing and Risk Assessment: Moving From 21st Century Vision to a Data-Driven Framework , 2013, Toxicological sciences : an official journal of the Society of Toxicology.

[30]  Janet M. Allen,et al.  C‐reactive protein‐mediated phagocytosis and phospholipase D signalling through the high‐affinity receptor for immunoglobulin G (FcγRI) , 2002 .

[31]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[32]  Alexander R. Pico,et al.  WikiPathways: Pathway Editing for the People , 2008, PLoS biology.

[33]  Hua-Zhong Ying,et al.  Transcriptome profiling and pathway analysis of hepatotoxicity induced by tris (2-ethylhexyl) trimellitate (TOTM) in mice. , 2016, Environmental toxicology and pharmacology.

[34]  Aedín C. Culhane,et al.  iBBiG: iterative binary bi-clustering of gene sets , 2012, Bioinform..

[35]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[36]  Justin Guinney,et al.  GSVA: gene set variation analysis for microarray and RNA-Seq data , 2013, BMC Bioinformatics.

[37]  Raymond R Tice,et al.  Intersection of toxicogenomics and high throughput screening in the Tox21 program: an NIEHS perspective. , 2016, International journal of biotechnology.

[38]  Iseult Lynch,et al.  Protein-nanoparticle interactions: What does the cell see? , 2009, Nature nanotechnology.

[39]  Andrew Williams,et al.  Application of biclustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials , 2015, Beilstein journal of nanotechnology.

[40]  Isidro Cortes-Ciriano,et al.  Prediction of the potency of mammalian cyclooxygenase inhibitors with ensemble proteochemometric modeling , 2015, Journal of Cheminformatics.

[41]  Samuel Kaski,et al.  Probabilistic retrieval and visualization of biologically relevant microarray experiments , 2009, BMC Bioinformatics.

[42]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .