Investigation of the influence of protein corona composition on gold nanoparticle bioactivity using machine learning approaches

Abstract The understanding of the mechanisms and interactions that occur when nanomaterials enter biological systems is important to improve their future use. The adsorption of proteins from biological fluids in a physiological environment to form a corona on the surface of nanoparticles represents a key step that influences nanoparticle behaviour. In this study, the quantitative description of the composition of the protein corona was used to study the effect on cell association induced by 84 surface-modified gold nanoparticles of different sizes. Quantitative relationships between the protein corona and the activity of the gold nanoparticles were modelled by using several machine learning-based linear and non-linear approaches. Models based on a selection of only six serum proteins had robust and predictive results. The Projection Pursuit Regression method had the best performances (r2 = 0.91; Q2loo = 0.81; r2ext = 0.79). The present study confirmed the utility of protein corona composition to predict the bioactivity of gold nanoparticles and identified the main proteins that act as promoters or inhibitors of cell association. In addition, the comparison of several techniques showed which strategies offer the best results in prediction and could be used to support new toxicological studies on gold-based nanomaterials.

[1]  Chang Wook Ahn,et al.  On the practical genetic algorithms , 2005, GECCO '05.

[2]  P. Gramatica,et al.  QSAR classification models for the screening of the endocrine-disrupting activity of perfluorinated compounds , 2012, SAR and QSAR in environmental research.

[3]  Jian Wang,et al.  Nano(Q)SAR: Challenges, pitfalls and perspectives , 2015, Nanotoxicology.

[4]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[5]  Guangchao Chen,et al.  Summary and Analysis of the Currently Existing Literature Data on Metal-based Nanoparticles Published for Selected Aquatic Organisms: Applicability for Toxicity Prediction by (Q)SARs , 2015, Alternatives to laboratory animals : ATLA.

[6]  Andrew Emili,et al.  Protein corona fingerprinting predicts the cellular interaction of gold and silver nanoparticles. , 2014, ACS nano.

[7]  Alaaldin M. Alkilany,et al.  Protein corona: Opportunities and challenges. , 2016, The international journal of biochemistry & cell biology.

[8]  Alexander Tropsha,et al.  k Nearest Neighbors QSAR Modeling as a Variational Problem: Theory and Applications , 2005, J. Chem. Inf. Model..

[9]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[10]  Ana L. Teixeira,et al.  Random forests for feature selection in QSPR Models - an application for predicting standard enthalpy of formation of hydrocarbons , 2013, Journal of Cheminformatics.

[11]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[12]  Haifeng Chen,et al.  Comparative Study of QSAR/QSPR Correlations Using Support Vector Machines, Radial Basis Function Neural Networks, and Multiple Linear Regression , 2004, J. Chem. Inf. Model..

[13]  Morteza Mahmoudi,et al.  Protein-Nanoparticle Interactions , 2013 .

[14]  Alexander Golbraikh,et al.  Quantitative Structure−Activity Relationship Analysis of Functionalized Amino Acid Anticonvulsant Agents Using k Nearest Neighbor and Simulated Annealing PLS Methods , 2002 .

[15]  K. Roy,et al.  Further exploring rm2 metrics for validation of QSPR models , 2011 .

[16]  Igor V. Tetko,et al.  Combinatorial QSAR Modeling of Chemical Toxicants Tested against Tetrahymena pyriformis , 2008, J. Chem. Inf. Model..

[17]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[18]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[19]  Tomoko Niwa,et al.  Using General Regression and Probabilistic Neural Networks To Predict Human Intestinal Absorption with Topological Descriptors Derived from Two-Dimensional Chemical Structures , 2003, J. Chem. Inf. Comput. Sci..

[20]  Donald F. Specht,et al.  A general regression neural network , 1991, IEEE Trans. Neural Networks.

[21]  Peixun Liu,et al.  Current Mathematical Methods Used in QSAR/QSPR Studies , 2009, International journal of molecular sciences.

[22]  J. Mark Introduction to radial basis function networks , 1996 .

[23]  Roberto Todeschini,et al.  The K correlation index: theory development and its application in chemometrics , 1999 .

[24]  J. Doucet,et al.  QSAR models for 2-amino-6-arylsulfonylbenzonitriles and congeners HIV-1 reverse transcriptase inhibitors based on linear and nonlinear regression methods. , 2009, European journal of medicinal chemistry.

[25]  J. Doucet,et al.  Structure–activity relationship study of trifluoromethylketone inhibitors of insect juvenile hormone esterase: Comparison of several classification methods , 2014, SAR and QSAR in environmental research.

[26]  Annick Panaye,et al.  Three Dimensional QSAR: Applications in Pharmacology and Toxicology , 2010 .

[27]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001, Statistical Science.

[28]  R. S. Zhang,et al.  Quantitative structure-toxicity relationships (QSTRs): A comparative study of various non linear methods. General regression neural network, radial basis function neural network and support vector machine in predicting toxicity of nitro- and cyano- aromatics to Tetrahymena pyriformis , 2006, SAR and QSAR in environmental research.

[29]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[30]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[31]  Liew Chin Yee,et al.  Current Modeling Methods Used in QSAR/QSPR , 2012 .

[32]  Desire L. Massart,et al.  Local modelling with radial basis function networks , 2000 .

[33]  Johann Gasteiger,et al.  Neural networks in chemistry and drug design , 1999 .

[34]  J. Friedman,et al.  Projection Pursuit Regression , 1981 .

[35]  Igor V. Tetko,et al.  Benchmarking of Linear and Nonlinear Approaches for Quantitative Structure-Property Relationship Studies of Metal Complexation with Ionophores , 2006, J. Chem. Inf. Model..

[36]  J. Devillers,et al.  Structure–activity relationship study of trifluoromethylketones: inhibitors of insect juvenile hormone esterase , 2013, SAR and QSAR in environmental research.

[37]  K. Nishida,et al.  Improvement of carcinogenicity prediction performances based on sensitivity analysis in variable selection of SVM models , 2013, SAR and QSAR in environmental research.

[38]  Xue Z. Wang,et al.  (Q)SAR Modelling of Nanomaterial Toxicity - A Critical Review , 2015 .

[39]  J. Ruuskanen,et al.  Performance of (consensus) kNN QSAR for predicting estrogenic activity in a large diverse set of organic compounds , 2004, SAR and QSAR in environmental research.

[40]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[41]  Ivan Rusyn,et al.  Predicting drug-induced hepatotoxicity using QSAR and toxicogenomics approaches. , 2011, Chemical research in toxicology.

[42]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[43]  Andrew Worth,et al.  Applying quantitative structure-activity relationship approaches to nanotoxicology: current status and future potential. , 2013, Toxicology.

[44]  Paola Gramatica,et al.  Screening of persistent organic pollutants by QSPR classification models: a comparative study. , 2008, Journal of molecular graphics & modelling.

[45]  Paola Gramatica,et al.  Principles of QSAR models validation: internal and external , 2007 .

[46]  Alexander Tropsha,et al.  Chemical basis of interactions between engineered nanoparticles and biological systems. , 2014, Chemical reviews.

[47]  Alexander Tropsha,et al.  Novel Variable Selection Quantitative Structure-Property Relationship Approach Based on the k-Nearest-Neighbor Principle , 2000, J. Chem. Inf. Comput. Sci..

[48]  A. Doucet-Panaye,et al.  Linear and non-linear modelling of the cytotoxicity of TiO2 and ZnO nanoparticles by empirical descriptors† , 2015, SAR and QSAR in environmental research.

[49]  Sudhir A. Kulkarni,et al.  Three-Dimensional QSAR Using the k-Nearest Neighbor Method and Its Interpretation , 2006, J. Chem. Inf. Model..

[50]  Paola Gramatica,et al.  QSARINS: A new software for the development, analysis, and validation of QSAR MLR models , 2013, J. Comput. Chem..

[51]  S. Wold,et al.  PLS-regression: a basic tool of chemometrics , 2001 .

[52]  Yunqian Ma,et al.  Practical selection of SVM parameters and noise estimation for SVM regression , 2004, Neural Networks.

[53]  W. Chan,et al.  Prediction of nanoparticles-cell association based on corona proteins and physicochemical properties. , 2015, Nanoscale.

[54]  L. Buydens,et al.  Comparing support vector machines to PLS for spectral regression applications , 2004 .

[55]  C. Oksel,et al.  Current situation on the availability of nanostructure–biological activity data , 2015, SAR and QSAR in environmental research.

[56]  David Brown,et al.  Investigating the relationship between nanomaterial hazard and physicochemical properties: Informing the exploitation of nanomaterials within therapeutic and diagnostic applications. , 2012, Journal of controlled release : official journal of the Controlled Release Society.

[57]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[58]  A. Tropsha,et al.  Quantitative nanostructure-activity relationship modeling. , 2010, ACS nano.

[59]  Alberto Anguita,et al.  A Machine Learning Approach to Identify Clinical Trials Involving Nanodrugs and Nanodevices from ClinicalTrials.gov , 2014, PloS one.

[60]  Jean-Pierre Doucet,et al.  Nonlinear SVM Approaches to QSPR/QSAR Studies and Drug Design , 2007 .

[61]  Alberto Fernandez,et al.  Predicting Cell Association of Surface-Modified Nanoparticles Using Protein Corona Structure - Activity Relationships (PCSAR). , 2015, Current topics in medicinal chemistry.