Exogenous Chemicals Impact Virus Receptor Gene Transcription: Insights from Deep Learning

Despite the fact that coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has been disrupting human life and health worldwide since the outbreak in late 2019, the impact of exogenous substance exposure on the viral infection remains unclear. It is well-known that, during viral infection, organism receptors play a significant role in mediating the entry of viruses to enter host cells. A major receptor of SARS-CoV-2 is the angiotensin-converting enzyme 2 (ACE2). This study proposes a deep learning model based on the graph convolutional network (GCN) that enables, for the first time, the prediction of exogenous substances that affect the transcriptional expression of the ACE2 gene. It outperforms other machine learning models, achieving an area under receiver operating characteristic curve (AUROC) of 0.712 and 0.703 on the validation and internal test set, respectively. In addition, quantitative polymerase chain reaction (qPCR) experiments provided additional supporting evidence for indoor air pollutants identified by the GCN model. More broadly, the proposed methodology can be applied to predict the effect of environmental chemicals on the gene transcription of other virus receptors as well. In contrast to typical deep learning models that are of black box nature, we further highlight the interpretability of the proposed GCN model and how it facilitates deeper understanding of gene change at the structural level.

[1]  Lee M. Tatham,et al.  FXR inhibition may protect from SARS-CoV-2 infection by reducing ACE2 , 2022, Nature.

[2]  A. Suhrbier,et al.  Evolution of ACE2-independent SARS-CoV-2 infection and mouse adaption after passage in cells expressing human and mouse ACE2 , 2022, Virus evolution.

[3]  M. Santiago,et al.  Interferon resistance of emerging SARS-CoV-2 variants , 2022, Proceedings of the National Academy of Sciences of the United States of America.

[4]  A. Szymczak,et al.  The association of airborne particulate matter and benzo[a]pyrene with the clinical course of COVID-19 in patients hospitalized in Poland , 2022, Environmental Pollution.

[5]  G. Orshansky,et al.  Effect of Androgen Suppression on Clinical Outcomes in Hospitalized Men With COVID-19 , 2022, JAMA network open.

[6]  Ryan K Flannigan,et al.  Androgens and COVID-19: exploring the role of testosterone replacement therapy , 2022, International Journal of Impotence Research.

[7]  G. Jiang,et al.  Exogenous Chemical Exposure Increased Transcription Levels of the Host Virus Receptor Involving Coronavirus Infection , 2022, Environmental science & technology.

[8]  V. Latora,et al.  A novel methodology for epidemic risk assessment of COVID-19 outbreak , 2021, Scientific Reports.

[9]  Mariia Matveieva,et al.  Benchmarks for interpretation of QSAR models , 2021, Journal of Cheminformatics.

[10]  Yu Chen,et al.  Coinfection with influenza A virus enhances SARS-CoV-2 infectivity , 2021, Cell Research.

[11]  Hongyu Zhao,et al.  Androgen Signaling Regulates SARS-CoV-2 Receptor Levels and Is Associated with Severe COVID-19 Symptoms in Men , 2020, Cell Stem Cell.

[12]  Rachel C. Nethery,et al.  Air pollution and COVID-19 mortality in the United States: Strengths and limitations of an ecological regression analysis , 2020, Science Advances.

[13]  Chang-Yu Hsieh,et al.  Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models , 2020, Journal of Cheminformatics.

[14]  Ankur Khajuria,et al.  The effect of smoking on COVID‐19 severity: A systematic review and meta‐analysis , 2020, Journal of medical virology.

[15]  A. Schäffer,et al.  In vitro and in vivo identification of clinically approved drugs that modify ACE2 expression , 2020, Molecular systems biology.

[16]  Y. Bi,et al.  Glucocorticoids improve severe or critical COVID-19 by activating ACE2 and reducing IL-6 levels , 2020, International journal of biological sciences.

[17]  Quanlong Jiang,et al.  Individual variation of the SARS‐CoV‐2 receptor ACE2 gene expression and regulation , 2020, Aging cell.

[18]  Zhen-Yu Huang,et al.  Air pollution and temperature are associated with increased COVID-19 incidence: A time series study , 2020, International Journal of Infectious Diseases.

[19]  Q. Hamid,et al.  Airways Expression of SARS-CoV-2 Receptor, ACE2, and TMPRSS2 Is Lower in Children Than Adults and Increases with Smoking and COPD , 2020, Molecular Therapy - Methods & Clinical Development.

[20]  Supinda Bunyavanich,et al.  Nasal Gene Expression of Angiotensin-Converting Enzyme 2 in Children and Adults. , 2020, JAMA.

[21]  Y. Bossé,et al.  Tobacco Smoking Increases the Lung Gene Expression of ACE2, the Receptor of SARS-CoV-2 , 2020, American journal of respiratory and critical care medicine.

[22]  C. Lindskog,et al.  The protein expression profile of ACE2 in human tissues , 2020, bioRxiv.

[23]  Emily F. Stone,et al.  The delayed effect of wildfire season particulate matter on subsequent influenza season in a mountain west region of the USA. , 2020, Environment international.

[24]  Linqi Zhang,et al.  Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor , 2020, Nature.

[25]  Carlos Dobkin,et al.  Effect of Influenza Vaccination for the Elderly on Hospitalization and Mortality , 2020, Annals of Internal Medicine.

[26]  B. Graham,et al.  Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation , 2020, Science.

[27]  Antony J. Williams,et al.  EPA’s DSSTox database: History of development of a curated chemistry resource supporting computational toxicology research , 2019, Computational toxicology.

[28]  Alexandru Korotcov,et al.  Graph Convolutional Neural Networks as "General-Purpose" Property Predictors: The Universality and Limits of Applicability , 2019, J. Chem. Inf. Model..

[29]  Da Chen,et al.  Human Indoor Exposome of Chemicals in Dust and Risk Prioritization Using EPA's ToxCast Database. , 2019, Environmental science & technology.

[30]  David Kartchner,et al.  Short‐Term Elevation of Fine Particulate Matter Air Pollution and Acute Lower Respiratory Infection , 2018, American journal of respiratory and critical care medicine.

[31]  Joshua A. Bittker,et al.  The Carcinogenome Project: In Vitro Gene Expression Profiling of Chemical Perturbations to Predict Long-Term Carcinogenicity , 2018, bioRxiv.

[32]  Angela N. Brooks,et al.  A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles , 2017, Cell.

[33]  Vijay S. Pande,et al.  MoleculeNet: a benchmark for molecular machine learning , 2017, Chemical science.

[34]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[35]  Antony J. Williams,et al.  ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology. , 2016, Chemical research in toxicology.

[36]  R. Aebersold,et al.  On the Dependency of Cellular Protein Levels on mRNA Abundance , 2016, Cell.

[37]  Vijay S. Pande,et al.  Molecular graph convolutions: moving beyond fingerprints , 2016, Journal of Computer-Aided Molecular Design.

[38]  Andrew D. Rouillard,et al.  LINCS Canvas Browser: interactive web app to query, browse and interrogate LINCS L1000 gene expression signatures , 2014, Nucleic Acids Res..

[39]  Sereina Riniker,et al.  Similarity maps - a visualization strategy for molecular fingerprints and machine-learning methods , 2013, Journal of Cheminformatics.

[40]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[41]  Mark Chappell,et al.  A crucial role of angiotensin converting enzyme 2 (ACE2) in SARS coronavirus–induced lung injury , 2005, Nature Medicine.

[42]  C. Lipinski Lead- and drug-like compounds: the rule-of-five revolution. , 2004, Drug discovery today. Technologies.

[43]  T. Greenough,et al.  What’s new in the renin-angiotensin system? , 2004, Cellular and Molecular Life Sciences CMLS.

[44]  Roger Detels,et al.  Environmental Health: a Global Access Science Source Air Pollution and Case Fatality of Sars in the People's Republic of China: an Ecologic Study , 2022 .

[45]  Thomas D. Schmittgen,et al.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. , 2001, Methods.

[46]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[47]  Thomas D. Schmittgen,et al.  Analysis of Relative Gene Expression Data Using Real-Time Quantitative PCR and the 2 2 DD C T Method , 2022 .