Principal component analysis-based unsupervised feature extraction applied to in silico drug discovery for posttraumatic stress disorder-mediated heart disease

BackgroundFeature extraction (FE) is difficult, particularly if there are more features than samples, as small sample numbers often result in biased outcomes or overfitting. Furthermore, multiple sample classes often complicate FE because evaluating performance, which is usual in supervised FE, is generally harder than the two-class problem. Developing sample classification independent unsupervised methods would solve many of these problems.ResultsTwo principal component analysis (PCA)-based FE, specifically, variational Bayes PCA (VBPCA) was extended to perform unsupervised FE, and together with conventional PCA (CPCA)-based unsupervised FE, were tested as sample classification independent unsupervised FE methods. VBPCA- and CPCA-based unsupervised FE both performed well when applied to simulated data, and a posttraumatic stress disorder (PTSD)-mediated heart disease data set that had multiple categorical class observations in mRNA/microRNA expression of stressed mouse heart. A critical set of PTSD miRNAs/mRNAs were identified that show aberrant expression between treatment and control samples, and significant, negative correlation with one another. Moreover, greater stability and biological feasibility than conventional supervised FE was also demonstrated. Based on the results obtained, in silico drug discovery was performed as translational validation of the methods.ConclusionsOur two proposed unsupervised FE methods (CPCA- and VBPCA-based) worked well on simulated data, and outperformed two conventional supervised FE methods on a real data set. Thus, these two methods have suggested equivalence for FE on categorical multiclass data sets, with potential translational utility for in silico drug discovery.

[1]  K. Haque,et al.  Cardiac Involvement of Hepatitis B and C Virus Infection , 2013 .

[2]  L. Zentilin,et al.  MiR-378 Controls Cardiac Hypertrophy by Combined Repression of Mitogen-Activated Protein Kinase Pathway Factors , 2013, Circulation.

[3]  Mitsuo Iwadate,et al.  Bioinformatic Screening of Autoimmune Disease Genes and Protein Structure Prediction with FAMS for Drug Discovery , 2013, Protein and peptide letters.

[4]  Xinyan Li,et al.  Neuregulin-1/ErbB signaling and chronic heart failure. , 2010, Advances in pharmacology.

[5]  Z. Massy,et al.  miR-143 and miR-145: molecular keys to switch the phenotype of vascular smooth muscle cells. , 2011, Circulation. Cardiovascular genetics.

[6]  Hideo Matsuda,et al.  Gendoo: Functional profiling of gene and disease features using MeSH vocabulary , 2009, Nucleic Acids Res..

[7]  Y-h. Taguchi,et al.  Genes associated with genotype-specific DNA methylation in squamous cell carcinoma as candidate drug targets , 2014, BMC Systems Biology.

[8]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[9]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[10]  M. Rojas,et al.  Myocyte-Restricted Focal Adhesion Kinase Deletion Attenuates Pressure Overload–Induced Hypertrophy , 2006, Circulation research.

[11]  Nectarios Koziris,et al.  TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support , 2011, Nucleic Acids Res..

[12]  Ming-Hui Chen,et al.  Management of cardiac toxicity in patients receiving vascular endothelial growth factor signaling pathway inhibitors. , 2012, American heart journal.

[13]  E. Olson,et al.  A signature pattern of stress-responsive microRNAs that can evoke cardiac hypertrophy and heart failure , 2006, Proceedings of the National Academy of Sciences.

[14]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[15]  M. Burg,et al.  Posttraumatic stress disorder and risk for coronary heart disease: a meta-analytic review. , 2013, American heart journal.

[16]  Y-h. Taguchi,et al.  Integrative Analysis of Gene Expression and Promoter Methylation during Reprogramming of a Non-Small-Cell Lung Cancer Cell Line Using Principal Component Analysis-Based Unsupervised Feature Extraction , 2014, ICIC.

[17]  Offer Amir,et al.  Serum levels of microRNAs in patients with heart failure , 2012, European journal of heart failure.

[18]  G. Dorn,et al.  Fatty Acid Synthase Modulates Homeostatic Responses to Myocardial Stress* , 2011, The Journal of Biological Chemistry.

[19]  Yoshiki Murakami,et al.  Universal disease biomarker: can a fixed set of blood microRNAs diagnose multiple diseases? , 2014, BMC Research Notes.

[20]  Charles M. Bishop Variational principal components , 1999 .

[21]  Anthony J. Muslin,et al.  MAPK signalling in cardiovascular health and disease: molecular mechanisms and therapeutic targets. , 2008, Clinical science.

[22]  L. Au,et al.  Systemic comparison of repression activity for miRNA and siRNA associated with different types of target sequences. , 2011, Biochemical and biophysical research communications.

[23]  Yoshiki Murakami,et al.  Comprehensive miRNA Expression Analysis in Peripheral Blood Can Diagnose Liver Disease , 2012, PloS one.

[24]  L. Qiang,et al.  Expression of miR-126 and miR-508-5p in endothelial progenitor cells is associated with the prognosis of chronic heart failure patients. , 2013, International journal of cardiology.

[25]  Carissa Ritner,et al.  miR-125b Promotes Early Germ Layer Specification through Lin28/let-7d and Preferential Differentiation of Mesoderm in Human Embryonic Stem Cells , 2012, PloS one.

[26]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[27]  D. Engman,et al.  Chagas heart disease pathogenesis: one mechanism or many? , 2008, Current molecular medicine.

[28]  O. Rascol,et al.  Risk of heart failure following treatment with dopamine agonists in Parkinson's disease patients , 2014, Expert opinion on drug safety.

[29]  Yoshiki Murakami,et al.  Principal Component Analysis Based Feature Extraction Approach to Identify Circulating microRNA Biomarkers , 2013, PloS one.

[30]  Hideaki Umeyama,et al.  FAMS and FAMSBASE for Protein Structure , 2003, Current protocols in bioinformatics.

[31]  Shengshou Hu,et al.  MicroRNA-24 regulates cardiac fibrosis after myocardial infarction , 2012, Journal of cellular and molecular medicine.

[32]  Akira Okamoto,et al.  Principal Component Analysis for Bacterial Proteomic Analysis , 2012, PRIB.

[33]  C. Saenz,et al.  Cardiovascular disease is the leading cause of death among endometrial cancer patients. , 2012, Gynecologic oncology.

[34]  Martin Reczko,et al.  DIANA miRPath v.2.0: investigating the combinatorial effect of microRNAs in pathways , 2012, Nucleic Acids Res..

[35]  F. Staedtler,et al.  Perturbation of microRNAs in Rat Heart during Chronic Doxorubicin Treatment , 2012, PloS one.

[36]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[37]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[38]  A. Morabia,et al.  Cardiovascular Disease Hospitalizations in Relation to Exposure to the September 11, 2001 World Trade Center Disaster and Posttraumatic Stress Disorder , 2013, Journal of the American Heart Association.

[39]  Y. Jimbo,et al.  Axon Guidance of Sympathetic Neurons to Cardiomyocytes by Glial Cell Line-Derived Neurotrophic Factor (GDNF) , 2013, PloS one.

[40]  Inyoul Y. Lee,et al.  Molecular evidence of stress-induced acute heart injury in a mouse model simulating posttraumatic stress disorder , 2014, Proceedings of the National Academy of Sciences.

[41]  Amit J. Shah,et al.  Post-traumatic stress disorder and incidence of coronary heart disease: a twin study. , 2013, Journal of the American College of Cardiology.

[42]  N. Frangogiannis,et al.  The role of TGF-beta signaling in myocardial infarction and cardiac remodeling. , 2007, Cardiovascular research.

[43]  张振辉 MicroRNA-26 was decreased in rat cardiac hypertrophy model and may be a promising therapeutic target. , 2013 .

[44]  R. Yehuda POST-TRAUMATIC STRESS DISORDER , 1985, The Lancet.

[45]  R. Duisters,et al.  MIRNA-133 AND MIRNA-30 REGULATE CONNECTIVE TISSUE GROWTH FACTOR: IMPLICATIONS FOR A ROLE OF MIRNAS IN MYOCARDIAL MATRIX REMODELING , 2013 .

[46]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[47]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[48]  Le Song,et al.  Feature Selection via Dependence Maximization , 2012, J. Mach. Learn. Res..

[49]  Genki Terashi,et al.  Bioinformatics based Ligand-Docking and in-silico screening. , 2008, Chemical & pharmaceutical bulletin.

[50]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[51]  B. Finck,et al.  The PPAR regulatory system in cardiac physiology and disease. , 2007, Cardiovascular research.

[52]  Mitsuo Iwadate,et al.  TINAGL1 and B3GALNT1 are potential therapy target genes to suppress metastasis in non-small cell lung cancer , 2014, BMC Genomics.

[53]  B. Zupan,et al.  miR669a and miR669q prevent skeletal muscle differentiation in postnatal cardiac progenitors , 2011, The Journal of cell biology.

[54]  G. Andriole,et al.  Prostate Cancer Risk in Men with Baseline History of Coronary Artery Disease: Results from the REDUCE Study , 2012, Cancer Epidemiology, Biomarkers & Prevention.

[55]  P. Kang,et al.  Cardiac-Specific Overexpression of Cyclin-Dependent Kinase 2 Increases Smaller Mononuclear Cardiomyocytes , 2001, Circulation research.

[56]  Perry D Moerland,et al.  MiR423-5p As a Circulating Biomarker for Heart Failure , 2010, Circulation research.

[57]  Mario Medvedovic,et al.  Loss of the miR-144/451 cluster impairs ischaemic preconditioning-mediated cardioprotection by targeting Rac-1. , 2012, Cardiovascular research.

[58]  M. Meghji,et al.  Toxoplasmic infection in cardiac disease. , 1979, The American journal of cardiology.

[59]  A. Filipek,et al.  Hsp90 and its co-chaperone, Sgt1, as autoantigens in dilated cardiomyopathy , 2011, Heart and Vessels.

[60]  William Stanley,et al.  Cardiac mitochondria in heart failure: decrease in respirasomes and oxidative phosphorylation , 2008, Cardiovascular research.

[61]  H. Zhang,et al.  Identification of differentially expressed genes in human heart with ventricular septal defect using suppression subtractive hybridization. , 2006, Biochemical and biophysical research communications.

[62]  Jeffrey E. Thatcher,et al.  Dysregulation of microRNAs after myocardial infarction reveals a role of miR-29 in cardiac fibrosis , 2008, Proceedings of the National Academy of Sciences.

[63]  Brian Walters,et al.  Cardiotoxicity of the cancer therapeutic agent imatinib mesylate , 2006, Nature Medicine.

[64]  E. Masliah,et al.  Prion-Induced Amyloid Heart Disease with High Blood Infectivity in Transgenic Mice , 2006, Science.

[65]  M. Sternberg,et al.  Protein structure prediction on the Web: a case study using the Phyre server , 2009, Nature Protocols.

[66]  J. Nerbonne,et al.  MicroRNA-133a Protects Against Myocardial Fibrosis and Modulates Electrical Repolarization Without Affecting Hypertrophy in Pressure-Overloaded Adult Hearts , 2010, Circulation research.

[67]  Y. Cheng,et al.  Let-7 in Cardiovascular Diseases, Heart Development and Cardiovascular Differentiation from Stem Cells , 2013, International journal of molecular sciences.

[68]  Yee Whye Teh,et al.  Variational Bayesian Approach to Movie Rating Prediction , 2007, KDD 2007.

[69]  Susumu Goto,et al.  Data, information, knowledge and principle: back to metabolism in KEGG , 2013, Nucleic Acids Res..

[70]  Jeffrey S. Morris,et al.  Prevalence of colorectal neoplasm among patients with newly diagnosed coronary artery disease. , 2007, JAMA.

[71]  N. Frangogiannis,et al.  The role of TGF-β Signaling in Myocardial Infarction and Cardiac Remodeling , 2007 .

[72]  Tsutomu Matsushita,et al.  Gap junction alterations in human cardiac disease. , 2004, Cardiovascular research.