Co-expression network analysis and genetic algorithms for gene prioritization in preeclampsia

Background: In this study, we explored gene prioritization in preeclampsia by combining co-expression network analysis with genetic algorithm optimization. We analysed five public projects, obtaining 1,146 significant genes after cross-platform processing of 81 preeclamptic and 149 normal microarrays.

Methods: After constructing the co-expression network, we performed modular and node analyses using several approaches. Genetic algorithms were also applied in combination with the nearest neighbour and discriminant analysis classification methods.

Results: Significant differences were found in the gene connectivity distribution between normal and preeclampsia conditions, pointing to the importance of examining connectivity alongside expression for prioritization. We discuss global as well as intra-modular connectivity for hub detection, and the utility of genetic algorithms in combination with the network information. The FLT1, LEP, INHA and ENG genes were identified, in agreement with the literature; however, we also found other genes, such as FLNB, INHBA, NDRG1 and LYN, that are highly significant but underexplored in normal pregnancy or preeclampsia.

Conclusions: Weighted gene co-expression network analysis reveals a similar distribution across the modules detected in both normal and preeclampsia conditions. However, major differences were obtained by analysing node connectivity. All models obtained by genetic algorithm procedures achieved correct classification rates higher than 90% while being restricted to 30 variables in both classification methods applied. Combining the two methods, we identified well-known genes related to preeclampsia and were also led to propose new candidates that are poorly explored or completely unknown in the pathogenesis of preeclampsia, which would need to be validated experimentally.
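To make the distinction between global and intra-modular connectivity concrete, the following is a minimal sketch of how both quantities can be computed for a weighted co-expression network. It is not the pipeline used in the study: the function names, the unsigned adjacency definition (absolute Pearson correlation raised to a power), and the soft-threshold power `beta=6.0` are all assumptions chosen for illustration, and module labels are taken as given rather than detected.

```python
import numpy as np


def weighted_adjacency(expr, beta=6.0):
    """Unsigned weighted adjacency from a samples x genes expression matrix.

    beta is an assumed soft-threshold power, not a value from the study.
    """
    corr = np.corrcoef(expr, rowvar=False)  # gene-by-gene Pearson correlation
    adj = np.abs(corr) ** beta              # soft thresholding
    np.fill_diagonal(adj, 0.0)              # ignore self-connections
    return adj


def global_connectivity(adj):
    """Sum of connection weights of each gene to all other genes."""
    return adj.sum(axis=1)


def intramodular_connectivity(adj, modules):
    """Sum of connection weights of each gene to genes in its own module.

    modules is a hypothetical per-gene label array supplied by the caller.
    """
    modules = np.asarray(modules)
    same_module = modules[:, None] == modules[None, :]
    return (adj * same_module).sum(axis=1)


if __name__ == "__main__":
    # Synthetic data only: 20 samples, 3 co-regulated genes plus 3 noise genes.
    rng = np.random.default_rng(0)
    factor = rng.normal(size=(20, 1))
    expr = np.hstack([factor + 0.1 * rng.normal(size=(20, 3)),
                      rng.normal(size=(20, 3))])
    adj = weighted_adjacency(expr)
    print(global_connectivity(adj))
    print(intramodular_connectivity(adj, [0, 0, 0, 1, 1, 1]))
```

Computing these values separately for the normal and preeclampsia expression matrices, and then comparing each gene's connectivity between conditions, is one way to examine connectivity alongside expression as the abstract describes.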
