From Classical to Modern Computational Approaches to Identify Key Genetic Regulatory Components in Plant Biology

The selection of plant genotypes with improved productivity and tolerance to environmental constraints has always been a major concern in plant breeding. Classical approaches based on the generation of variability and selection of better phenotypes from large variant collections have improved their efficacy and processivity due to the implementation of molecular biology techniques, particularly genomics, Next Generation Sequencing and other omics such as proteomics and metabolomics. In this regard, the identification of interesting variants before they develop the phenotype trait of interest with molecular markers has advanced the breeding process of new varieties. Moreover, the correlation of phenotype or biochemical traits with gene expression or protein abundance has boosted the identification of potential new regulators of the traits of interest, using a relatively low number of variants. These important breakthrough technologies, built on top of classical approaches, will be improved in the future by including the spatial variable, allowing the identification of gene(s) involved in key processes at the tissue and cell levels.

[1]  K. Vandepoele,et al.  Charting plant gene functions in the multi-omics and single-cell era. , 2022, Trends in plant science.

[2]  A. Quintero-Jiménez,et al.  Biotechnological Advances to Improve Abiotic Stress Tolerance in Crops , 2022, International journal of molecular sciences.

[3]  Y. Vigouroux,et al.  Evaluation of nine statistics to identify QTLs in bulk segregant analysis using next generation sequencing approaches , 2022, BMC Genomics.

[4]  M. Meijón,et al.  Chloroplast proteomics reveals transgenerational cross-stress priming in Pinus radiata , 2022, Environmental and Experimental Botany.

[5]  Fuguang Li,et al.  Integration of eQTL Analysis and GWAS Highlights Regulation Networks in Cotton under Stress Condition , 2022, International journal of molecular sciences.

[6]  C. Deane,et al.  Extracting Information from Gene Coexpression Networks of Rhizobium leguminosarum , 2022, J. Comput. Biol..

[7]  W. Luo,et al.  Proteomics and Co-expression Network Analysis Reveal the Importance of Hub Proteins and Metabolic Pathways in Nicotine Synthesis and Accumulation in Tobacco (Nicotiana tabacum L.) , 2022, Frontiers in Plant Science.

[8]  Zhonglin Shang,et al.  RNA-Seq Analysis Identifies Transcription Factors Involved in Anthocyanin Biosynthesis of ‘Red Zaosu’ Pear Peel and Functional Study of PpPIF8 , 2022, International journal of molecular sciences.

[9]  M. Mutwil,et al.  Exploiting plant transcriptomic databases: Resources, tools, and approaches , 2022, Plant communications.

[10]  Yang Wu,et al.  Time-course transcriptome and WGCNA analysis revealed the drought response mechanism of two sunflower inbred lines , 2022, PloS one.

[11]  Zhaojun Ding,et al.  Time-series transcriptome comparison reveals the gene regulation network under salt stress in soybean (Glycine max) roots , 2022, BMC plant biology.

[12]  Y. Gibon,et al.  High-throughput plant phenotyping: a role for metabolomics? , 2022, Trends in plant science.

[13]  Hong Zhang,et al.  Plant Public RNA‐seq Database: a comprehensive online database for expression analysis of ~45 000 plant public RNA‐Seq libraries , 2022, Plant biotechnology journal.

[14]  Jiahong Zhu,et al.  Comparative transcriptome and weighted correlation network analyses reveal candidate genes involved in chlorogenic acid biosynthesis in sweet potato , 2022, Scientific Reports.

[15]  C. Bailly,et al.  In-Depth Proteomic Analysis of the Secondary Dormancy Induction by Hypoxia or High Temperature in Barley Grains. , 2022, Plant & cell physiology.

[16]  B. Dubreucq,et al.  Custom methods to identify conserved genetic modules applied to novel transcriptomic data from Amborella trichopoda. , 2022, Journal of experimental botany.

[17]  P. Verma,et al.  Broadening the horizon of crop research: a decade of advancements in plant molecular genetics to divulge phenotype governing genes , 2022, Planta.

[18]  Benjamin T. Shealy,et al.  Addressing noise in co-expression network construction , 2021, Briefings Bioinform..

[19]  Natalie M. Clark,et al.  To the proteome and beyond: advances in single-cell omics profiling for plant systems , 2021, Plant physiology.

[20]  Baoshan Chen,et al.  Gene-coexpression network analysis identifies specific modules and hub genes related to cold stress in rice , 2021, BMC Genomics.

[21]  S. Roy,et al.  Expression profile, transcriptional and post-transcriptional regulation of genes involved in hydrogen sulphide metabolism connecting the balance between development and stress adaptation in plants: a data-mining bioinformatics approach. , 2021, Plant biology.

[22]  T. Guo,et al.  From Classical Radiation to Modern Radiation: Past, Present, and Future of Radiation Mutation Breeding , 2021, Frontiers in Public Health.

[23]  R. VanBuren,et al.  Representation and participation across 20 years of plant genome sequencing , 2021, Nature Plants.

[24]  Robert J. Schmitz,et al.  Cis-regulatory sequences in plants: Their importance, discovery, and future challenges , 2021, The Plant cell.

[25]  L. Fan,et al.  Twenty years of plant genome sequencing: achievements and challenges. , 2021, Trends in plant science.

[26]  Florian Auer,et al.  RCX—an R package adapting the Cytoscape Exchange format for biological networks , 2021, bioRxiv.

[27]  S. Giacomello,et al.  A new era for plant science: spatial single-cell transcriptomics. , 2021, Current opinion in plant biology.

[28]  O. Novák,et al.  In situ characterisation of phytohormones from wounded Arabidopsis leaves using desorption electrospray ionisation mass spectrometry imaging. , 2021, The Analyst.

[29]  P. Darriet,et al.  Biosynthesis and Cellular Functions of Tartaric Acid in Grapevines , 2021, Frontiers in Plant Science.

[30]  Nadezhda T. Doncheva,et al.  The STRING database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets , 2020, Nucleic Acids Res..

[31]  Kayla A Johnson,et al.  Robust normalization and transformation techniques for constructing gene coexpression networks from RNA-seq data , 2020, Genome Biology.

[32]  George M. Spyrou,et al.  ProtExA: A tool for post-processing proteomics data providing differential expression metrics, co-expression networks and functional analytics , 2020, Computational and structural biotechnology journal.

[33]  Samuel Chaffron,et al.  MiBiOmics: an interactive web application for multi-omics data exploration and integration , 2020, BMC Bioinformatics.

[34]  Biswapriya B Misra,et al.  Data normalization strategies in metabolomics: Current challenges, approaches, and tools , 2020, European journal of mass spectrometry.

[35]  A. Fait,et al.  Can metabolic tightening and expansion of co-expression network play a role in stress response and tolerance? , 2020, Plant science : an international journal of experimental plant biology.

[36]  Andreas Rempel,et al.  Comparison of Read Mapping and Variant Calling Tools for the Analysis of Plant NGS Data , 2020, bioRxiv.

[37]  Maojun Wang,et al.  Combined GWAS and eQTL analysis uncovers a genetic regulatory network orchestrating the initiation of secondary cell wall development in cotton. , 2020, The New phytologist.

[38]  T. Michael,et al.  Building near-complete plant genomes. , 2020, Current opinion in plant biology.

[39]  Claire O'Donovan,et al.  MetaboLights: a resource evolving in response to the needs of its scientific community , 2019, Nucleic Acids Res..

[40]  Shraddha Pai,et al.  RCy3: Network biology using Cytoscape from within R , 2019, bioRxiv.

[41]  A. Sebé-Pedrós,et al.  Origin and evolution of eukaryotic transcription factors. , 2019, Current opinion in genetics & development.

[42]  Xiaolan Rao,et al.  Co-expression networks for plant biology: why and how. , 2019, Acta biochimica et biophysica Sinica.

[43]  M. Schatz,et al.  GenomeScope 2.0 and Smudgeplots: Reference-free profiling of polyploid genomes , 2019, bioRxiv.

[44]  Kim-Anh Lê Cao,et al.  DIABLO: an integrative approach for identifying key molecular drivers from multi-omics assays , 2019, Bioinform..

[45]  M. Vinaixa,et al.  FELLA: an R package to enrich metabolomics data , 2018, BMC Bioinformatics.

[46]  Robert E. W. Hancock,et al.  MetaBridge: enabling network-based integrative analysis via direct protein interactors of metabolites , 2018, Bioinform..

[47]  A. Aharoni,et al.  DLEMMA-MS-Imaging for Identification of Spatially Localized Metabolites and Metabolic Network Map Reconstruction. , 2018, Analytical chemistry.

[48]  Jianguo Xia,et al.  OmicsNet: a web-based tool for creation and visual analysis of biological networks in 3D space , 2018, Nucleic Acids Res..

[49]  Ludovic Cottret,et al.  MetExplore: collaborative edition and exploration of metabolic networks , 2018, Nucleic Acids Res..

[50]  G. Glevarec,et al.  Ranking genome-wide correlation measurements improves microarray and RNA-seq based global and targeted co-expression networks , 2018, bioRxiv.

[51]  Karan Uppal,et al.  xMWAS: a data-driven integration and differential network analysis tool , 2018, Bioinform..

[52]  T. Van Du Tran,et al.  Condition-specific series of metabolic sub-networks and its application for gene set enrichment analysis , 2017, bioRxiv.

[53]  Björn Usadel,et al.  Plant genome and transcriptome annotations: from misconceptions to simple solutions , 2017, Briefings Bioinform..

[54]  E. Wit,et al.  Detecting epistatic selection with partially observed genotype data by using copula graphical models , 2017, 1710.00894.

[55]  J. C. Herrera,et al.  Multi-Omics and Integrated Network Analyses Reveal New Insights into the Systems Relationships between Metabolites, Structural Genes, and Transcriptional Regulators in Developing Grape Berries (Vitis vinifera L.) Exposed to Water Deficit , 2017, Front. Plant Sci..

[56]  Kim-Anh Lê Cao,et al.  mixOmics: An R package for ‘omics feature selection and multiple data integration , 2017, bioRxiv.

[57]  D. Wong,et al.  Constructing Integrated Networks for Identifying New Secondary Metabolic Pathway Regulators in Grapevine: Recent Applications and Future Opportunities , 2017, Front. Plant Sci..

[58]  Javad Zahiri,et al.  Gene co-expression network reconstruction: a review on computational methods for inferring functional information from plant-based expression data , 2017, Plant Biotechnology Reports.

[59]  Korbinian Schneeberger,et al.  The impact of third generation genomic technologies on plant genome assembly. , 2017, Current opinion in plant biology.

[60]  Yu Zhang,et al.  QUBIC: a bioconductor package for qualitative biclustering analysis of gene co‐expression data , 2016, Bioinform..

[61]  Chad L. Myers,et al.  Unraveling gene function in agricultural species using gene co-expression networks. , 2017, Biochimica et biophysica acta. Gene regulatory mechanisms.

[62]  A. Kushalappa,et al.  Functional molecular markers for crop improvement , 2016, Critical reviews in biotechnology.

[63]  Stéphanie Bougeard,et al.  MINT: a multivariate integrative method to identify reproducible molecular signatures across independent experiments and platforms , 2016, BMC Bioinformatics.

[64]  R. Hancock,et al.  Integrated proteomics and metabolomics to unlock global and clonal responses of Eucalyptus globulus recovery from water deficit , 2016, Metabolomics.

[65]  I. Mayrose,et al.  Whole-genome duplication as a key factor in crop domestication , 2016, Nature Plants.

[66]  A. Aharoni,et al.  Solanum pennellii backcross inbred lines (BILs) link small genomic bins with tomato traits. , 2016, The Plant journal : for cell and molecular biology.

[67]  Uwe Scholz,et al.  PGP repository: a plant phenomics and genomics data publication infrastructure , 2016, Database J. Biol. Databases Curation.

[68]  F. Thibaud-Nissen,et al.  Araport11: a complete reannotation of the Arabidopsis thaliana reference genome , 2016, bioRxiv.

[69]  T. Lehtimäki,et al.  Integrative approaches for large-scale transcriptome-wide association studies , 2015, Nature Genetics.

[70]  D. Zamir,et al.  Mendelizing all Components of a Pyramid of Three Yield QTL in Tomato , 2015, Front. Plant Sci..

[71]  S. M. Shah,et al.  Ethyl methane sulfonate induced mutations in M2 generation and physiological variations in M1 generation of peppers (Capsicum annuum L.) , 2015, Front. Plant Sci..

[72]  J. Yates,et al.  Isobaric Labeling-Based Relative Quantification in Shotgun Proteomics , 2014, Journal of proteome research.

[73]  Atsushi Fukushima,et al.  A network perspective on nitrogen metabolism from model to crop plants using integrated 'omics' approaches. , 2014, Journal of experimental botany.

[74]  Gary D Bader,et al.  Biological Network Exploration with Cytoscape 3 , 2014, Current protocols in bioinformatics.

[75]  T. Michael Plant genome size variation: bloating and purging DNA. , 2014, Briefings in functional genomics.

[76]  Alexander Goesmann,et al.  The genome of the recently domesticated crop plant sugar beet (Beta vulgaris) , 2013, Nature.

[77]  Andrei L. Turinsky,et al.  Navigating the global protein-protein interaction landscape using iRefWeb. , 2014, Methods in molecular biology.

[78]  Gary D. Bader,et al.  GeneMANIA Prediction Server 2013 Update , 2013, Nucleic Acids Res..

[79]  R. Terauchi,et al.  QTL-seq: rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. , 2013, The Plant Journal.

[80]  Sanzhen Liu,et al.  Mendelian and Non-Mendelian Regulation of Gene Expression in Maize , 2013, PLoS genetics.

[81]  Ignacio González,et al.  Visualising associations between paired ‘omics’ data sets , 2012, BioData Mining.

[82]  George W Bassel,et al.  Systems Analysis of Plant Functional, Transcriptional, Physical Interaction, and Metabolic Networks , 2012, Plant Cell.

[83]  David R. Kelley,et al.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2012, Nature Protocols.

[84]  Rasko Leinonen,et al.  The sequence read archive: explosive growth of sequencing data , 2011, Nucleic Acids Res..

[85]  David M. A. Martin,et al.  Genome sequence and analysis of the tuber crop potato , 2011, Nature.

[86]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[87]  B. Usadel,et al.  PlaNet: Combined Sequence and Expression Comparisons across Plant Networks Derived from Seven Species[W][OA] , 2011, Plant Cell.

[88]  David W. Koppenaal,et al.  Diurnal Rhythms Result in Significant Changes in the Cellular Protein Complement in the Cyanobacterium Cyanothece 51142 , 2011, PloS one.

[89]  A. Tanaka,et al.  Studies on biological effects of ion beams on lethality, molecular nature of mutation, mutation rate, and spectrum of mutation phenotype for mutation breeding in higher plants. , 2010, Journal of radiation research.

[90]  Bailin Li,et al.  Expression QTLs: applications for crop improvement , 2010, Molecular Breeding.

[91]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[92]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[93]  D. Kliebenstein Quantitative genomics: analyzing intraspecific variation using global gene expression polymorphisms or eQTLs. , 2009, Annual review of plant biology.

[94]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[95]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[96]  S. Hake,et al.  The art and design of genetic screens: maize , 2008, Nature Reviews Genetics.

[97]  Oliver Fiehn,et al.  Quality control for plant metabolomics: reporting MSI-compliant studies. , 2008, The Plant journal : for cell and molecular biology.

[98]  D. Kliebenstein,et al.  A Systems Biology Approach Identifies a R2R3 MYB Gene Subfamily with Distinct and Overlapping Functions in Regulation of Aliphatic Glucosinolates , 2007, PloS one.

[99]  D. Roff A CENTENNIAL CELEBRATION FOR QUANTITATIVE GENETICS , 2007, Evolution; international journal of organic evolution.

[100]  R. Doerge,et al.  Global eQTL Mapping Reveals the Complex Genetic Architecture of Transcript-Level Variation in Arabidopsis , 2007, Genetics.

[101]  Andy M. Yip,et al.  Gene network interconnectedness and the generalized topological overlap measure , 2007, BMC Bioinformatics.

[102]  Daniel J. Kliebenstein,et al.  Identification of QTLs controlling gene expression networks defined a priori , 2006, BMC Bioinformatics.

[103]  H. Senn,et al.  Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in 1H NMR metabonomics. , 2006, Analytical chemistry.

[104]  Lincoln Stein,et al.  Reactome: a knowledgebase of biological pathways , 2004, Nucleic Acids Res..

[105]  M. Hirai,et al.  Elucidation of Gene-to-Gene and Metabolite-to-Gene Networks in Arabidopsis by Integration of Metabolomics and Transcriptomics* , 2005, Journal of Biological Chemistry.

[106]  D. Zamir,et al.  Lycopersicon esculentum lines containing small overlapping introgressions from L. pennellii , 1992, Theoretical and Applied Genetics.

[107]  T. Mackay Complementing complexity , 2004, Nature Genetics.

[108]  M. Purugganan,et al.  Candidate Genes, Quantitative Trait Loci, and Functional Trait Evolution in Plants , 2003, International Journal of Plant Sciences.

[109]  Susumu Goto,et al.  The KEGG databases at GenomeNet , 2002, Nucleic Acids Res..

[110]  M. Yano,et al.  Identification of heading date quantitative trait locus Hd6 and characterization of its epistatic interactions with Hd2 in rice using advanced backcross progeny. , 2000, Genetics.

[111]  Z B Zeng,et al.  Estimating the genetic architecture of quantitative traits. , 1999, Genetical research.

[112]  S. Lin,et al.  Fine mapping of quantitative trait loci Hd-1, Hd-2 and Hd-3, controlling heading date of rice, as single Mendelian factors , 1998, Theoretical and Applied Genetics.

[113]  M. Yano,et al.  Identification of quantitative trait loci controlling heading date in rice using a high-density linkage map , 1997, Theoretical and Applied Genetics.

[114]  D. Zamir,et al.  An introgression line population of Lycopersicon pennellii in the cultivated tomato enables the identification and fine mapping of yield-associated QTL. , 1995, Genetics.

[115]  G. Martin,et al.  Chromosome landing: a paradigm for map-based gene cloning in plants with large genomes. , 1995, Trends in genetics : TIG.