Bioinformatics Tools for Plant Genomics

The articles in this special issue reflect a convergence of developments in the fields of bioinformatics and plant genomics. Bioinformatics has its roots vaguely seated in the early 1980s, a time when personal computers began appearing in research laboratories and researchers began recognizing that those computers could be used as tools to organize, analyze and visualize their data. In the ensuing years bioinformatics tools began appearing at various sites including the European Molecular Biology Laboratory, the Molecular Biology Research Resource at the Dana-Farber Cancer Institute in the mid 1980s, the National Center for Biotechnology Information (NCBI) in 1988, the Genome Database Project at Johns Hopkins University in early 1989, and in countless laboratories throughout the world. These last efforts resulted in the development of many of the tools described in this special issue. Progress and interest in plant genomics have been accelerating since the time in late 2000 when the genome of Arabidopsis thaliana was published. Since then many genome sequencing projects have been undertaken that include poplar (Populus), grape (Vitis), the moss Physcomitrella, the biflagellate algae Chlamydomonas and several globally crucial crop plants such as corn (Maize) and rice (Oryza). However, as we have witnessed on numerous occasions, determining the sequence of a genome is only the first step toward understanding genome organization, gene structure, gene expression patterns, disease pathogenesis and a host of other features of both scientific and commercial interests. Computational tools of genomic annotation and comparative genomics must be applied to gain a useful understanding of any genome. In this special issue we present a collection of papers that together describe a powerful and impactful toolbox of applications and resources for plant genomic analysis. Among those articles you will find a description of research performed by the Mexican headquartered Generation Challenge Programme (GCP) which led to the GCP Platform (Bruskiewich et al.). This research support tool supports a number of data formats and web services and provides access to high performance computing facilities and platform-specific middleware collectively designed to support crop science research. Probably one of the most promising empirical tools for investigating gene expression developed in the last 15 or so years is that of microarray technology. While the technology has become commonplace, with tools for generating and hybridizing arrays available to all, the analysis of microarray-derived data has been challenging. Many laboratories have struggled not only with this challenge but also with the task of sorting through the plethora of analytical tools available in an effort to find the ones that may be best suited to their own work. In this issue there are two reviews by Page and Coulibaly which examine and describe bioinformatics tools for inferring functional information from plant microarray data. Together these papers step the reader through a collection of tools, and their applications, for analyzing the expression of single and multiple gene expression profiles. This theme of microarray analysis is continued in the description of the cross chip probe matching tool (CCPMT) by Page et al. Indeed it expands the readers horizons beyond the analysis of individual microarrays with the ability to associate probes across species. And of course, microarray analysis is facilitated by careful experimental design from the start so Robert Tempelman provides a review of statistical methods used to design efficient two-color microarray experiments. Taken together, these microarray papers provide an overview of the design of microarray experiments and the interpretation of the complex results of those experiments that will be informative for new and experienced laboratorians alike. Several other novel tools are described herein. One, Blast2GO is a suite of tools for the analysis and functional annotation of plant genomes (Conesa and Goetz). It provides an intuitive interface for identifying functional regions within DNA sequences. Another sequence analysis tool described by da Maia et al. is the SSR locator. That tool enables researchers to identify suitable targets for binding PCR primers in order to ensure that those targets are unique within the genome. It also assists with primer design and has a PCR simulator which facilitates comparisons of hypothetical amplification products among different species. Another challenge facing scientists today is the need to stay abreast of advances in a field that is progressing rapidly as a consequence of newly available technologies. In order to address this challenge there are two review articles that together provide insights into the discovery of relationships among a varied array of plant species. The first article, by Abdurakhmonov and Abdukarimov, describes the application of association mapping to understanding traits in crop species. Their work is directed toward novices within the crop breeding community in order to expose them to potential problems that they may face and solutions they may employ to overcome those problems. The second article describes the tools available for phylogenetic analyses and the increased use of Bayesian methods in those tools (Aris-Brosou and Xia). Constructing phylogenies has traditionally been a challenge to even the most experienced researcher but modern bioinformatics tools are lowering the bar for those interested in detecting adaptive evolution and estimating divergence among species. The wealth of information available to researchers today can be overwhelming. In order to address this potential, two papers describe information resources which consolidate and organize related information. PPNEMA is a database resource for those interested in plant-parasitic nematode ribosomal genes (Rubino et al.). That resource allows the user to browse, search and generally explore phytoparasite ribosomal DNA. A second database described in these pages is the MaizeGDB (Lawrence et al.). This resource contains information about Zea mays which includes genomic sequences as well as functional information and the tools to explore both. The body of the papers in this special issue represents the leading edge of plant genomics research. Together they provide the reader with descriptions of the tools and resources necessary to understand and promote advances in this important field. Gary R. Skuse Chunguang Du

[1]  R. N. Kackar,et al.  Approximations for Standard Errors of Estimators of Fixed and Random Effects in Mixed Linear Models , 1984 .

[2]  E R Martin,et al.  Letter to the Editor Correcting for a Potential Bias in the Pedigree Disequilibrium Test , 2022 .

[3]  B. R. Wiseman,et al.  Quantitative trait loci and metabolic pathways: genetic control of the concentration of maysin, a corn earworm resistance factor, in maize silks. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[4]  M. T. Jackson,et al.  Predicting quantitative variation within rice germplasm using molecular markers , 1996, Heredity.

[5]  D. Curtis,et al.  Use of siblings as controls in case‐control association studies , 1997, Annals of human genetics.

[6]  Elizabeth A Kellogg,et al.  The evolution of nuclear genome structure in seed plants. , 2004, American journal of botany.

[7]  Garth R. Brown,et al.  Nucleotide diversity and linkage disequilibrium in loblolly pine. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[8]  M. Purugganan,et al.  Molecular population genetics of the Arabidopsis CLAVATA2 region. The genomic scale of variation and selection in a selfing species. , 2003, Genetics.

[9]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[10]  N L Kaplan,et al.  Removing the sampling restrictions from family-based tests of association for a quantitative-trait locus. , 2000, American journal of human genetics.

[11]  Chris Sander,et al.  Characterizing gene sets with FuncAssociate , 2003, Bioinform..

[12]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  S. Dudoit,et al.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. , 2002, Nucleic acids research.

[14]  O. Hardy,et al.  Estimation of pairwise relatedness between individuals and characterization of isolation‐by‐distance processes using dominant genetic markers , 2003, Molecular ecology.

[15]  M. Carrington,et al.  A scan for linkage disequilibrium across the human genome. , 1999, Genetics.

[16]  N M Laird,et al.  A discordant-sibship test for disequilibrium and linkage: no need for parental data. , 1998, American journal of human genetics.

[17]  M. Robles,et al.  University of Birmingham High throughput functional annotation and data mining with the Blast2GO suite , 2022 .

[18]  M. Ganal,et al.  Analysis of molecular diversity, population structure and linkage disequilibrium in a worldwide survey of cultivated barley germplasm (Hordeum vulgare L.) , 2006, BMC Genetics.

[19]  C R Weinberg,et al.  Allowing for missing parents in genetic studies of case-parent triads. , 1999, American journal of human genetics.

[20]  Minoru Kanehisa,et al.  The KEGG database. , 2002, Novartis Foundation symposium.

[21]  Virginia Walbot,et al.  Translational Genomics for Bioenergy Production from Fuelstock Grasses: Maize as the Model Species , 2007, The Plant Cell Online.

[22]  Zheng Xie,et al.  AMADA: analysis of microarray data , 2001, Bioinform..

[23]  Kevin R. Thornton,et al.  Nucleotide Variation Along the Drosophila melanogaster Fourth Chromosome , 2002, Science.

[24]  J. Reif,et al.  Comparison of Linkage Disequilibrium in Elite European Maize Inbred Lines using AFLP and SSR Markers , 2006, Molecular Breeding.

[25]  Richard M. Clark,et al.  The PHYTOCHROME C photoreceptor gene mediates natural variation in flowering and growth responses of Arabidopsis thaliana , 2006, Nature Genetics.

[26]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[27]  Carsten O. Daub,et al.  The mutual information: Detecting and evaluating dependencies between variables , 2002, ECCB.

[28]  S. Costanzo,et al.  Linkage disequilibrium mapping of a Verticillium dahliae resistance quantitative trait locus in tetraploid potato (Solanum tuberosum) through a candidate gene approach , 2004, Theoretical and Applied Genetics.

[29]  S Rozen,et al.  Primer3 on the WWW for general users and for biologist programmers. , 2000, Methods in molecular biology.

[30]  R. Bernardo,et al.  In silico mapping of quantitative trait loci in maize , 2004, Theoretical and Applied Genetics.

[31]  D. Ware,et al.  Maintaining collections of mutants for plant functional genomics. , 2003, Methods in molecular biology.

[32]  Mattias Jakobsson,et al.  The Pattern of Polymorphism in Arabidopsis thaliana , 2005, PLoS biology.

[33]  P. Zimmermann,et al.  GENEVESTIGATOR. Arabidopsis Microarray Database and Analysis Toolbox1[w] , 2004, Plant Physiology.

[34]  F. G. Giesbrecht,et al.  Two-stage analysis based on a mixed model: large-sample asymptotic theory and small-sample simulation results , 1985 .

[35]  K. Kidd,et al.  Transmission/disequilibrium tests using multiple tightly linked markers. , 2000, American journal of human genetics.

[36]  Geoffrey J. Barton,et al.  GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes , 2004, BMC Bioinformatics.

[37]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[38]  Andreas Graner,et al.  Genic microsatellite markers in plants: features and applications. , 2005, Trends in biotechnology.

[39]  Thomas Thiel,et al.  In silico analysis on frequency and distribution of microsatellites in ESTs of some cereal species. , 2002, Cellular & molecular biology letters.

[40]  John M. Hancock,et al.  PlantProm: a database of plant promoter sequences , 2003, Nucleic Acids Res..

[41]  S. Wessler,et al.  Isolation of the transposable maize controlling elements Ac and Ds , 1983, Cell.

[42]  M. Yano,et al.  Genetic and molecular dissection of quantitative traits in rice , 1997, Plant Molecular Biology.

[43]  F. V. van Eeuwijk,et al.  A Mixed-Model Approach to Association Mapping Using Pedigree Information With an Illustration of Resistance to Phytophthora infestans in Potato , 2007, Genetics.

[44]  Weida Tong,et al.  Development of public toxicogenomics software for microarray data management and analysis. , 2004, Mutation research.

[45]  M. Morgante,et al.  Contrasting Effects of Selection on Sequence Diversity and Linkage Disequilibrium at Two Phytoene Synthase Loci Online version contains Web-only data. Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.012526. , 2003, The Plant Cell Online.

[46]  John Quackenbush,et al.  The TIGR Gene Indices: reconstruction and representation of expressed gene sequences , 2000, Nucleic Acids Res..

[47]  Joaquín Dopazo,et al.  BABELOMICS: a systems biology perspective in the functional annotation of genome-scale experiments , 2006, Nucleic Acids Res..

[48]  Nikolay A. Kolchanov,et al.  GeneNet in 2005 , 2004, Nucleic Acids Res..

[49]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[50]  G. Wenzel,et al.  Development and application of functional markers in maize , 2005, Euphytica.

[51]  J. Reif,et al.  Linkage disequilibrium in European elite maize germplasm investigated with SSRs , 2005, Theoretical and Applied Genetics.

[52]  E Brunner,et al.  Design and analysis of two-color microarray experiments using linear models. , 2005, Methods of information in medicine.

[53]  Heike Hofmann,et al.  MetNet: Software to Build and Model the Biogenetic Lattice of Arabidopsis , 2003, Comparative and functional genomics.

[54]  W. Wong,et al.  GoSurfer: a graphical interactive tool for comparative analysis of large gene sets in Gene Ontology space. , 2004, Applied bioinformatics.

[55]  R. Wu,et al.  Modeling Extent and Distribution of Zygotic Disequilibrium: Implications for a Multigenerational Canine Pedigree , 2006, Genetics.

[56]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[57]  K. Tokunaga,et al.  Comparison of statistical power between 2 * 2 allele frequency and allele positivity tables in case-control studies of complex disease genes. , 2001, Annals of human genetics.

[58]  Mehmet Bilgen,et al.  A software program combining sequence motif searches with keywords for finding repeats containing DNA sequences , 2004, Bioinform..

[59]  Ramon C. Littell,et al.  Analysis of unbalanced mixed model data: A case study comparison of ANOVA versus REML/GLS , 2002 .

[60]  Hans-Peter Piepho,et al.  A Hitchhiker's guide to mixed models for randomized experiments , 2003 .

[61]  Lisa C. Harper,et al.  MaizeGDB's new data types, resources and activities , 2007, Nucleic Acids Res..

[62]  Yoshihiro Ugawa,et al.  Plant cis-acting regulatory DNA elements (PLACE) database: 1999 , 1999, Nucleic Acids Res..

[63]  Gonçalo R. Abecasis,et al.  GOLD-Graphical Overview of Linkage Disequilibrium , 2000, Bioinform..

[64]  Carolyn J. Lawrence-Dill,et al.  Comparative Plant Genomics Resources at PlantGDB1 , 2005, Plant Physiology.

[65]  E. Winzeler,et al.  Protein pathway and complex clustering of correlated mRNA and protein expression analyses in Saccharomyces cerevisiae , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[66]  L. Excoffier,et al.  Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population. , 1995, Molecular biology and evolution.

[67]  Hidde de Jong,et al.  Genetic Network Analyzer: qualitative simulation of genetic regulatory networks , 2003, Bioinform..

[68]  Purvesh Khatri,et al.  Onto-Tools: an ensemble of web-accessible, ontology-based tools for the functional design and interpretation of high-throughput gene expression experiments , 2004, Nucleic Acids Res..

[69]  Kexuan Tang,et al.  Preference of simple sequence repeats in coding and non-coding regions of Arabidopsis thaliana , 2004, Bioinform..

[70]  A. McRae,et al.  Linkage disequilibrium in domestic sheep. , 2002, Genetics.

[71]  A I Saeed,et al.  TM4: a free, open-source system for microarray data management and analysis. , 2003, BioTechniques.

[72]  Graziano Pesole,et al.  CLEANUP: a fast computer program for removing redundancies from nucleotide sequence databases , 1996, Comput. Appl. Biosci..

[73]  N Risch,et al.  The relative power of family-based and case-control designs for linkage disequilibrium studies of complex human diseases I. DNA pooling. , 1998, Genome research.

[74]  Edward H. Coe,et al.  The Genetics of Corn , 1988 .

[75]  K. K. Dobbin,et al.  Characterizing dye bias in microarray experiments , 2005, Bioinform..

[76]  H. Piepho,et al.  Potential causes of linkage disequilibrium in a European maize breeding program investigated with computer simulations , 2007, Theoretical and Applied Genetics.

[77]  Richard Simon,et al.  A random variance model for detection of differential gene expression in small microarray experiments , 2003, Bioinform..

[78]  B. R. Wiseman,et al.  Maysin Content and Growth of Corn Earworm Larvae (Lepidoptera: Noctuidae) on Silks from First and Second Ears of Corn , 1993 .

[79]  Andrew B. Nobel,et al.  Significance analysis of functional categories in gene expression studies: a structured permutation approach , 2005, Bioinform..

[80]  S. R. Wilson,et al.  On extending the transmission/disequilibrium test (TDT) , 1997, Annals of human genetics.

[81]  Joaquín Dopazo,et al.  GEPAS, an experiment-oriented pipeline for the analysis of microarray gene expression data , 2005, Nucleic Acids Res..

[82]  R. Dixon,et al.  Plant metabolomics: large-scale phytochemistry in the functional genomics era. , 2003, Phytochemistry.

[83]  Joshua M. Stuart,et al.  A Gene-Coexpression Network for Global Discovery of Conserved Genetic Modules , 2003, Science.

[84]  C. Falk,et al.  Haplotype relative risks: an easy reliable way to construct a proper control sample for risk calculations , 1987, Annals of human genetics.

[85]  H. Ellegren Microsatellites: simple sequences with complex evolution , 2004, Nature Reviews Genetics.

[86]  W J Ewens,et al.  The TDT and other family-based tests for linkage disequilibrium and association. , 1996, American journal of human genetics.

[87]  C. Lawrence,et al.  Human-mouse genome comparisons to locate regulatory sites , 2000, Nature Genetics.

[88]  Nan Wang,et al.  AgBase: a functional genomics resource for agriculture , 2006, BMC Genomics.

[89]  D. Allison,et al.  Microarray data analysis: from disarray to consolidation and consensus , 2006, Nature Reviews Genetics.

[90]  Zhenjun Hu,et al.  VisANT: an online visualization and analysis tool for biological interaction data , 2004, BMC Bioinformatics.

[91]  Chen-Tuo Liao,et al.  Statistical Designs for Two‐Color Spotted Microarray Experiments , 2007, Biometrical journal. Biometrische Zeitschrift.

[92]  R. Martienssen,et al.  ramosa2 Encodes a LATERAL ORGAN BOUNDARY Domain Protein That Determines the Fate of Stem Cells in Branch Meristems of Maize[W] , 2006, The Plant Cell Online.

[93]  P. Wincker,et al.  Analysis of 13000 unique Citrus clusters associated with fruit quality, production and salinity tolerance , 2007, BMC Genomics.

[94]  R. Sinden,et al.  DNA Polymerase III Proofreading Mutants Enhance the Expansion and Deletion of Triplet Repeat Sequences in Escherichia coli * , 2000, The Journal of Biological Chemistry.

[95]  Xavier Estivill,et al.  Disorders: Filling the Gaps and Exploring Complexity in Genome-Wide Association Studies , 2022 .

[96]  G. Eizenga,et al.  Molecular diversity and genome-wide linkage disequilibrium patterns in a worldwide collection of Oryza sativa and its wild relatives , 2008, Euphytica.

[97]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[98]  L. Sandkuijl,et al.  Perspectives of identity by descent (IBD) mapping in founder populations , 1995, Clinical and experimental allergy : journal of the British Society for Allergy and Clinical Immunology.

[99]  Eli Stahl,et al.  Signature of balancing selection in Arabidopsis , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[100]  W. Ewens,et al.  Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). , 1993, American journal of human genetics.

[101]  E. Buckler,et al.  Structure of linkage disequilibrium in plants. , 2003, Annual review of plant biology.

[102]  David B. Allison,et al.  The PowerAtlas: a power and sample size atlas for microarray experimental design and research , 2006, BMC Bioinformatics.

[103]  G W Bird,et al.  Plant and soil nematodes: societal impact and focus for the future. , 1994, Journal of nematology.

[104]  L. Lukens,et al.  The origin of the naked grains of maize , 2005, Nature.

[105]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[106]  Gordon K. Smyth,et al.  Use of within-array replicate spots for assessing differential expression in microarray experiments , 2005, Bioinform..

[107]  P. Cregan,et al.  Single-nucleotide polymorphisms in soybean. , 2003, Genetics.

[108]  Jelle J. Goeman,et al.  Testing association of a pathway with survival using gene expression data , 2005, Bioinform..

[109]  Dan Nettleton,et al.  A Discussion of Statistical Methods for Design and Analysis of Microarray Experiments for Plant Scientists , 2006, The Plant Cell Online.

[110]  T. Rocheford,et al.  Dissection of Maize Kernel Composition and Starch Production by Candidate Gene Association , 2004, The Plant Cell Online.

[111]  E. Coe,et al.  The properties, origin, and mechanism of conversion-type inheritance at the B locus in maize. , 1966, Genetics.

[112]  Curtis E. Dyreson,et al.  Genome analysis Athena : a resource for rapid visualization and systematic analysis of Arabidopsis promoter sequences , 2005 .

[113]  F. Eeuwijk,et al.  Linkage Disequilibrium Mapping of Morphological, Resistance, and Other Agronomically Relevant Traits in Modern Spring Barley Cultivars , 2005, Molecular Breeding.

[114]  P. Cornelius,et al.  Approximate F-tests of multiple degree of freedom hypotheses in generalized least squares analyses of unbalanced split-plot experiments , 1996 .

[115]  P. Oefner,et al.  The extent of linkage disequilibrium in Arabidopsis thaliana , 2002, Nature Genetics.

[116]  D. Curtis,et al.  An extended transmission/disequilibrium test (TDT) for multi‐allele marker loci , 1995, Annals of human genetics.

[117]  F. Eeuwijk,et al.  Association mapping of quality traits in potato (Solanum tuberosum L.) , 2008, Euphytica.

[118]  John F. Monahan,et al.  Monte Carlo Comparison of ANOVA, MIVQUE, REML, and ML Estimators of Variance Components , 1984 .

[119]  Mark L. Blaxter,et al.  NEMBASE: a resource for parasitic nematode ESTs , 2004, Nucleic Acids Res..

[120]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[121]  Detlef Weigel,et al.  Large-scale identification of single-feature polymorphisms in complex genomes. , 2003, Genome research.

[122]  D. A. Palmieri,et al.  Frequency and distribution of microsatellites from ESTs of citrus , 2007 .

[123]  E. Nevo,et al.  Microsatellites within genes: structure, function, and evolution. , 2004, Molecular biology and evolution.

[124]  M. Waterman,et al.  A dynamic programming algorithm for haplotype block partitioning , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[125]  I. Romagosa,et al.  RFLP markers associated with major genes controlling heading date evaluated in a barley germ plasm pool , 1999, Heredity.

[126]  G A Satten,et al.  Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. , 2001, American journal of human genetics.

[127]  J. Gallagher,et al.  Association of Candidate Genes With Flowering Time and Water-Soluble Carbohydrate Content in Lolium perenne (L.) , 2007, Genetics.

[128]  G. Rubin,et al.  The Role of the Genome Project in Determining Gene Function: Insights from Model Organisms , 1996, Cell.

[129]  B. Mcclintock The origin and behavior of mutable loci in maize , 1950, Proceedings of the National Academy of Sciences.

[130]  M. Morgante,et al.  Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes , 2002, Nature Genetics.

[131]  Kiana Toufighi,et al.  The Botany Array Resource: E-northerns, Expression Angling, and Promoter Analyses , 2022 .

[132]  Dennis B. Troup,et al.  NCBI GEO: mining millions of expression profiles—database and tools , 2004, Nucleic Acids Res..

[133]  Hiroaki Kitano,et al.  CellDesigner: a process diagram editor for gene-regulatory and biochemical networks , 2003 .

[134]  Toby Hodgkin,et al.  In situ conservation of crop wild relatives: status and trends , 2004, Biodiversity & Conservation.

[135]  Richard Simon,et al.  Questions and answers on design of dual-label microarrays for identifying differentially expressed genes. , 2003, Journal of the National Cancer Institute.

[136]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[137]  Hans-Peter Piepho,et al.  Analysis of unbalanced data by mixed linear models using the MIXED procedure of the SAS System , 2005 .

[138]  Dennis B. Troup,et al.  NCBI GEO: mining tens of millions of expression profiles—database and tools update , 2006, Nucleic Acids Res..

[139]  J. Pounds,et al.  Data merging for integrated microarray and proteomic analysis. , 2006, Briefings in functional genomics & proteomics.

[140]  E. Buckler,et al.  Using natural allelic diversity to evaluate gene function. , 2003, Methods in molecular biology.

[141]  Pierre R. Bushel,et al.  Assessing Gene Significance from cDNA Microarray Expression Data via Mixed Models , 2001, J. Comput. Biol..

[142]  Mariana Benítez,et al.  Gene regulatory network models for plant development. , 2007, Current opinion in plant biology.

[143]  M. Sorrells,et al.  Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat , 2002, Plant Molecular Biology.

[144]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[145]  O. L. May,et al.  Genetic Similarity Indices for Ancestral Cotton Cultivars and their Impact on Genetic Diversity Estimates of Modern Cultivars , 1999 .

[146]  D. Levinson,et al.  Simulation studies of detection of a complex disease in a partially isolated population. , 2001, American journal of medical genetics.

[147]  Gavin Sherlock,et al.  The Longhorn Array Database (LAD): An Open-Source, MIAME compliant implementation of the Stanford Microarray Database (SMD) , 2003, BMC Bioinformatics.

[148]  Dorrie Main,et al.  Frequency, type, distribution and annotation of simple sequence repeats in Rosaceae ESTs , 2005, Functional & Integrative Genomics.

[149]  B. Palsson,et al.  The model organism as a system: integrating 'omics' data sets , 2006, Nature Reviews Molecular Cell Biology.

[150]  Xiangqin Cui,et al.  How Many Mice and How Many Arrays? Replication in Mouse cDNA Microarray Experiments , 2004 .

[151]  Steven G. Gilmour,et al.  Design of Microarray Experiments for Genetical Genomics Studies , 2006, Genetics.

[152]  Simon Tavaré,et al.  beadarray: R classes and methods for Illumina bead-based data , 2007, Bioinform..

[153]  A. Long,et al.  The Lowdown on Linkage Disequilibrium , 2003, The Plant Cell Online.

[154]  Keyan Zhao,et al.  An Arabidopsis Example of Association Mapping in Structured Samples , 2006, PLoS genetics.

[155]  B S Weir,et al.  Power studies for the transmission/disequilibrium tests with multiple alleles. , 1997, American journal of human genetics.

[156]  Torulf Mollestad,et al.  Additional Gene Ontology structure for improved biological reasoning , 2006, Bioinform..

[157]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[158]  Roderick D. Ball,et al.  Experimental Designs for Reliable Detection of Linkage Disequilibrium in Unstructured Random Population Association Studies , 2005, Genetics.

[159]  G F V Glonek,et al.  Factorial and time course designs for cDNA microarray experiments. , 2004, Biostatistics.

[160]  Kathleen Marchal,et al.  Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes1 , 2003, Plant Physiology.

[161]  C. Moritz,et al.  Reticulate evolution and the origins of ribosomal internal transcribed spacer diversity in apomictic Meloidogyne. , 1999, Molecular biology and evolution.

[162]  J. Terwilliger A powerful likelihood method for the analysis of linkage disequilibrium between trait loci and one or more polymorphic marker loci. , 1995, American journal of human genetics.

[163]  S. Subbotin,et al.  Application of the secondary structure model of rRNA for phylogeny: D2-D3 expansion segments of the LSU gene of plant-parasitic nematodes from the family Hoplolaimidae Filipjev, 1934. , 2007, Molecular phylogenetics and evolution.

[164]  Sudhir Gupta,et al.  Balanced Factorial Designs for cDNA Microarray Experiments , 2006 .

[165]  Xiaofeng Zhu,et al.  Association mapping, using a mixture model for complex traits , 2002, Genetic epidemiology.

[166]  S. Colowick,et al.  Methods in Enzymology , Vol , 1966 .

[167]  B. Gill,et al.  Development and mapping of EST-derived simple sequence repeat markers for hexaploid wheat. , 2004, Genome.

[168]  D. Pe’er,et al.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data , 2003, Nature Genetics.

[169]  Hongyu Zhao,et al.  Test of Association for Quantitative Traits in General Pedigrees: The Quantitative Pedigree Disequilibrium Test , 2001, Genetic epidemiology.

[170]  Sergio Contrino,et al.  ArrayExpress—a public repository for microarray gene expression data at the EBI , 2004, Nucleic Acids Res..

[171]  R. Varshney,et al.  Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.) , 2003, Theoretical and Applied Genetics.

[172]  G. Zhong,et al.  Analysis of microsatellites in citrus unigenes. , 2006, Yi chuan xue bao = Acta genetica Sinica.

[173]  Raya Khanin,et al.  Near‐optimal designs for dual channel microarray studies , 2005 .

[174]  U. Mansmann,et al.  Testing Differential Gene Expression in Functional Groups , 2005, Methods of Information in Medicine.

[175]  E A Thompson,et al.  Linkage disequilibrium mapping: the role of population history, size, and structure. , 2001, Advances in genetics.

[176]  M. McMullen,et al.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness , 2006, Nature Genetics.

[177]  José Gadea,et al.  Microarray technology in agricultural research , 2007 .

[178]  K. Lange,et al.  A Conditional Inference Framework for Extending the Transmission/Disequilibrium Test , 1998, Human Heredity.

[179]  Hans Lehrach,et al.  GOblet: a platform for Gene Ontology annotation of anonymous sequence data , 2004, Nucleic Acids Res..

[180]  Peter Bühlmann,et al.  Analyzing gene expression data in terms of gene sets: methodological issues , 2007, Bioinform..

[181]  Juliet M Chapman,et al.  Detecting Disease Associations due to Linkage Disequilibrium Using Haplotype Tags: A Class of Tests and the Determinants of Statistical Power , 2003, Human Heredity.

[182]  Kevin F. Smith,et al.  SNP discovery, validation, haplotype structure and linkage disequilibrium in full-length herbage nutritive quality genes of perennial ryegrass (Lolium perenne L.) , 2007, Molecular Genetics and Genomics.

[183]  Amanda J. Garris,et al.  Population structure and its effect on haplotype diversity and linkage disequilibrium surrounding the xa5 locus of rice (Oryza sativa L.). , 2003, Genetics.

[184]  George A. Milliken,et al.  Experimental Design for Two-Color Microarrays Applied in a Pre-Existing Split-Plot Experiment , 2007 .

[185]  Ju-Kyung Yu,et al.  Nonrandom distribution and frequencies of genomic and EST-derived microsatellite markers in rice, wheat, and barley , 2005, BMC Genomics.

[186]  F. V. van Eeuwijk,et al.  Linkage Disequilibrium Mapping of Yield and Yield Stability in Modern Spring Barley Cultivars , 2004, Genetics.

[187]  Joaquín Dopazo,et al.  GEPAS: a web-based resource for microarray gene expression data analysis , 2003, Nucleic Acids Res..

[188]  N M Laird,et al.  Family-based tests of association in the presence of linkage. , 2000, American journal of human genetics.

[189]  Andrew J. Olson,et al.  GeneSeer: A sage for gene names and genomic resources , 2005, BMC Genomics.

[190]  D. Goldstein,et al.  Population genomics: Linkage disequilibrium holds the key , 2001, Current Biology.

[191]  Joaquín Dopazo,et al.  Next station in microarray data analysis: GEPAS , 2006, Nucleic Acids Res..

[192]  Helen E. Parkinson,et al.  ArrayExpress—a public database of microarray experiments and gene expression profiles , 2006, Nucleic Acids Res..

[193]  HighWire Press Philosophical Transactions of the Royal Society of London , 1781, The London Medical Journal.

[194]  Peter D. Karp,et al.  MetaCyc: a multiorganism database of metabolic pathways and enzymes. , 2004, Nucleic acids research.

[195]  Jun Lu,et al.  Pathway level analysis of gene expression using singular value decomposition , 2005, BMC Bioinformatics.

[196]  Gary A. Churchill,et al.  Analysis of Variance for Gene Expression Microarray Data , 2000, J. Comput. Biol..

[197]  B. Li,et al.  Analysis on Frequency and Density of Microsatellites in Coding Sequences of Several Eukaryotic Genomes , 2004, Genomics, proteomics & bioinformatics.

[198]  Y. Barrière,et al.  Genetic diversity associated with variation in silage corn digestibility for three O-methyltransferase genes involved in lignin biosynthesis , 2004, Theoretical and Applied Genetics.

[199]  Z. J. Zhang,et al.  Associations of simple sequence repeats with quantitative trait variation including biotic and abiotic stress tolerance in Hordeum spontaneum , 2003 .

[200]  E. Pang,et al.  An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: The basic concepts , 2005, Euphytica.

[201]  L. Singh,et al.  Genome-wide analysis of microsatellite repeats in humans: their abundance and density in specific genomic regions , 2003, Genome Biology.

[202]  F. Clerget-Darpoux,et al.  Statistical properties of the allelic and genotypic transmission/disequilibrium test for multiallelic markers , 1995, Genetic epidemiology.

[203]  M. Purugganan,et al.  The Extent of Linkage Disequilibrium in Rice (Oryza sativa L.) , 2007, Genetics.

[204]  Satterthwaite Fe An approximate distribution of estimates of variance components. , 1946 .

[205]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[206]  W. Liang,et al.  9) TM4 Microarray Software Suite , 2006 .

[207]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[208]  Brad T. Sherman,et al.  DAVID: Database for Annotation, Visualization, and Integrated Discovery , 2003, Genome Biology.

[209]  Agim Ballvora,et al.  Assessing genetic potential in germplasm collections of crop plants by marker-trait association: a case study for potatoes with quantitative variation of resistance to late blight and maturity type , 2004, Molecular Breeding.

[210]  Axel Uhl,et al.  Model-Driven Architecture , 2002, OOIS Workshops.

[211]  Gavin Sherlock,et al.  The Stanford Microarray Database: implementation of new analysis tools and open source release of software , 2002, Nucleic Acids Res..

[212]  Falk Schreiber,et al.  VANTED: A system for advanced data analysis and visualization in the context of biological networks , 2006, BMC Bioinformatics.

[213]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[214]  P. Donnelly,et al.  Association mapping in structured populations. , 2000, American journal of human genetics.

[215]  L. Jorde,et al.  Linkage disequilibrium and the search for complex disease genes. , 2000, Genome research.

[216]  Nick James,et al.  NASCArrays: a repository for microarray data generated by NASC's transcriptomics service , 2004, Nucleic Acids Res..

[217]  Russell D. Wolfinger,et al.  The contributions of sex, genotype and age to transcriptional variance in Drosophila melanogaster , 2001, Nature Genetics.

[218]  Olivier Poch,et al.  GOAnno: GO annotation based on multiple alignment , 2005, Bioinform..

[219]  T. Säll,et al.  Linkage disequilibrium mapping of the bolting gene in sea beet using AFLP markers. , 2001, Genetical research.

[220]  Juan Antonio Vizcaíno,et al.  Generation, annotation and analysis of ESTs from Trichoderma harzianum CECT 2413 , 2006, BMC Genomics.

[221]  G. Abecasis,et al.  A general test of association for quantitative traits in nuclear families. , 2000, American journal of human genetics.

[222]  Akihiko Konagaya,et al.  KnowledgeEditor: a new tool for interactive modeling and analyzing biological pathways based on microarray data , 2003, Bioinform..

[223]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[224]  Y. Barrière,et al.  Nucleotide diversity of the ZmPox3 maize peroxidase gene: Relationships between a MITE insertion in exon 2 and variation in forage maize digestibility , 2004, BMC Genetics.

[225]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[226]  Susumu Goto,et al.  The KEGG databases at GenomeNet , 2002, Nucleic Acids Res..

[227]  D. Neale,et al.  Nucleotide Diversity and Linkage Disequilibrium in Cold-Hardiness- and Wood Quality-Related Candidate Genes in Douglas Fir , 2005, Genetics.

[228]  P. Sand A lesson not learned: allele misassignment , 2007, Behavioral and Brain Functions.

[229]  L. Jorde Linkage disequilibrium as a gene-mapping tool. , 1995, American journal of human genetics.

[230]  K. Roeder,et al.  The power of genomic control. , 2000, American journal of human genetics.

[231]  Debashish Bhattacharya,et al.  Cyanobacterial Contribution to Algal Nuclear Genomes Is Primarily Limited to Plastid Functions , 2006, Current Biology.

[232]  X. Cui,et al.  Improved statistical tests for differential gene expression by shrinking variance components estimates. , 2005, Biostatistics.

[233]  Mourad Sahbatou,et al.  Association of NOD2 leucine-rich repeat variants with susceptibility to Crohn's disease , 2001, Nature.

[234]  Martin Kuiper,et al.  Genetic Analysis of Variation in Gene Expression in Arabidopsis thaliana , 2005, Genetics.

[235]  D. Kell Metabolomics and systems biology: making sense of the soup. , 2004, Current opinion in microbiology.

[236]  Kathleen Marchal,et al.  PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences , 2002, Nucleic Acids Res..

[237]  M S Waterman,et al.  Sequence alignment and penalty choice. Review of concepts, case studies and implications. , 1994, Journal of molecular biology.

[238]  P. Hollingsworth,et al.  Neighbour joining trees, dominant markers and population genetic structure , 2004, Heredity.

[239]  David Martin,et al.  GOToolBox: functional analysis of gene datasets based on Gene Ontology , 2004, Genome Biology.

[240]  L. Stein,et al.  Gramene, a Tool for Grass Genomics , 2002, Plant Physiology.

[241]  R. Burdon,et al.  Gene-assisted selection: applications of association genetics for forest tree breeding , 2007 .

[242]  M. Purugganan,et al.  Epistatic interaction between Arabidopsis FRI and FLC flowering time genes generates a latitudinal cline in a life history trait. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[243]  J. W. Dudley,et al.  Corn and Corn Improvement , 1955 .

[244]  S. Gardner,et al.  Phylogenetic analysis of nematodes of the genus Pratylenchus using nuclear 26S rDNA. , 1997, Molecular phylogenetics and evolution.

[245]  C R Weinberg,et al.  A log-linear approach to case-parent-triad data: assessing effects of disease genes that act either directly or through maternal effects and that may be subject to parental imprinting. , 1998, American journal of human genetics.

[246]  J. Vanfleteren,et al.  Phylogenetic relationships within the cyst-forming nematodes (Nematoda, Heteroderidae) based on analysis of sequences from the ITS regions of ribosomal DNA. , 2001, Molecular phylogenetics and evolution.

[247]  N. H. Shah,et al.  CLENCH: a program for calculating Cluster ENriCHment using the Gene Ontology , 2004, Bioinform..

[248]  S. Tingey,et al.  Whole genome scan detects an allelic variant of fad2 associated with increased oleic acid levels in maize , 2007, Molecular Genetics and Genomics.

[249]  Sergei Egorov,et al.  Pathway studio - the analysis and navigation of molecular networks , 2003, Bioinform..

[250]  D. di Bernardo,et al.  How to infer gene networks from expression profiles , 2007, Molecular systems biology.

[251]  John P. Rice,et al.  TDT with covariates and genomic screens with mod scores: Their behavior on simulated data , 1995, Genetic epidemiology.

[252]  Gregory D Schuler,et al.  Sequence mapping by electronic PCR , 1997, Genome research.

[253]  Steven G. Schroeder,et al.  Physical and Genetic Structure of the Maize Genome Reflects Its Complex Evolutionary History , 2007, PLoS genetics.

[254]  M. Sorrells,et al.  Association Analysis as a Strategy for Improvement of Quantitative Traits in Plants , 2006 .

[255]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[256]  R. Evans,et al.  Polymorphisms in Cinnamoyl CoA Reductase (CCR) Are Associated With Variation in Microfibril Angle in Eucalyptus spp. , 2005, Genetics.

[257]  R. Wu,et al.  Estimation of Multilocus Linkage Disequilibria in Diploid Populations With Dominant Markers , 2007, Genetics.

[258]  Chen-Tuo Liao,et al.  Statistical designs for two-color microarray experiments involving technical replication , 2006, Comput. Stat. Data Anal..

[259]  M. Morgante,et al.  Corn and humans: recombination and linkage disequilibrium in two genomes of similar size. , 2004, Trends in genetics : TIG.

[260]  Pierre Baldi,et al.  A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes , 2001, Bioinform..

[261]  S. Mirkin,et al.  DNA structures, repeat expansions and human hereditary disorders. , 2006, Current opinion in structural biology.

[262]  John W. Pinney,et al.  Arabidopsis Co-expression Tool (ACT): web server tools for microarray-based gene expression analysis , 2006, Nucleic Acids Res..

[263]  G. Tuskan,et al.  Comparative sequence analysis between orthologous regions of the Arabidopsis and Populus genomes reveals substantial synteny and microcollinearity , 2003 .

[264]  Brandon S. Gaut,et al.  Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp. mays L.) , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[265]  Roland Eils,et al.  GOPET: A tool for automated predictions of Gene Ontology terms , 2006, BMC Bioinformatics.

[266]  M. McMullen,et al.  Association analysis of candidate genes for maysin and chlorogenic acid accumulation in maize silks , 2005, Theoretical and Applied Genetics.

[267]  P. Hedrick,et al.  Gametic disequilibrium measures: proceed with caution. , 1987, Genetics.

[268]  Joachim Selbig,et al.  PaVESy: Pathway Visualization and Editing System , 2004, Bioinform..

[269]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[270]  Jun Hua,et al.  Extending the loop design for two-channel microarray experiments. , 2006, Genetical research.

[271]  Purvesh Khatri,et al.  Onto-Tools, the toolkit of the modern biologist: Onto-Express, Onto-Compare, Onto-Design and Onto-Translate , 2003, Nucleic Acids Res..

[272]  Juan P. Steibel,et al.  Reassessing Design and Analysis of two-Colour Microarray Experiments Using Mixed Effects Models , 2005, Comparative and functional genomics.

[273]  Sarah Hake,et al.  Advances in maize genomics: the emergence of positional cloning. , 2006, Current opinion in plant biology.

[274]  G. Churchill,et al.  Statistical design and the analysis of gene expression microarray data. , 2007, Genetical research.

[275]  P. Langridge,et al.  Extreme Population-Dependent Linkage Disequilibrium Detected in an Inbreeding Plant Species, Hordeum vulgare , 2006, Genetics.

[276]  M Knapp,et al.  Reconstructing parental genotypes when testing for linkage in the presence of association. , 2001, Theoretical population biology.

[277]  G. Pesole,et al.  Structural and evolutionary analysis of the ribosomal genes of the parasitic nematode Meloidogyne artiellia suggests its ancient origin. , 2002, Molecular and biochemical parasitology.

[278]  B. Walsh,et al.  Association mapping in plant populations. , 2001 .

[279]  G. Pertea,et al.  RESOURCERER: a database for annotating and linking microarray resources within and across species , 2001, Genome Biology.

[280]  Ingrid Lönnstedt Replicated microarray data , 2001 .

[281]  T. Mohapatra,et al.  Unigene derived microsatellite markers for the cereal genomes , 2006, Theoretical and Applied Genetics.

[282]  Daniel Rabinowitz,et al.  A Unified Approach to Adjusting Association Tests for Population Admixture with Arbitrary Pedigree Structure and Arbitrary Missing Marker Information , 2000, Human Heredity.

[283]  G. Wenzel,et al.  High levels of linkage disequilibrium and associations with forage quality at a Phenylalanine Ammonia-Lyase locus in European maize (Zea mays L.) inbreds , 2006, Theoretical and Applied Genetics.

[284]  G. Eizenga,et al.  Association mapping of yield and its components in rice cultivars , 2007, Molecular Breeding.

[285]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.

[286]  Kevin Y Yip,et al.  Comparing classical pathways and modern networks: towards the development of an edge ontology. , 2007, Trends in biochemical sciences.

[287]  R. Ball Statistical Analysis and Experimental Design , 2007 .

[288]  Lan V. Zhang,et al.  Evidence for dynamically organized modularity in the yeast protein–protein interaction network , 2004, Nature.

[289]  M. Gerstein,et al.  Genomic analysis of regulatory network dynamics reveals large topological changes , 2004, Nature.

[290]  D. Freckman,et al.  A world perspective on nematology : The role of the society , 1987 .