Searching and Mining Visually Observed Phenotypes of Maize Mutants

Thousands of maize mutants exist, and they are invaluable resources for plant research. Geneticists use them to study the underlying mechanisms of biochemistry, cell biology, cell development, and cell physiology. To understand such complex processes efficiently, researchers need the most current versions of genetic and physical maps, tools that can recognize novel phenotypes or classify known ones, and an intimate knowledge of the biochemical processes that generate physiological and phenotypic effects. They must also know how all of these factors change and differ among species, diverse alleles, germplasms, and environmental conditions. While robust databases, such as MaizeGDB, exist for some of these types of raw data, other crucial components are missing. Moreover, the management of visually observed mutant phenotypes is still in its infancy, and complex query methods that draw upon high-level, aggregated information to answer geneticists' questions are even less developed. In this paper, we address this challenge by proposing a robust framework for managing knowledge of visually observed phenotypes, mining the correlation of visual characteristics with genetic maps, and discovering knowledge about the cross-species conservation of visual and genetic patterns. The ultimate goal of this research is to allow a geneticist to submit phenotypic and genomic information on a mutant to a knowledge base and ask, "What genes or environmental factors cause this visually observed phenotype?"
