A Comprehensive Overview on Application of Bioinformatics and Computational Statistics in Rice Genomics Toward an Amalgamated Approach for Improving Acquaintance Base

Rice (Oryza sativa L.) is a major crop in the world and provides the staple food for over half of the world population. From thousands of years of cultivation and breeding to recent genomics and systems biology approach, rice has been the focus of agriculture and plant research. Modern scientific research depends on computer technology to organize and analyze large datasets. Rice informatics – a relatively new discipline – has been developing rapidly as a subdiscipline of bioinformatics. Rice informatics devotes to leveraging the power of nature’s experiment of breeding and evolution to extract key findings from sequence and experimental data. Recent advances in high-throughput genotyping and sequencing technologies have changed the landscape of data collection and its analysis by using friendly database access and information retrieval. It focuses on developing and applying database tools and computationally intensive techniques and statistical software (e.g., pattern recognition, data mining, machine learning algorithms, R-statistical, MATLAB, and visualization) which give the opportunity to quickly and efficiently study heap of genomics information, chemical structure, and model generation study. Over recent years, various newly emerged diseases to rice varieties have an increasing concern to agriculturists and pathologists. The establishments of International Rice Information System, Rice Genome Research Project, Integrated Rice Genome Explorer, and Rice Proteome Databases are important initiatives for rice improvement using in silico software (e.g., homology modeling using SWISS Model, Modeler, and Autodock); the recent ongoing research on rice protein and its role in metabolic pathways works is being done around the world. Rice informatics has already started showing its profound impact on agricultural research and developments.

[1]  Sunil K. Mathur Statistical Bioinformatics with R , 2010 .

[2]  M. Gerstein,et al.  The current excitement in bioinformatics-analysis of whole-genome expression data: how does it relate to protein structure and function? , 2000, Current opinion in structural biology.

[3]  Mark E. Cooper,et al.  The International Crop Information System (ICIS) - connects genebank to breeder to farmer's field. , 1996 .

[4]  K. Shinozaki,et al.  Functional genomics using RIKEN Arabidopsis thaliana full-length cDNAs , 2009, Journal of Plant Research.

[5]  N. Provart,et al.  Web-Queryable Large-Scale Data Sets for Hypothesis Generation in Plant Biology , 2009, The Plant Cell Online.

[6]  Shiping Wang,et al.  RMD: a rice mutant database for functional analysis of the rice genome , 2005, Nucleic Acids Res..

[7]  Jeffrey J. Gray,et al.  High-resolution protein-protein docking. , 2006, Current opinion in structural biology.

[8]  M. Yano,et al.  A silicon transporter in rice , 2006, Nature.

[9]  Shri Ram,et al.  Global Information Resources on Rice for Research and Development , 2012 .

[10]  G. Moore,et al.  Cereal Genome Evolution: Grasses, line up and form a circle , 1995, Current Biology.

[11]  J. Kyozuka,et al.  Direct control of shoot meristem activity by a cytokinin-activating enzyme , 2007, Nature.

[12]  I. Singh,et al.  In silico study of interaction between rice proteins enhanced disease susceptibility 1 and phytoalexin deficient 4, the regulators of salicylic acid signalling pathway , 2012, Journal of Biosciences.

[13]  Kazuo Shinozaki,et al.  Phenome analysis in plant species using loss-of-function and gain-of-function mutants. , 2009, Plant & cell physiology.

[14]  Yoshiaki Nagamura,et al.  Rice TOGO Browser: A Platform to Retrieve Integrated Information on Rice Functional and Applied Genomics , 2011, Plant & cell physiology.

[15]  Kanako O. Koyanagi,et al.  Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana. , 2007, Genome research.

[16]  J. Zhai,et al.  The Cornucopia of Small RNAs in Plant Genomes , 2008, Rice.

[17]  B. Meyers,et al.  An expression atlas of rice mRNAs and small RNAs , 2007, Nature Biotechnology.

[18]  Y. Yamazaki,et al.  Oryzabase. An Integrated Biological and Genome Information Database for Rice1[OA] , 2005, Plant Physiology.

[19]  I. Grosse,et al.  Evidence and evolutionary analysis of ancient whole-genome duplication in barley predating the divergence from rice , 2009, BMC Evolutionary Biology.

[20]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[21]  Jun Yu,et al.  The Rice Genome Knowledgebase (RGKbase): an annotation database for rice comparative genomics and evolutionary biology , 2012, Nucleic Acids Res..

[22]  A. Sinha,et al.  Rice Mitogen Activated Protein Kinase Kinase and Mitogen Activated Protein Kinase Interaction Network Revealed by In-Silico Docking and Yeast Two-Hybrid Approaches , 2013, PloS one.

[23]  Manuel Ruiz,et al.  Oryza Tag Line, a phenotypic mutant database for the Génoplante rice insertion line library , 2007, Nucleic Acids Res..

[24]  B. Burr,et al.  International Rice Genome Sequencing Project: the effort to completely sequence the rice genome. , 2000, Current opinion in plant biology.

[25]  Babita Sharma,et al.  Structure and Mechanism of a Transmission Blocking Vaccine Candidate Protein Pfs25 from P. falciparum: A Molecular Modeling and Docking Study , 2008, Silico Biol..

[26]  M. Yano,et al.  An SNP Caused Loss of Seed Shattering During Rice Domestication , 2006, Science.

[27]  Gajendra P. S. Raghava,et al.  Machine learning techniques in disease forecasting: a case study on rice blast prediction , 2006, BMC Bioinformatics.

[28]  Tal Pupko,et al.  Structural Genomics , 2005 .

[29]  M. Tomita,et al.  OryzaPG-DB: Rice Proteome Database based on Shotgun Proteogenomics , 2011, BMC Plant Biology.

[30]  Mark H. Wright,et al.  Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa , 2011, Nature communications.

[31]  Kenichi Higo,et al.  Rice Proteome Database based on two-dimensional polyacrylamide gel electrophoresis: its status in 2003 , 2004, Nucleic Acids Res..

[32]  M. Ashburner,et al.  Annotating eukaryote genomes. , 2000, Current opinion in structural biology.

[33]  M. Yano,et al.  Genome-Wide Haplotype Changes Produced by Artificial Selection during Modern Rice Breeding in Japan , 2012, PloS one.

[34]  Jeffrey J. Gray,et al.  Structural model of the mAb 806-EGFR complex using computational docking followed by computational and experimental mutagenesis. , 2006, Structure.

[35]  Q. Qian,et al.  Cytokinin Oxidase Regulates Rice Grain Production , 2005, Science.

[36]  Kazumasa Murata,et al.  A large-scale collection of phenotypic data describing an insertional mutant population to facilitate functional analysis of rice genes , 2006, Plant Molecular Biology.

[37]  D. F. Cox,et al.  Statistical Procedures for Agricultural Research. , 1984 .

[38]  Ruth Nussinov,et al.  Principles of docking: An overview of search algorithms and a guide to scoring functions , 2002, Proteins.

[39]  Yao-Cheng Lin,et al.  A rice phenomics study—phenotype scoring and seed propagation of a T-DNA insertion-induced rice mutant population , 2007, Plant Molecular Biology.

[40]  T. Fujiwara,et al.  An efflux transporter of silicon in rice , 2007, Nature.

[41]  M. Anputhas,et al.  Selection of Rice Varieties for Recommendation in Sri Lanka: A Complex-free Approach , 2010 .

[42]  H. Leung,et al.  Genetic, Physiological, and Gene Expression Analyses Reveal That Multiple QTL Enhance Yield of Rice Mega-Variety IR64 under Drought , 2013, PloS one.

[43]  C. Soderlund,et al.  The Oryza Map Alignment Project: The Golden Path to Unlocking the Genetic Potential of Wild Rice Species , 2005, Plant Molecular Biology.

[44]  Richard M. Bruskiewich,et al.  Linking genotype to phenotype: the International Rice Information System (IRIS) , 2003, ISMB.

[45]  Takuji Sasaki,et al.  The map-based sequence of the rice genome , 2005, Nature.

[46]  Takashi Matsumoto,et al.  RiceGAAS: an automated annotation system and database for rice genome sequence , 2002, Nucleic Acids Res..

[47]  Jan Leach,et al.  Chemical- and Irradiation-induced Mutants of Indica Rice IR64 for Forward and Reverse Genetics , 2005, Plant Molecular Biology.

[48]  B. Antonio,et al.  Current status of rice informatics resources and breeding applications , 2010 .

[49]  Kenneth L. McNally,et al.  Genomewide SNP variation reveals relationships among landraces and modern varieties of rice , 2009, Proceedings of the National Academy of Sciences.