Modeling and managing experimental data using FuGE.

The Functional Genomics Experiment data model (FuGE) has been developed to increase the consistency and efficiency of experimental data modeling in the life sciences, and it has been adopted by a number of high-profile standardization organizations. FuGE can be used: (1) directly, whereby generic modeling constructs are used to represent concepts from specific experimental activities; or (2) as a framework within which method-specific models can be developed. FuGE is both rich and flexible, providing a considerable number of modeling constructs, which can be used in a range of different ways. However, such richness and flexibility also mean that modelers and application developers have choices to make when applying FuGE in a given context. This paper captures emerging best practice in the use of FuGE in the light of the experience of several groups by: (1) proposing guidelines for the use and extension of the FuGE data model; (2) presenting design patterns that reflect recurring requirements in experimental data modeling; and (3) describing a community software tool kit (STK) that supports application development using FuGE. We anticipate that these guidelines will encourage consistent usage of FuGE, and as such, will contribute to the development of convergent data standards in omics research.

[1]  C. Sander,et al.  The HUPO PSI's Molecular Interaction format—a community standard for the representation of protein interaction data , 2004, Nature Biotechnology.

[2]  Stuart Kent,et al.  Model Driven Engineering , 2002, IFM.

[3]  Ela Hunt,et al.  An object model and database for functional genomics , 2004, Bioinform..

[4]  Nigel W. Hardy,et al.  The Functional Genomics Experiment model (FuGE): an extensible framework for standards in functional genomics , 2007, Nature Biotechnology.

[5]  Andrew R Jones,et al.  An Update on Data Standards for Gel Electrophoresis , 2007, Proteomics.

[6]  H KatzRandy Toward a unified framework for version modeling in engineering databases , 1990 .

[7]  Chris F. Taylor,et al.  A systematic approach to modeling, capturing, and disseminating proteomics experimental data , 2003, Nature Biotechnology.

[8]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[9]  Jason E. Stewart,et al.  Design and implementation of microarray gene expression markup language (MAGE-ML) , 2002, Genome Biology.

[10]  Henning Hermjakob,et al.  Entering the Implementation Era A report on the HUPO‐PSI Fall workshop 25–27 September 2006, Washington DC, USA , 2007, Proteomics.

[11]  Nigel W. Hardy,et al.  The first RSBI (ISA-TAB) workshop: "can a simple format work for complex studies?". , 2008, Omics : a journal of integrative biology.

[12]  Douglas C. Schmidt,et al.  Guest Editor's Introduction: Model-Driven Engineering , 2006, Computer.

[13]  Ivar Jacobson,et al.  The Unified Modeling Language User Guide , 1998, J. Database Manag..

[14]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[15]  Lennart Martens,et al.  The minimum information about a proteomics experiment (MIAPE) , 2007, Nature Biotechnology.

[16]  Chris F. Taylor,et al.  A common open representation of mass spectrometry data and its application to proteomics research , 2004, Nature Biotechnology.

[17]  강문설 [서평]「The Unified Modeling Language User Guide」 , 1999 .

[18]  Sean Martin,et al.  Globally distributed object identification for biological knowledgebases , 2004, Briefings Bioinform..

[19]  Henning Hermjakob,et al.  The Gel Electrophoresis Markup Language (GelML) from the Proteomics Standards Initiative , 2010, Proteomics.

[20]  Martin Eisenacher,et al.  Using Laboratory Information Management Systems as central part of a proteomics data workflow , 2010, Proteomics.

[21]  Rob Pooley,et al.  The unified modelling language , 1999, IEE Proc. Softw..

[22]  Josef Spidlen,et al.  Data standards for flow cytometry. , 2006, Omics : a journal of integrative biology.

[23]  Nigel W. Hardy,et al.  The Metabolomics Standards Initiative , 2007, Nature Biotechnology.

[24]  Lennart Martens,et al.  The PSI formal document process and its implementation on the PSI website , 2007, Proteomics.

[25]  Randy H. Katz,et al.  Toward a unified framework for version modeling in engineering databases , 1990, CSUR.