A Laboratory-Targeted, Data Management and Processing System for the Early Detection Research Network

The National Institutes of Health (NIH), National Cancer Institute's Early Detection Research Network (EDRN) is a cross-institutional collaborative initiative seeking to accelerate the clinical application of cancer biomarker research. Over the past decade, it has been our role, as EDRN's Informatics Center (IC), to develop a comprehensive information services infrastructure as well as a set of software tools and services to support this overall initiative. We have recently developed a novel application called the Laboratory Catalog and Archive Service (LabCAS) whose focus is to extend EDRN IC data management and processing capabilities to EDRN laboratories. By leveraging the same technologies used to manage and process NASA Earth and Planetary data sets, we offer EDRN researchers an effective way of managing their laboratory data. More specifically, LabCAS enables EDRN researchers to reliably archive their experimental data, to optionally share these data in a controlled manner with other researchers, and to gain insight into these data through highly configurable data analysis pipelines tailored to the broad range of biomarker related experiments. This particular collaboration leverages expertise from NASA's Jet Propulsion Laboratory, Vanderbilt University Medical Center, and Dartmouth Medical School, as well as builds upon existing cross-governmental collaboration between NASA and the NIH.

[1]  Sean Kelly,et al.  A Reusable Process Control System Framework for the Orbiting Carbon Observatory and NPP Sounder PEATE Missions , 2009, 2009 Third IEEE International Conference on Space Mission Challenges for Information Technology.

[2]  Jack W. London,et al.  Using the Semantically Interoperable Biospecimen Repository Application, caTissue: End User Deployment Lessons Learned , 2010, 2010 IEEE International Conference on BioInformatics and BioEngineering.

[3]  D. Tabb,et al.  MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis. , 2007, Journal of proteome research.

[4]  Mathias Wilhelm,et al.  Global proteome analysis of the NCI-60 cell line panel. , 2013, Cell reports.

[5]  Lennart Martens,et al.  mzML—a Community Standard for Mass Spectrometry Data* , 2010, Molecular & Cellular Proteomics.

[6]  Chris Mattmann,et al.  A Distributed Information Services Architecture to Support Biomarker Discovery in Early Detection of Cancer , 2006, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06).

[7]  Ronald L. Rivest,et al.  The MD5 Message-Digest Algorithm , 1992, RFC.

[8]  Dan Brickley,et al.  Resource Description Framework (RDF) Model and Syntax Specification , 2002 .

[9]  Robert Burke,et al.  ProteoWizard: open source software for rapid proteomics tools development , 2008, Bioinform..

[10]  Sudhir Srivastava,et al.  A national virtual specimen database for early cancer detection , 2003, 16th IEEE Symposium Computer-Based Medical Systems, 2003. Proceedings..

[11]  Chris Mattmann,et al.  An Extensible Biomarker Curation Approach and Software Infrastructure for the Early Detection of Cancer , 2009, HEALTHINF.

[12]  Chris Mattmann,et al.  Enabling effective curation of cancer biomarker research data , 2009, 2009 22nd IEEE International Symposium on Computer-Based Medical Systems.

[13]  Sudhir Srivastava,et al.  Early Detection Cancer Research Network , 2000, Laboratory Investigation.

[14]  Michael D. Litton,et al.  IDPicker 2.0: Improved protein assembly with high discrimination peptide identification filtering. , 2009, Journal of proteome research.