Big Data Analysis on Puerto Rico Testsite for Exploring Contamination Threats

In this paper, we present the use of Principal Component Analysis and customized software to accelerate the spectral analysis of biological samples. The work is part of the mission of the Puerto Rico Testsite for Exploring Contamination Threats Center, sponsored by the National Institute of Environmental Health Sciences, which aims to establish linkages between environmental pollutants and preterm birth. The paper provides an overview of the data repository developed for the Center and presents a use-case analysis of biological sample data maintained in the database system.
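To make the role of Principal Component Analysis concrete, the following is a minimal sketch of how a first principal component might be extracted from a small matrix of spectral intensities. It is not the customized software described in the paper: the toy data, the function names, and the choice of power iteration on the covariance matrix (standard library only) are all assumptions made for illustration.

```python
import math
import random

def center_columns(rows):
    """Subtract each column's mean so every feature has zero mean."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]

def covariance(centered):
    """Sample covariance matrix (divides by n - 1) of centered data."""
    n, d = len(centered), len(centered[0])
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
            for i in range(d)]

def first_component(cov, iterations=200, seed=0):
    """Approximate the dominant eigenvector of cov by power iteration."""
    rng = random.Random(seed)
    d = len(cov)
    v = [rng.random() for _ in range(d)]
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient: variance captured along the unit vector v.
    eigenvalue = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d))
                     for i in range(d))
    return v, eigenvalue

def project(centered, v):
    """Score of each sample along the component v."""
    return [sum(r[j] * v[j] for j in range(len(v))) for r in centered]

# Hypothetical toy data: 4 samples (rows) x 3 intensity bins (columns).
spectra = [
    [1.0, 2.0, 0.5],
    [2.0, 4.1, 0.4],
    [3.0, 5.9, 0.6],
    [4.0, 8.0, 0.5],
]

centered = center_columns(spectra)
cov = covariance(centered)
pc1, variance = first_component(cov)
scores = project(centered, pc1)

# Fraction of total variance explained by the first component.
total_variance = sum(cov[i][i] for i in range(len(cov)))
explained = variance / total_variance
```

In this toy case the first two bins rise together, so a single component captures almost all of the variance and each sample reduces to one score. Real mass-spectrometry data would have many more bins, and a library routine would normally replace the hand-written iteration.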
