Bootstrap Inference with Neural-Network Modeling for Gene-Disease Association Testing

Estimates derived from neural network modeling are used to test the significance of single nucleotide polymorphisms (SNPs) in the categorization of case control status in genetic association studies. Our artificial neural network (ANN) model of gene-disease correlation is represented by a fully connected 3-layered feedforward neural network with input nodes, corresponding to the number of studied SNPs, a hidden layer and a single output unit for the disease status. We used an evaluation procedure that measures the predictive significance of a single SNP, based on the change in the error function when the input is removed from the network. Two ANNs, one with all inputs and the other with a tested input removed are run in parallel and the change in error is calculated as a function of the relative out-of-sample performance of these two networks. With the help of a bootstrap technique (resampling with replacement) the valid inference of the tested input variables is derived. We report on the performance of the procedure as evaluated on simulated SNP datasets of varying complexity, on a real study dataset and in comparison to an SVM implementation of the procedure

[1]  Abdesselam Bouzerdoum,et al.  A generalized feedforward neural network architecture for classification and regression , 2003, Neural Networks.

[2]  Alfonso Palmer,et al.  Numeric sensitivity analysis applied to feedforward neural networks , 2003, Neural Computing & Applications.

[3]  M. Georgiopoulos,et al.  Feed-forward neural networks , 1994, IEEE Potentials.

[4]  Jennifer Couzin,et al.  Genomics. New mapping project splits the community. , 2002, Science.

[5]  Jacek M. Zurada,et al.  Sensitivity analysis for minimization of input data dimension for feedforward neural network , 1994, Proceedings of IEEE International Symposium on Circuits and Systems - ISCAS '94.

[6]  Jeremy Wyatt,et al.  Clinical applications of artificial neural networks: Artificial neural networks: practical considerations for clinical application , 2001 .

[7]  Richard Dybowski,et al.  Clinical applications of artificial neural networks: Theory , 2001 .

[8]  Derek Partridge,et al.  Assessing the Impact of Input Features in a Feedforward Neural Network , 2000, Neural Computing & Applications.

[9]  Richard F Harris Hapmap flap , 2002, Current Biology.

[10]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[11]  Jennifer Couzin,et al.  New Mapping Project Splits the Community , 2002, Science.

[12]  Hiroyuki Honda,et al.  Artificial neural network approach for selection of susceptible single nucleotide polymorphisms and construction of prediction model on childhood allergic asthma , 2004, BMC Bioinformatics.

[13]  Lon R. Cardon,et al.  The complex interplay among factors that influence allelic association , 2004, Nature Reviews Genetics.

[14]  Jeffrey S. Racine,et al.  Statistical inference, the bootstrap, and neural-network modeling with application to foreign exchange rates , 2001, IEEE Trans. Neural Networks.

[15]  Michael Georgiopoulos,et al.  OPTIMAL FEED-FORWARD NEURAL NETWORK ARCHITECTURES , .

[16]  Enrico Smeraldi,et al.  Neural network analysis in pharmacogenetics of mood disorders , 2004, BMC Medical Genetics.

[17]  N. Schork,et al.  Single nucleotide polymorphisms and the future of genetic epidemiology , 2000, Clinical genetics.

[18]  Farid E Ahmed,et al.  Molecular Cancer BioMed Central Review , 2005 .

[19]  W Penny,et al.  Neural Networks in Clinical Medicine , 1996, Medical decision making : an international journal of the Society for Medical Decision Making.

[20]  Timothy Masters,et al.  Practical neural network recipes in C , 1993 .

[21]  D. Clayton,et al.  A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data: application to HLA in type 1 diabetes. , 2002, American journal of human genetics.

[22]  J. Ott,et al.  Neural network analysis of complex traits , 1997, Genetic epidemiology.