Method for generating multiple risky barcodes of complex diseases using ant colony algorithm

Background: Susceptible barcode recognition plays an important role in the diagnosis and treatment of complex diseases. Numerous approaches have been proposed to identify risky barcodes involved in the progression of complex diseases. However, some methods consider only differences in barcode frequencies between the control and disease groups; as such, their results may be partial or even wrong. For example, some barcodes with a high risk ratio have a low frequency in cases or a high frequency in controls, which may be unreasonable from a statistical standpoint.

Results: In our study, a stricter criterion, maximum discrepancy and maximum constituency, is designed to evaluate each barcode, and an ant colony algorithm is used to search the combination space of epistasis. For complex diseases with multiple subtypes, our method can list several potential barcodes contributing to different subtypes. Another contribution of this work is a method for determining the length of barcodes and excluding noisy barcodes whose frequencies are abnormal. In addition, common pathogenic genes shared by different risky barcodes are recognized, which may provide key clues for further study, such as gene function analysis.

Conclusions: Experimental results reveal that our method can find multiple risky barcodes whose risk ratio and odds ratio are both greater than 1. These barcodes could be related to different subtypes of complex diseases.
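To make the statistics named in the abstract concrete, the following minimal sketch computes the case and control frequencies, risk ratio, and odds ratio of a single barcode from a 2x2 contingency table. It uses the standard textbook definitions, which are an assumption here; the abstract does not state the exact formulas the authors use, and the function name `barcode_stats` and the counts in the usage example are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class BarcodeStats:
    case_freq: float      # fraction of cases carrying the barcode
    control_freq: float   # fraction of controls carrying the barcode
    risk_ratio: float
    odds_ratio: float


def barcode_stats(case_with: int, case_total: int,
                  control_with: int, control_total: int) -> BarcodeStats:
    """Textbook 2x2-table statistics for one barcode (assumed definitions)."""
    case_without = case_total - case_with
    control_without = control_total - control_with
    cells = (case_with, case_without, control_with, control_without)
    if min(cells) <= 0:
        # Ratios are undefined or unstable when any cell is empty.
        raise ValueError("every cell of the 2x2 table must be positive")

    # Risk of being a case among carriers vs. among non-carriers.
    risk_carriers = case_with / (case_with + control_with)
    risk_noncarriers = case_without / (case_without + control_without)

    return BarcodeStats(
        case_freq=case_with / case_total,
        control_freq=control_with / control_total,
        risk_ratio=risk_carriers / risk_noncarriers,
        odds_ratio=(case_with * control_without) / (control_with * case_without),
    )


# Hypothetical counts: 30 of 100 cases and 10 of 100 controls carry the barcode.
stats = barcode_stats(case_with=30, case_total=100,
                      control_with=10, control_total=100)
```

Reporting the two frequencies next to the ratios reflects the concern raised in the Background: a barcode can have ratios above 1 while still being rare in cases or common in controls, so the ratios alone are not a sufficient filter.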
