Observations on Using Probabilistic C-Means for Solving a Typical Bioinformatics Problem

Recently, there has been great interest in bioinformatics among researchers from various disciplines such as computer science, mathematics, statistics and artificial intelligence. Bioinformatics mainly deals with solving biological problems at the molecular level. One of the classic problems of bioinformatics that has gained a lot of attention lately is haplotyping, the goal of which is to partition SNP fragments into two clusters and deduce a haplotype for each cluster. Since the problem has been proved to be NP-hard, several computational and heuristic methods have been proposed that seek feasible solutions. This work shows that using PCM to solve the haplotyping problem on the DALY dataset yields better results than currently available methods.
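To make the problem statement concrete, the following minimal sketch illustrates what "deducing a haplotype for each cluster" can look like once the fragments have already been split into two groups. It is not the PCM method of this work. The fragment encoding (strings over '0', '1' and '-' for a missing call), the majority-vote rule, the tie-breaking choice, and the function names `consensus` and `mismatches` are all assumptions made for illustration, and the toy fragments are invented.

```python
from collections import Counter

def consensus(fragments):
    """Majority-vote haplotype for one non-empty cluster of equal-length fragments.

    Assumed encoding: '0'/'1' are allele calls, '-' is a gap (no call).
    """
    length = len(fragments[0])
    haplotype = []
    for j in range(length):
        counts = Counter(f[j] for f in fragments if f[j] != '-')
        # Ties and all-gap columns fall back to '0' (an arbitrary choice).
        haplotype.append('1' if counts['1'] > counts['0'] else '0')
    return ''.join(haplotype)

def mismatches(fragment, haplotype):
    """Count called positions where the fragment disagrees with the haplotype."""
    return sum(1 for a, b in zip(fragment, haplotype) if a != '-' and a != b)

# Hypothetical toy data: two clusters of SNP fragments over five sites.
cluster_a = ["01101", "011-1", "0-101"]
cluster_b = ["10010", "1-010", "10110"]

hap_a = consensus(cluster_a)
hap_b = consensus(cluster_b)

# Total number of calls that would have to be corrected for every
# fragment to agree with the haplotype of its own cluster.
total_errors = (sum(mismatches(f, hap_a) for f in cluster_a)
                + sum(mismatches(f, hap_b) for f in cluster_b))

print(hap_a, hap_b, total_errors)
```

A clustering method such as PCM would be responsible for producing the two groups. The sketch only covers the step after that, and the disagreement count is one plausible way to compare candidate partitions.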
