Consensus Clustering: A Resampling-Based Method for Building Radiation Hybrid Maps

Building radiation hybrid (RH) maps is a challenging process. Traditional RH mapping techniques are very time-consuming and do not work well on noisy datasets. In this work, we propose a new approach that combines a resampling technique with consensus clustering to filter out unreliable markers and build robust RH maps in a short time. The approach has two main aims: first, to reduce the computational complexity of mapping, thus speeding up the mapping process; and second, to filter out unreliable markers and map the remaining reliable markers to build robust maps. The proposed approach maps RH datasets in four steps: 1) it resamples the RH dataset with the jackknife technique and clusters each resampled dataset; 2) it builds consensus clusters and filters out unreliable markers; 3) it maps the consensus clusters; and 4) it connects the consensus clusters' maps to form the final map. To demonstrate the performance of the proposed approach, we compare the accuracy of the constructed maps against the corresponding physical maps. We also compare the running time of our approach with that of the CarthaGene tool. The results show that the proposed approach can construct robust maps in a comparatively very short time.
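As a rough illustration of steps 1 and 2 only, the following minimal sketch resamples a toy dataset by leaving out one hybrid at a time, clusters the markers in each replicate, and keeps only the groups of markers that cluster together in nearly every replicate. Everything specific here is an assumption made for illustration and is not taken from the abstract: the function names, the Hamming-fraction distance, the single-linkage grouping, the leave-one-hybrid-out form of the jackknife, and the two thresholds `dist_cut` and `agree_cut`. Steps 3 and 4 (ordering markers within clusters and joining the cluster maps) are not shown.

```python
from itertools import combinations

def hamming_fraction(a, b, skip):
    """Fraction of hybrids, excluding index `skip`, on which two markers differ."""
    idx = [k for k in range(len(a)) if k != skip]
    return sum(a[k] != b[k] for k in idx) / len(idx)

def components(n, linked):
    """Connected components over items 0..n-1, given a predicate linked(i, j)."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in combinations(range(n), 2):
        if linked(i, j):
            parent[find(i)] = find(j)
    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())

def consensus_clusters(markers, dist_cut=0.3, agree_cut=0.9):
    """Return (consensus clusters, indices of markers filtered out as unreliable).

    Both thresholds are illustrative assumptions, not values from the paper.
    """
    n, h = len(markers), len(markers[0])
    together = [[0] * n for _ in range(n)]  # co-clustering counts per marker pair
    for skip in range(h):  # jackknife replicate: leave hybrid `skip` out
        clusters = components(
            n,
            lambda i, j: hamming_fraction(markers[i], markers[j], skip) <= dist_cut,
        )
        for cluster in clusters:
            for i, j in combinations(cluster, 2):
                together[i][j] += 1
                together[j][i] += 1
    # Consensus: link markers that co-cluster in at least agree_cut of replicates.
    groups = components(n, lambda i, j: together[i][j] / h >= agree_cut)
    kept = [g for g in groups if len(g) > 1]
    dropped = [g[0] for g in groups if len(g) == 1]
    return kept, dropped

# Toy data: each string is one marker's retention pattern across 8 hybrids.
example = [
    "11110000", "11110000", "11100000",  # one tight group
    "00001111", "00011111",              # a second tight group
    "10101010",                          # noisy marker with unstable membership
]
kept, dropped = consensus_clusters(example)
```

In this toy input the last marker joins the first group only in some replicates, so its co-clustering frequency falls below `agree_cut` and it is filtered out, while the two stable groups survive as consensus clusters. Treating singleton consensus groups as unreliable is a design choice of this sketch; a real pipeline might apply a different reliability criterion.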
