Kernel based integration of Gene expression and DNA copy number

Kernel based integration data of gene expression and DNA copy number is used to analyze pattern of genes in breast cancer cell line. The integration data is clustered without any information about the number of k clusters. This paper proposes the use of intelligent kernel K-Means that is developed by combining intelligent K-Means and kernel K-Means. The technique is used to cluster data integration of gene expression and DNA copy number. The experiment results show that there are three clusters are successes to be found. Evaluation measure produce R value is 0.29.

[1]  Aristidis Likas,et al.  The Global Kernel $k$-Means Algorithm for Clustering in Feature Space , 2009, IEEE Transactions on Neural Networks.

[2]  Renée X. de Menezes,et al.  Integrated analysis of DNA copy number and gene expression microarray data using gene sets , 2009, BMC Bioinformatics.

[3]  Sheng-De Wang,et al.  Choosing the kernel parameters for support vector machines by the inter-cluster distance in the feature space , 2009, Pattern Recognit..

[4]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[5]  Pei Wang,et al.  Integrative analysis of DNA copy number and gene expression in metastatic oral squamous cell carcinoma identifies genes associated with poor survival , 2010, Molecular Cancer.

[6]  Aristidis Likas,et al.  Kernel-Based Weighted Multi-view Clustering , 2012, 2012 IEEE 12th International Conference on Data Mining.

[7]  Boris G. Mirkin,et al.  Intelligent Choice of the Number of Clusters in K-Means Clustering: An Experimental Study with Different Cluster Spreads , 2010, J. Classif..

[8]  Wen-Lin Kuo,et al.  A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes. , 2006, Cancer cell.

[9]  Boris Mirkin,et al.  Clustering For Data Mining: A Data Recovery Approach (Chapman & Hall/Crc Computer Science) , 2005 .

[10]  Jaakko Astola,et al.  A strategy for identifying putative causes of gene expression variation in human cancers , 2004, J. Frankl. Inst..

[11]  John A. Berger,et al.  Jointly analyzing gene expression and copy number data in breast cancer using data reduction models , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[12]  Nello Cristianini,et al.  Kernel-Based Data Fusion and Its Application to Protein Function Prediction in Yeast , 2003, Pacific Symposium on Biocomputing.

[13]  Ronald M. Levy,et al.  Pairwise and higher-order correlations among drug-resistance mutations in HIV-1 subtype B protease , 2009, BMC Bioinformatics.

[14]  J. Suykens,et al.  A kernel-based integration of genome-wide data for clinical decision support , 2009, Genome Medicine.

[15]  Vince D. Calhoun,et al.  Integrated Analysis of Gene Expression and Copy Number Data on Gene Shaving Using Independent Component Analysis , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  M. Ringnér,et al.  Impact of DNA amplification on gene expression patterns in breast cancer. , 2002, Cancer research.

[17]  Tijl De Bie,et al.  Kernel-based data fusion for gene prioritization , 2007, ISMB/ECCB.

[18]  Kong Dexi,et al.  A Fast and Effective Kernel-Based K-Means Clustering Algorithm , 2013, 2013 Third International Conference on Intelligent System Design and Engineering Applications.

[19]  Johan A. K. Suykens,et al.  Optimized Data Fusion for Kernel k-Means Clustering , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Francisco Azuaje,et al.  Cluster validation techniques for genome expression data , 2003, Signal Process..