Data and text mining Aneuploidy prediction and tumor classification with heterogeneous hidden conditional random fields

Motivation: The heterogeneity of cancer cannot always be recognized by tumor morphology, but may be reflected by the underlying genetic aberrations. Array comparative genome hybridization (array-CGH) methods provide high-throughput data on genetic copy numbers, but determining the clinically relevant copy number changes remains a challenge. Conventional classification methods for linking recurrent alterations to clinical outcome ignore sequential correlations in selecting relevant features. Conversely, existing sequence classification methods can only model overall copy number instability, without regard to any particular position in

[1]  Lars Hofmann,et al.  p73 poses a barrier to malignant transformation by limiting anchorage‐independent growth , 2008, The EMBO journal.

[2]  Emmanuel Barillot,et al.  Classification of arrayCGH data using fused SVM , 2008, ISMB.

[3]  Yu Li,et al.  Genomic and functional evidence for an ARID1A tumor suppressor role , 2007, Genes, chromosomes & cancer.

[4]  Kevin P. Murphy,et al.  Modeling recurrent DNA copy number alterations in array CGH data , 2007, ISMB/ECCB.

[5]  David J Smith,et al.  Loss of fibulin-2 expression is associated with breast cancer progression. , 2007, The American journal of pathology.

[6]  G. Tian,et al.  Statistical Applications in Genetics and Molecular Biology Sparse Logistic Regression with Lp Penalty for Biomarker Identification , 2011 .

[7]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[8]  Ajay N. Jain,et al.  Genomic and transcriptional aberrations linked to breast cancer pathophysiologies. , 2006, Cancer cell.

[9]  P. Nederlof,et al.  Array-CGH and breast cancer , 2006, Breast Cancer Research.

[10]  Stephen Chia,et al.  Amplification of EMSY, a novel oncogene on 11q13, in high grade ovarian surface epithelial carcinomas. , 2006, Gynecologic oncology.

[11]  Tara L. Naylor,et al.  Distinct genomic profiles in hereditary breast tumors identified by array-based comparative genomic hybridization. , 2005, Cancer research.

[12]  J. Fridlyand,et al.  Rare amplicons implicate frequent deregulation of cell fate specification pathways in oral squamous cell carcinoma , 2005, Oncogene.

[13]  Yuan Qi,et al.  Bayesian Conditional Random Fields , 2005, AISTATS.

[14]  P. Raychaudhuri,et al.  Cul4A Physically Associates with MDM2 and Participates in the Proteolysis of p53 , 2004, Cancer Research.

[15]  Yongdai Kim,et al.  Gradient LASSO for feature selection , 2004, ICML.

[16]  P. Raychaudhuri,et al.  Cul 4 A Physically Associates with MDM 2 and Participates in the Proteolysis of p 53 , 2004 .

[17]  Jun Ho Jeon,et al.  VDUP1 upregulated by TGF-β1 and 1,25-dihydorxyvitamin D3 inhibits tumor cell growth by blocking cell-cycle progression , 2003, Oncogene.

[18]  Carlos Caldas,et al.  Molecular Classification of Breast Carcinomas Using Tissue Microarrays , 2003, Diagnostic molecular pathology : the American journal of surgical pathology, part B.

[19]  Christian A. Rees,et al.  Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[20]  David M. Rocke,et al.  A Model for Measurement Error for Gene Expression Arrays , 2001, J. Comput. Biol..

[21]  W. Kuo,et al.  Quantitative mapping of amplicon structure by array CGH identifies CYP24 as a candidate oncogene , 2000, Nature Genetics.

[22]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[23]  F. Mitelman,et al.  Primary chromosome abnormalities in human neoplasia. , 1989, Advances in cancer research.