Between-tumor and within-tumor heterogeneity in invasive potential

For women with access to healthcare and early detection, breast cancer deaths are caused primarily by metastasis rather than growth of the primary tumor. Metastasis has been difficult to study because it happens deep in the body, occurs over years, and involves a small fraction of cells from the primary tumor. Furthermore, within-tumor heterogeneity relevant to metastasis can also lead to therapy failures and is obscured by studies of bulk tissue. Here we exploit heterogeneity to identify molecular mechanisms of metastasis. We use “organoids”, groups of hundreds of tumor cells taken from a patient and grown in the lab, to probe tumor heterogeneity, with potentially thousands of organoids generated from a single tumor. We show that organoids have the character of biological replicates: within-tumor and between-tumor variation are of similar magnitude. We develop new methods based on population genetics and variance components models to build between-tumor and within-tumor statistical tests, using organoids analogously to large sibships and vastly amplifying the test power. We show great efficiency for tests based on the organoids with the most extreme phenotypes and potential cost savings from pooled tests of the extreme tails, with organoids generated from hundreds of tumors having power predicted to be similar to bulk tests of hundreds of thousands of tumors. We apply these methods to an association test for molecular correlates of invasion, using a novel quantitative invasion phenotype calculated as the spectral power of the organoid boundary. These new approaches combine to show a strong association between invasion and protein expression of Keratin 14, a known biomarker for poor prognosis, with p = 2 × 10−45 for within-tumor tests of individual organoids and p < 10−6 for pooled tests of extreme tails. Future studies using these methods could lead to discoveries of new classes of cancer targets and development of corresponding therapeutics. All data and methods are available under an open source license at https://github.com/baderzone/invasion_2019.

[1]  H. Grüneberg,et al.  Introduction to quantitative genetics , 1960 .

[2]  Andrew J Ewald,et al.  ECM microenvironment regulates collective migration and local dissemination in normal and malignant mammary epithelium , 2012, Proceedings of the National Academy of Sciences.

[3]  R. Matkowski,et al.  Circulating Tumor , 2014 .

[4]  Andrew J. Ewald,et al.  Three-dimensional organotypic culture: experimental models of mammalian biology and disease , 2014, Nature Reviews Molecular Cell Biology.

[5]  P. Sham,et al.  Family-based association tests for quantitative traits using pooled DNA , 2002, European Journal of Human Genetics.

[6]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[7]  D. Hartl,et al.  Principles of population genetics , 1981 .

[8]  N. Obuchowski,et al.  Quantitative classification of breast tumors in digitized mammograms. , 1996, Medical physics.

[9]  Andrew J. Ewald,et al.  A collective route to metastasis: Seeding by tumor cell clusters , 2016, Science.

[10]  B. Stanger,et al.  Pancreatic Cancer Metastases Harbor Evidence of Polyclonality. , 2015, Cancer discovery.

[11]  W. Ewens Genetics and analysis of quantitative traits , 1999 .

[12]  Sridhar Ramaswamy,et al.  Circulating Tumor Cell Clusters Are Oligoclonal Precursors of Breast Cancer Metastasis , 2014, Cell.

[13]  Rangaraj M. Rangayyan,et al.  Application of shape analysis to mammographic calcifications , 1994, IEEE Trans. Medical Imaging.

[14]  K. Inoue,et al.  A method for calculating the perimeter of objects for automatic recognition of circular defects , 1987 .

[15]  Michael A. Loss,et al.  The NIH Protein Capture Reagents Program (PCRP): a standardized protein affinity reagent toolbox , 2016, Nature Methods.

[16]  N Risch,et al.  Extreme discordant sib pairs for mapping quantitative trait loci in humans. , 1995, Science.

[17]  K. Polyak,et al.  Tumorigenesis: it takes a village , 2015, Nature Reviews Cancer.

[18]  T. Keck,et al.  Cancer cell invasion and EMT marker expression: a three‐dimensional study of the human cancer–host interface , 2014, The Journal of pathology.

[19]  Alex Pentland,et al.  Fractal-Based Description of Natural Scenes , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  J. Bader,et al.  Twist1-induced dissemination preserves epithelial identity and requires E-cadherin , 2014, The Journal of cell biology.

[21]  A W Partin,et al.  Fourier analysis of cell motility: correlation of motility with metastatic potential. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[22]  H. Levine,et al.  Clinical Medicine , 1997 .

[23]  M. McPeek,et al.  Modeling Three-Dimensional Morphological Structures Using Spherical Harmonics , 2009, Evolution; international journal of organic evolution.

[24]  A. Jemal,et al.  Cancer statistics, 2019 , 2019, CA: a cancer journal for clinicians.

[25]  James E. Verdone,et al.  Polyclonal breast cancer metastases arise from collective dissemination of keratin 14-expressing tumor cell clusters , 2016, Proceedings of the National Academy of Sciences.

[26]  L R Cardon,et al.  The power to detect linkage disequilibrium with quantitative traits in selected samples. , 2001, American journal of human genetics.

[27]  A. Jemal,et al.  Breast cancer statistics, 2017, racial disparity in mortality by state , 2017, CA: a cancer journal for clinicians.

[28]  A. Jemal,et al.  Cancer statistics, 2017 , 2017, CA: a cancer journal for clinicians.

[29]  J. Winstanley,et al.  Statistical association of basal cell keratins with metastasis-inducing proteins in a prognostically unfavorable group of sporadic breast cancers. , 2011, The American journal of pathology.

[30]  M. O’Donovan,et al.  DNA Pooling: a tool for large-scale association studies , 2002, Nature Reviews Genetics.

[31]  Kevin W Eliceiri,et al.  NIH Image to ImageJ: 25 years of image analysis , 2012, Nature Methods.

[32]  P. Sham,et al.  Optimal selection strategies for QTL mapping using pooled DNA samples , 2002, European Journal of Human Genetics.

[33]  Andrew J. Ewald,et al.  Collective Invasion in Breast Cancer Requires a Conserved Basal Epithelial Program , 2013, Cell.

[34]  A. Brú,et al.  Fractal analysis and tumour growth , 2008, Math. Comput. Model..

[35]  M. Soller,et al.  Selective DNA pooling for determination of linkage between a molecular marker and a quantitative trait locus. , 1994, Genetics.

[36]  G. Schwarz Estimating the Dimension of a Model , 1978 .