Bayesian mixture models for complex high dimensional count data in phage display experiments

Phage display is a biological process that is used to screen random peptide libraries for ligands that bind to a target of interest with high affinity. On the basis of a count data set from an innovative multistage phage display experiment, we propose a class of Bayesian mixture models to cluster peptide counts into three groups that exhibit different display patterns across stages. Among the three groups, the investigators are particularly interested in that with an ascending display pattern in the counts, which implies that the peptides are likely to bind to the target with strong affinity. We apply a Bayesian false discovery rate approach to identify the peptides with the strongest affinity within the group. A list of peptides is obtained, among which important ones with meaningful functions are further validated by biologists. To examine the performance of the Bayesian model, we conduct a simulation study and obtain desirable results. Copyright 2007 Royal Statistical Society.

[1]  L. Wasserman,et al.  Asymptotic inference for mixture models by using data‐dependent priors , 2000 .

[2]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Kim-Anh Do,et al.  Steps toward mapping the human vasculature by phage display , 2002, Nature Medicine.

[4]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[5]  D. Spiegelhalter,et al.  Bayes Factors for Linear and Log‐Linear Models with Vague Prior Information , 1982 .

[6]  Erkki Ruoslahti,et al.  Organ targeting In vivo using phage display peptide libraries , 1996, Nature.

[7]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo conver-gence diagnostics: a comparative review , 1996 .

[8]  A. Raftery A Note on Bayes Factors for Log‐Linear Contingency Table Models with Vague Prior Information , 1986 .

[9]  J. Albert Bayesian Testing and Estimation of Association in a Two-Way Contingency Table , 1997 .

[10]  T. Clackson,et al.  Phage display : a practical approach , 2004 .

[11]  P. Quesenberry,et al.  A Specific Heptapeptide from a Phage Display Peptide Library Homes to Bone Marrow and Binds to Primitive Hematopoietic Stem Cells , 2004, Stem cells.

[12]  Bradley P. Carlin,et al.  BAYES AND EMPIRICAL BAYES METHODS FOR DATA ANALYSIS , 1996, Stat. Comput..

[13]  John S. J. Hsu,et al.  Bayesian Methods: An Analysis for Statisticians and Interdisciplinary Researchers , 1999 .

[14]  Wadih Arap,et al.  Synchronous selection of homing peptides for multiple tissues by in vivo phage display , 2006, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[15]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[16]  A. Agresti Categorical data analysis , 1993 .

[17]  P. Müller,et al.  Optimal Sample Size for Multiple Testing , 2004 .

[18]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[19]  Deepayan Sarkar,et al.  Detecting differential gene expression with a semiparametric hierarchical mixture method. , 2004, Biostatistics.