Proteins interacting with cloning scars: a source of false positive protein-protein interactions

A common approach for exploring the interactome, the network of protein-protein interactions in cells, uses a commercially available ORF library to express affinity tagged bait proteins; these can be expressed in cells and endogenous cellular proteins that copurify with the bait can be identified as putative interacting proteins using mass spectrometry. Control experiments can be used to limit false-positive results, but in many cases, there are still a surprising number of prey proteins that appear to copurify specifically with the bait. Here, we have identified one source of false-positive interactions in such studies. We have found that a combination of: 1) the variable sequence of the C-terminus of the bait with 2) a C-terminal valine “cloning scar” present in a commercially available ORF library, can in some cases create a peptide motif that results in the aberrant co-purification of endogenous cellular proteins. Control experiments may not identify false positives resulting from such artificial motifs, as aberrant binding depends on sequences that vary from one bait to another. It is possible that such cryptic protein binding might occur in other systems using affinity tagged proteins; this study highlights the importance of conducting careful follow-up studies where novel protein-protein interactions are suspected.

[1]  D. Chalbos,et al.  PTPN13/PTPL1: an important regulator of tumor aggressiveness. , 2011, Anti-cancer agents in medicinal chemistry.

[2]  John H. Lewis,et al.  Crystal Structures of a Complexed and Peptide-Free Membrane Protein–Binding Domain: Molecular Basis of Peptide Recognition by PDZ , 1996, Cell.

[3]  G. Hattem,et al.  Controlling for Gene Expression Changes in Transcription Factor Protein Networks* , 2014, Molecular & Cellular Proteomics.

[4]  J. Yates,et al.  DTASelect and Contrast: tools for assembling and comparing protein identifications from shotgun proteomics. , 2002, Journal of proteome research.

[5]  S. Howell,et al.  ABIN-2 Forms a Ternary Complex with TPL-2 and NF-κB1 p105 and Is Essential for TPL-2 Protein Stability , 2004, Molecular and Cellular Biology.

[6]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[7]  M. Moran,et al.  Large-scale mapping of human protein–protein interactions by mass spectrometry , 2007, Molecular systems biology.

[8]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[9]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[10]  R. Beyaert,et al.  Identification of a Novel A20-binding Inhibitor of Nuclear Factor-κB Activation Termed ABIN-2* , 2001, The Journal of Biological Chemistry.

[11]  R. Aebersold,et al.  Mass spectrometry-based proteomics and network biology. , 2012, Annual review of biochemistry.

[12]  Chris Sander,et al.  A Specificity Map for the PDZ Domain Family , 2008, PLoS biology.

[13]  Embryonic neural inducing factor churchill is not a DNA-binding zinc finger protein: solution structure reveals a solvent-exposed beta-sheet and zinc binuclear cluster. , 2007, Journal of molecular biology.

[14]  C. W. Liew,et al.  Protein–protein interactions: Analysis of a false positive GST pulldown result , 2011, Proteins.

[15]  J. Yanagisawa,et al.  The Molecular Interaction of Fas and FAP-1 , 1997, The Journal of Biological Chemistry.

[16]  Merlin Crossley,et al.  Protein interactions: is seeing believing? , 2007, Trends in biochemical sciences.

[17]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[18]  F. Prinz,et al.  Believe it or not: how much can we rely on published data on potential drug targets? , 2011, Nature Reviews Drug Discovery.

[19]  L. Cantley,et al.  Recognition of Unique Carboxyl-Terminal Motifs by Distinct PDZ Domains , 1997, Science.

[20]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[21]  W. Neupert A mitochondrial odyssey. , 2012, Annual review of biochemistry.

[22]  S. Irie,et al.  Identification of IkappaBalpha as a substrate of Fas-associated phosphatase-1. , 2000, European journal of biochemistry.

[23]  Amber L. Couzens,et al.  The CRAPome: a Contaminant Repository for Affinity Purification Mass Spectrometry Data , 2013, Nature Methods.

[24]  M. Washburn,et al.  Refinements to label free proteome quantitation: how to deal with peptides shared by multiple proteins. , 2010, Analytical chemistry.

[25]  Markus Seiler,et al.  The transience of transient overexpression , 2013, Nature Methods.

[26]  Michael P Washburn,et al.  Proteomic analysis by multidimensional protein identification technology. , 2006, Methods in molecular biology.

[27]  Paul G. Blommel,et al.  Flexi vector cloning. , 2009, Methods in molecular biology.

[28]  Daniel MacArthur,et al.  Methods: Face up to false positives , 2012, Nature.

[29]  Beau Dabbs,et al.  Summary and discussion of : “ Controlling the False Discovery Rate : A Practical and Powerful Approach to Multiple Testing , 2014 .

[30]  S. Gygi,et al.  Defining the Human Deubiquitinating Enzyme Interaction Landscape , 2009, Cell.

[31]  James M. Anderson,et al.  Protein–protein interactions: PDZ domain networks , 1996, Current Biology.

[32]  K. Maekawa,et al.  Molecular cloning of a novel protein‐tyrosine phosphatase containing a membrane‐binding domain and GLGF repeats , 1994, FEBS letters.

[33]  J. Goodrich,et al.  Protein-protein interaction assays: eliminating false positive interactions , 2006, Nature Methods.

[34]  T. Nagase,et al.  Exploration of Human ORFeome: High-Throughput Preparation of ORF Clones and Efficient Characterization of Their Protein Products , 2008, DNA research : an international journal for rapid publication of reports on genes and genomes.

[35]  M. Eck,et al.  The FERM domain: organizing the structure and function of FAK , 2010, Nature Reviews Molecular Cell Biology.

[36]  J. LaBaer,et al.  High‐throughput cloning and expression library creation for functional proteomics , 2013, Proteomics.

[37]  Eric W. Deutsch,et al.  The PeptideAtlas project , 2005, Nucleic Acids Res..

[38]  Norman Pavelka,et al.  Statistical Similarities between Transcriptomics and Quantitative Shotgun Proteomics Data *S , 2008, Molecular & Cellular Proteomics.

[39]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[40]  S. Fields High‐throughput two‐hybrid analysis , 2005, The FEBS journal.

[41]  H. Dyson,et al.  The CBP/p300 TAZ1 domain in its native state is not a binding partner of MDM2. , 2004, The Biochemical journal.

[42]  L. Bonetta Protein–protein interactions: Interactome under construction , 2010, Nature.

[43]  Anne-Claude Gavin,et al.  Recent advances in charting protein-protein interaction: mass spectrometry-based approaches. , 2011, Current opinion in biotechnology.

[44]  P. Bork,et al.  The KIND module: a putative signalling domain evolved from the C lobe of the protein kinase fold. , 2003, Trends in biochemical sciences.

[45]  Jonathan F. Russell,et al.  If a job is worth doing, it is worth doing twice , 2013, Nature.

[46]  Taka-Aki Sato,et al.  Identification of IκBα as a substrate of Fas‐associated phosphatase‐1 , 2000 .

[47]  Gary D Bader,et al.  A draft map of the human proteome , 2014, Nature.

[48]  Seth G. N. Grant,et al.  PDZ Domain Proteins: Plug and Play! , 2003, Science's STKE.

[49]  H. Yamakawa High-throughput construction of ORF clones for production of the recombinant proteins. , 2009, Methods in molecular biology.