Evolutionary origin of a secondary structure: π-helices as cryptic but widespread insertional variations of α-helices that enhance protein functionality.

Formally annotated π-helices are rare in protein structures but have been correlated with functional sites. Here, we analyze protein structures to show that π-helices are the same as structures known as α-bulges, α-aneurisms, π-bulges, and looping outs, and are evolutionarily derived by the insertion of a single residue into an α-helix. This newly discovered evolutionary origin explains both why π-helices are cryptic, being rarely annotated despite occurring in 15% of known proteins, and why they tend to be associated with function. An analysis of π-helices in the diverse ferritin-like superfamily illustrates their tendency to be conserved in protein families and identifies a putative π-helix-containing primordial precursor, a "missing link" intermediary form of the ribonucleotide reductase family, vestigial π-helices, and a novel function for π-helices that we term a "peristaltic-like shift." This new understanding of π-helices paves the way for this generally overlooked motif to become a noteworthy feature that will aid in tracing the evolution of many protein families, guide investigations of protein and π-helix functionality, and contribute additional tools to the protein engineering toolkit.

[1]  J. Thornton,et al.  Helix geometry in proteins. , 1988, Journal of molecular biology.

[2]  P. Nordlund,et al.  The Radical Site in Chlamydial Ribonucleotide Reductase Defines a New R2 Subclass , 2004, Science.

[3]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[4]  H Luecke,et al.  Structure of bacteriorhodopsin at 1.55 A resolution. , 1999, Journal of molecular biology.

[5]  Catherine L. Worth,et al.  Structural and functional constraints in the evolution of protein families , 2009, Nature Reviews Molecular Cell Biology.

[6]  Liisa Holm,et al.  Searching protein structure databases with DaliLite v.3 , 2008, Bioinform..

[7]  George N Phillips,et al.  Structural consequences of effector protein complex formation in a diiron hydroxylase , 2008, Proceedings of the National Academy of Sciences.

[8]  C. Gomes,et al.  Could a Diiron‐Containing Four‐Helix‐Bundle Protein Have Been a Primitive Oxygen Reductase? , 2001, Chembiochem : a European journal of chemical biology.

[9]  J. G. Leahy,et al.  Evolution of the soluble diiron monooxygenases. , 2003, FEMS microbiology reviews.

[10]  M. Högbom,et al.  A Mycobacterium tuberculosis ligand-binding Mn/Fe protein reveals a new cofactor in a remodeled R2-protein scaffold , 2009, Proceedings of the National Academy of Sciences.

[11]  S. Lippard,et al.  Product bound structures of the soluble methane monooxygenase hydroxylase from Methylococcus capsulatus (Bath): protein motion in the alpha-subunit. , 2005, Journal of the American Chemical Society.

[12]  A. Dean,et al.  Mechanistic approaches to the study of evolution: the functional synthesis , 2007, Nature Reviews Genetics.

[13]  C. Walsh,et al.  Role of tyrosine residues in Hg(II) detoxification by mercuric reductase from Bacillus sp. strain RC607. , 1993, Biochemistry.

[14]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[15]  Valerie Daggett,et al.  The role of α‐, 310‐, and π‐helix in helix→coil transitions , 2003 .

[16]  D. Kurtz,et al.  Intrapeptide sequence homology in rubrerythrin from Desulfovibrio vulgaris: identification of potential ligands to the diiron site. , 1991, Biochemical and biophysical research communications.

[17]  Jennifer K. Schwartz,et al.  Geometric and electronic structure studies of the binuclear nonheme ferrous active site of toluene-4-monooxygenase: parallels with methane monooxygenase and insight into the role of the effector proteins in O2 activation. , 2008, Journal of the American Chemical Society.

[18]  I. C. O. B. Nomenclature IUPAC-IUB Commission on Biochemical Nomenclature. Abbreviations and symbols for the description of the conformation of polypeptide chains. Tentative rules (1969). , 1970, Biochemistry.

[19]  D. Oesterhelt,et al.  Structure of the light-driven chloride pump halorhodopsin at 1.8 A resolution. , 2000, Science.

[20]  Dirk W. Heinz,et al.  How amino-acid insertions are allowed in an α-helix of T4 lysozyme , 1993, Nature.

[21]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[22]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[23]  S. Al-Karadaghi,et al.  Occurrence, conformational features and amino acid propensities for the pi-helix. , 2002, Protein engineering.

[24]  P Andrew Karplus,et al.  On the occurrence of linear groups in proteins , 2009, Protein science : a publication of the Protein Society.

[25]  H. Luecke,et al.  Structural and functional characterization of pi bulges and other short intrahelical deformations. , 2004, Structure.

[26]  T. Creighton Proteins: Structures and Molecular Properties , 1986 .

[27]  M. Yeager,et al.  An archaeal antioxidant: characterization of a Dps-like protein from Sulfolobus solfataricus. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[28]  S. Andrews Iron storage in bacteria. , 1998, Advances in microbial physiology.

[29]  B. W. Low,et al.  THE π HELIX—A HYDROGEN BONDED CONFIGURATION OF THE POLYPEPTIDE CHAIN , 1952 .

[30]  Ruth Lloyd,et al.  Insights into trehalose synthesis provided by the structure of the retaining glucosyltransferase OtsA. , 2002, Chemistry & biology.

[31]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[32]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[33]  Frances H. Arnold,et al.  In the Light of Evolution III: Two Centuries of Darwin Sackler Colloquium: In the light of directed evolution: Pathways of adaptive protein evolution , 2009 .

[34]  Krzysztof Kuczera,et al.  Conformational Free Energy Surfaces of Ala10 and Aib10 Peptide Helices in Solution , 2001 .

[35]  P. Bottomley,et al.  Kinetic characterization of the soluble butane monooxygenase from Thauera butanivorans, formerly 'Pseudomonas butanovora'. , 2009, Microbiology.

[36]  E. Lattman,et al.  The alpha aneurism: a structural motif revealed in an insertion mutant of staphylococcal nuclease. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[37]  T. Weaver The π‐helix translates structure into function , 2008, Protein science : a publication of the Protein Society.

[38]  Alexander D. MacKerell,et al.  Force field influence on the observation of π-helical protein structures in molecular dynamics simulations , 2003 .

[39]  H. Nelson,et al.  Role of an alpha-helical bulge in the yeast heat shock transcription factor. , 2000, Journal of molecular biology.

[40]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[41]  K. Kuczera,et al.  Transitions from alpha to pi helix observed in molecular dynamics simulations of synthetic peptides. , 2000, Biochemistry.

[42]  P. Karplus,et al.  Crystal structure of a novel Plasmodium falciparum 1-Cys peroxiredoxin. , 2005, Journal of molecular biology.