Melody discrimination and protein fold classification

One of the greatest challenges in theoretical biophysics and bioinformatics is the identification of protein folds from sequence data. This can be regarded as a pattern recognition problem. In this paper we report the use of a melody generation software where the inputs are derived from calculations of evolutionary information, secondary structure, flexibility, hydropathy and solvent accessibility from multiple sequence alignment data. The melodies so generated are derived from the sequence, and by inference, of the fold, in ways that give each fold a sound representation that may facilitate analysis, recognition, or comparison with other sequences.

[1]  Miguel Ángel García-Ruíz,et al.  An overview of auditory display to assist comprehension of molecular information , 2006, Interact. Comput..

[2]  Joan-Emma Shea,et al.  Sequence periodicity and secondary structure propensity in model proteins , 2010, Protein science : a publication of the Protein Society.

[3]  A. Supper Sublime frequencies:  The construction of sublime listening experiences in the sonification of scientific data , 2014, Social studies of science.

[4]  D. Baker,et al.  Principles for designing ideal protein structures , 2012, Nature.

[5]  S. Hovmöller,et al.  Conformations of amino acids in proteins. , 2002, Acta crystallographica. Section D, Biological crystallography.

[6]  R. Bywater,et al.  The preferred conformation of dipeptides in the context of biosynthesis , 2013, Naturwissenschaften.

[7]  Musical patterns for comparative epigenomics , 2015, Clinical Epigenetics.

[8]  R. Takahashi,et al.  Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns , 2007, Genome Biology.

[9]  Michael Levitt,et al.  Probing protein fold space with a simplified model. , 2008, Journal of molecular biology.

[10]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[11]  Jonathan N. Middleton,et al.  Web-Based Algorithmic Composition from Extramusical Resources , 2008, Leonardo.

[12]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[13]  Bruce N. Walker,et al.  Mappings and metaphors in auditory displays: An experimental assessment , 2005, TAP.

[14]  Paul Vickers,et al.  Sonification Design and Aesthetics , 2011 .

[16]  Mary Anne Clark,et al.  Life Music: The Sonification of Proteins , 1999, Leonardo.

[17]  S H White,et al.  Membrane partitioning: distinguishing bilayer effects from the hydrophobic effect. , 1993, Biochemistry.

[18]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[19]  David Thomas,et al.  A sequence and structural study of transmembrane helices , 2001, J. Comput. Aided Mol. Des..

[20]  Burkhard Rost,et al.  The PredictProtein server , 2003, Nucleic Acids Res..

[21]  R. Bywater Protein folding: a problem with multiple solutions , 2013, Journal of biomolecular structure & dynamics.

[22]  Davide Rocchesso,et al.  The Sonification Handbook , 2011 .

[23]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[24]  M. Staege A short treatise concerning a musical approach for the interpretation of gene expression data , 2015, Scientific Reports.