Stochastic Assessment of Protein Databases by Generalized Entropy Measures

The sample space for studying the probability density functions whose temporal variation describes the evolution of protein domains, as registered in biological almanacs (protein databases), is organized through two concurrent processes: the “poissonization” of a binomial process, and a multinomial process leading to a Gibbs–Shannon entropy. The present approach aims to bridge the difficulties of constructing a new theory able to describe the function and evolution of protein families, and their association into clans, with the usual methods of Statistical Physics.
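The two processes named above can be illustrated numerically. The sketch below is not the authors' construction; it only shows the textbook facts the abstract relies on, under stated assumptions: a binomial distribution with many trials n and small success probability p = λ/n approaches a Poisson distribution with mean λ, and the Gibbs–Shannon entropy of multinomial counts is −Σ qᵢ ln qᵢ over the observed frequencies. All function names, the value λ = 2, and the toy counts are hypothetical choices made for illustration.

```python
import math

def binomial_pmf(k, n, p):
    # Probability of k successes in n independent trials with success probability p.
    return math.comb(n, k) * p**k * (1 - p)**(n - k)

def poisson_pmf(k, lam):
    # Probability of k events for a Poisson distribution with mean lam.
    return math.exp(-lam) * lam**k / math.factorial(k)

def max_pmf_gap(n, lam, k_max=20):
    # Largest pointwise difference between Binomial(n, lam/n) and Poisson(lam)
    # over k = 0..k_max; it shrinks as n grows ("poissonization").
    p = lam / n
    return max(abs(binomial_pmf(k, n, p) - poisson_pmf(k, lam))
               for k in range(k_max + 1))

def shannon_entropy(counts):
    # Gibbs-Shannon entropy (in nats) of the empirical frequencies of
    # multinomial counts; zero counts contribute nothing.
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    return -sum(q * math.log(q) for q in probs)

if __name__ == "__main__":
    # Assumed illustration values, not taken from the paper.
    for n in (10, 100, 10000):
        print(n, max_pmf_gap(n, lam=2.0))
    # Toy counts standing in for occurrences of four residue types in one column.
    print(shannon_entropy([5, 3, 1, 1]))
```

In a protein-database setting the counts would presumably be occurrences of amino acids at a given position across the sequences of a family, but that mapping is an assumption here and is not spelled out in the abstract.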
