The prediction of residue contacts in a protein solely from sequence information is a promising approach to computational structure prediction. Recent developments use statistical or information theoretic methods to extract contact information from a multiple sequence alignment. Despite good results, accuracy is limited due to usage of two-body interactions within a Potts model. In this paper we generalize this approach and propose a Hamiltonian with an additional three-body interaction term. We derive a mean-field approximation for inference of three-body couplings within a Potts model which is fast enough on modern computers. Finally, we show that our model has a higher accuracy in predicting residue contacts in comparison with the plain two-body-interaction model.
[1]
G. Giacomello,et al.
Proteins structure.
,
1957,
Scientia medica italica. English ed.
[2]
Richard P. Lippmann,et al.
Proceedings of the 1997 conference on Advances in neural information processing systems 10
,
1990
.
[3]
W. Pearson,et al.
Current Protocols in Bioinformatics
,
2002
.
[4]
Michael Schmidt,et al.
hoDCA: higher order direct-coupling analysis
,
2018,
BMC Bioinformatics.
[5]
Debora S Marks,et al.
Deep generative models of genetic variation capture the effects of mutations
,
2018,
Nature Methods.