Paper information - Protein model quality assessment using 3D oriented convolutional neural networks

Protein model quality assessment (QA) is a crucial and still open problem in structural bioinformatics. The current best methods for single-model QA typically combine results from several approaches, each based on different input features constructed by experts in the field; a prediction model is then trained on these features with a machine-learning algorithm. The recent development of convolutional neural networks (CNNs) has changed this training paradigm. In computer vision, expert-developed features have been significantly surpassed by automatically trained convolutional filters. This motivated us to apply a three-dimensional (3D) CNN to the problem of protein model QA. We developed a novel method for single-model QA called Ornate. Ornate (Oriented Routed Neural network with Automatic Typing) is a residue-wise scoring function that takes 3D density maps as input. It predicts both the local (residue-wise) and the global model quality through a deep 3D CNN. Specifically, for each residue, Ornate aligns the input density map of that residue and its neighborhood with the residue's backbone topology. This circumvents the problem of ambiguous orientations of the initial models. Ornate also includes automatic identification of atom types and dynamic routing of the data in the network. Established benchmarks (CASP 11 and CASP 12) demonstrate the state-of-the-art performance of our approach among single-model QA methods. The method is available at https://team.inria.fr/nanod/software/Ornate/. It consists of a C++ executable that transforms molecular structures into volumetric density maps, and Python code based on the TensorFlow framework that applies the Ornate model to these maps.
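The orientation step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' C++ implementation. The function names (`local_frame`, `oriented_density`), the choice of frame axes, the grid size, the voxel spacing, and the Gaussian width are all assumptions made for illustration, and the per-atom-type channels that Ornate learns are collapsed into a single channel here.

```python
import numpy as np

def local_frame(n, ca, c):
    """Build a right-handed orthonormal frame from backbone atoms N, CA, C.

    Hypothetical convention: x points along CA->C, y lies in the N-CA-C
    plane, z completes the frame. Rows of the result are the frame axes.
    """
    n, ca, c = (np.asarray(a, dtype=float) for a in (n, ca, c))
    x = c - ca
    x = x / np.linalg.norm(x)
    v = n - ca
    y = v - np.dot(v, x) * x          # Gram-Schmidt: remove the x component
    y = y / np.linalg.norm(y)
    z = np.cross(x, y)
    return np.stack([x, y, z])

def oriented_density(atoms, n, ca, c, grid_size=24, voxel=0.8, sigma=1.0):
    """Voxelize neighboring atoms in the residue's local frame.

    grid_size, voxel (in angstroms) and sigma are illustrative values,
    not parameters taken from the Ornate code.
    """
    atoms = np.asarray(atoms, dtype=float)
    ca = np.asarray(ca, dtype=float)
    rot = local_frame(n, ca, c)
    local = (atoms - ca) @ rot.T      # atom coordinates in the local frame

    # Voxel centers of a cube centered on CA.
    half = grid_size * voxel / 2.0
    axis = (np.arange(grid_size) + 0.5) * voxel - half
    gx, gy, gz = np.meshgrid(axis, axis, axis, indexing="ij")

    grid = np.zeros((grid_size, grid_size, grid_size))
    for px, py, pz in local:
        d2 = (gx - px) ** 2 + (gy - py) ** 2 + (gz - pz) ** 2
        grid += np.exp(-d2 / (2.0 * sigma ** 2))   # one Gaussian per atom
    return grid
```

Because the grid is expressed in a frame attached to the residue's backbone, rotating or translating the whole model leaves the map unchanged. This is the property that removes the orientation ambiguity mentioned above.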

Guillaume Pagès | Sergei Grudinin | Benoit Charmettant
