Fertility-GRU: Identifying fertility-related proteins by incorporating deep gated recurrent units and original PSSM profiles.
Protein function prediction is a well-known problem in proteome research and has attracted numerous researchers. However, applying deep neural networks to improve protein function prediction remains a major challenge. This study proposes a deep learning approach, Fertility-GRU, that combines gated recurrent units with position-specific scoring matrix (PSSM) profiles to identify fertility-related proteins, which carry out a crucial biological function. Fertility-related proteins have also been shown to be important in many biological entities (e.g., bone marrow and peripheral blood, the postnatal mammalian ovary) and parameters (e.g., daily sperm production). Our model achieves a cross-validation accuracy of 85.8% and an independent-test accuracy of 91.1%. We also address overfitting by adding dropout layers to the deep learning model. On the independent test set, sensitivity, specificity, and MCC were 90.5%, 91.7%, and 0.82, respectively. Fertility-GRU outperforms the state-of-the-art predictor on the same dataset. The proposed method enables more proteins to be discovered, especially proteins associated with fertility. Moreover, our results could promote the use of recurrent networks and gated recurrent units in proteome research. The source code and dataset are freely accessible at https://github.com/khanhlee/fertility-gru.
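
For readers less familiar with the evaluation measures reported above, the following is a minimal sketch of how sensitivity, specificity, accuracy, and the Matthews correlation coefficient (MCC) are conventionally computed from a binary confusion matrix. The function name `binary_metrics` and the counts in the example are hypothetical and chosen only for illustration; they are not taken from the study or its repository.

```python
import math


def binary_metrics(tp, fn, tn, fp):
    """Compute standard binary-classification metrics from confusion counts.

    tp/fn: fertility-related proteins predicted correctly/incorrectly.
    tn/fp: non-fertility proteins predicted correctly/incorrectly.
    """
    total = tp + fn + tn + fp
    # Sensitivity (recall): fraction of true positives that were recovered.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Specificity: fraction of true negatives that were recovered.
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    accuracy = (tp + tn) / total if total else 0.0
    # MCC ranges from -1 to 1; by convention it is 0 when the denominator is 0.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


# Hypothetical counts for illustration only (not results from the paper).
example = binary_metrics(tp=45, fn=5, tn=40, fp=10)
for name, value in example.items():
    print(f"{name}: {value:.3f}")
```

MCC is commonly reported alongside accuracy because it uses all four cells of the confusion matrix and so remains informative when the two classes are imbalanced.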