Bio-inspired Computing: Theories and Applications: 14th International Conference, BIC-TA 2019, Zhengzhou, China, November 22–25, 2019, Revised Selected Papers, Part II

Determining the functional roles of proteins is a vital task to understand life at molecular level and has great biomedical and pharmaceutical implications. With the development of novel high-throughput techniques, enormous amounts of protein-protein interaction (PPI) data are collected and provide an important and feasible way for studying protein function predictions. According to this, many approaches assign biological functions to all proteins using PPI networks directly. However, due to the extreme complexity of the topology structure of real PPI networks, it is very difficult and time consuming to seek the global optimization or clustering on the networks. In addition, biological functions are often highly correlated, which makes functions assigned to proteins are not independent. To address these challenges, in this paper we propose a two-stage function annotation method with robust feature selection. First, we transform the network into the low-dimensional representations of nodes via manifold learning. Then, we integrate the functional correlation into the framework of multi-label linear regression, and introduce robust sparse penalty to achieve the function assignment and representative feature selection simultaneously. For the optimization, we design an efficient algorithm to iteratively solve several subproblems with closedform solutions. Extensive experiments against other baseline methods on Saccharomyces cerevisiae data demonstrate the effectiveness of the proposed approach.

[1]  Richard J. Lipton,et al.  Breaking DES using a molecular computer , 1995, DNA Based Computers.

[2]  R. Sharan,et al.  Network-based prediction of protein function , 2007, Molecular systems biology.

[3]  T. Mahalakshmi,et al.  Secure Data Transfer through DNA Cryptography using Symmetric Algorithm , 2016 .

[4]  E. Winfree,et al.  Construction of an in vitro bistable circuit from synthetic transcriptional switches , 2006, Molecular systems biology.

[5]  S. Mirkin,et al.  DNA H form requires a homopurine–homopyrimidine mirror repeat , 1987, Nature.

[6]  Teruo Fujii,et al.  Bottom-up construction of in vitro switchable memories , 2012, Proceedings of the National Academy of Sciences.

[7]  Donald Beaver,et al.  Factoring: The DNA Solution , 1994, ASIACRYPT.

[8]  Chithralekha Balamurugan,et al.  A Novel DNA Computing Based Encryption and Decryption Algorithm , 2015 .

[9]  Zhimin Chen,et al.  Aptamer-based regulation of transcription circuits. , 2019, Chemical communications.

[10]  Teruo Fujii,et al.  Predator-prey molecular ecosystems. , 2013, ACS nano.

[11]  Clifford R. Johnson,et al.  Solution of a 20-Variable 3-SAT Problem on a DNA Computer , 2002, Science.

[12]  X. Le,et al.  Rolling circle amplification: a versatile tool for chemical biology, materials science and medicine. , 2014, Chemical Society reviews.

[13]  Min-Ling Zhang,et al.  Lift: Multi-Label Learning with Label-Specific Features , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Erik Winfree,et al.  On applying molecular computation to the data encryption standard , 1999, DNA Based Computers.

[15]  Zhu-Hong You,et al.  Using manifold embedding for assessing and predicting protein interactions from high-throughput experimental data , 2010, Bioinform..

[16]  S. Kasif,et al.  Whole-genome annotation by using evidence integration in functional-linkage networks. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[17]  L M Adleman,et al.  Molecular computation of solutions to combinatorial problems. , 1994, Science.

[18]  Guangyuan Fu,et al.  NewGOA: Predicting New GO Annotations of Proteins by Bi-Random Walks on a Hybrid Graph , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[19]  P. Varalakshmi,et al.  Enhanced DNA and ElGamal cryptosystem for secure data storage and retrieval in cloud , 2018, Cluster Computing.

[20]  Víctor Robles,et al.  Feature selection for multi-label naive Bayes classification , 2009, Inf. Sci..

[21]  Mona Singh,et al.  Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps , 2005, ISMB.

[22]  Cheng Song,et al.  One-time-pad cryptography scheme based on a three-dimensional DNA self-assembly pyramid structure , 2018, PloS one.

[23]  Kenli Li,et al.  Fast Parallel Molecular Algorithms for DNA-Based Computation: Solving the Elliptic Curve Discrete Logarithm Problem over GF(2n) , 2007, 2007 Frontiers in the Convergence of Bioscience and Information Technologies.

[24]  M. Guéron,et al.  A tetrameric DNA structure with protonated cytosine-cytosine base pairs , 1993, Nature.

[25]  C. Zhang,et al.  Entropy-driven DNA logic circuits regulated by DNAzyme , 2018, Nucleic acids research.

[26]  Wang Zichen One-time-pad cryptography algorithm based on DNA cryptography , 2014 .

[27]  Stephen Neidle,et al.  Structure of a G-quadruplex-ligand complex. , 2003, Journal of molecular biology.

[28]  John H. Reif,et al.  DNA-based Cryptography , 1999, Aspects of Molecular Computing.

[29]  Lu Ming-xin DNA Computation and DNA Cryptography , 2006 .

[30]  Richard Bonneau,et al.  deepNF: deep network fusion for protein function prediction , 2017, bioRxiv.

[31]  Jing Yang,et al.  A molecular cryptography model based on structures of DNA self-assembly , 2014 .

[32]  Dmitrij Frishman,et al.  MIPS: a database for genomes and protein sequences , 1999, Nucleic Acids Res..

[33]  Jie Liu,et al.  Protein Function Prediction by Random Walks on a Hybrid Graph , 2016 .

[34]  Zheng Cheng,et al.  Nondeterministic Algorithm for Breaking Diffie-Hellman Key Exchange using Self-Assembly of DNA Tiles , 2014, Int. J. Comput. Commun. Control.

[35]  Jin Xu,et al.  One-Time-Pads encryption in the tile assembly model , 2008, 2008 3rd International Conference on Bio-Inspired Computing: Theories and Applications.

[36]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[37]  A. Turberfield,et al.  A DNA-fuelled molecular machine made of DNA , 2022 .

[38]  Yuriy Brun Arithmetic computation in the tile assembly model: Addition and multiplication , 2007, Theor. Comput. Sci..

[39]  David A Rusling,et al.  Triplex-forming oligonucleotides: a third strand for DNA nanotechnology , 2017, Nucleic acids research.

[40]  B. Schwikowski,et al.  A network of protein–protein interactions in yeast , 2000, Nature Biotechnology.

[41]  Guozhen Xiao,et al.  Symmetric-key cryptosystem with DNA technology , 2007, Science in China Series F: Information Sciences.

[42]  Min-Ling Zhang,et al.  Ml-rbf: RBF Neural Networks for Multi-Label Learning , 2009, Neural Processing Letters.

[43]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[44]  Jing Yang,et al.  Aptamer-Binding Directed DNA Origami Pattern for Logic Gates. , 2016, ACS applied materials & interfaces.

[45]  Y. Sakai,et al.  Programming an in vitro DNA oscillator using a molecular networking strategy , 2011, Molecular systems biology.

[46]  Alessandro Vespignani,et al.  Global protein function prediction from protein-protein interaction networks , 2003, Nature Biotechnology.

[47]  T. Takagi,et al.  Assessment of prediction accuracy of protein function from protein–protein interaction data , 2001, Yeast.