In Search of a Bridge Between Network Analysis in Computational Linguistics and Computational Biology - A Conceptual Note

Recently, the inference of biological networks has been studied whose vertices represent proteins and recurrent sequential patterns – called domain types – thereof; cf., for example, [1]. What makes this an outstanding research object from the point of view of data mining is the explorative analysis of large networks whose emergence is simulated in order to get insights into the dynamics of the focal area. This research program is connected to analyzing informational and, especially, textual networks as explored in the area of text mining. Consequently, the question is put forward which graph representation model is common to both areas of investigation. This paper addresses this question from the perspective of network motifs [2]. It reconstructs this notion from the point of view of syntagmatic and paradigmatic learning in computational linguistics. As an epiphenomenon, the paper sheds light on the applicability of text mining procedures in the area of bioinformatics.

[1]  P. Garvin,et al.  Prolegomena to a Theory of Language , 1953 .

[2]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[3]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[4]  Rens Bod,et al.  Beyond Grammar: An Experience-Based Theory of Language , 1998 .

[5]  Andrey Rzhetsky,et al.  Emergent behavior of growing knowledge about molecular interactions , 2005, Nature Biotechnology.

[6]  Marti A. Hearst Untangling Text Data Mining , 1999, ACL.

[7]  L.W.M. Bod An Experienced Based Theory of Language , 1998 .

[8]  Ferdinand de Saussure Grundfragen der allgemeinen Sprachwissenschaft , 1931 .

[9]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[10]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[11]  M. Newman,et al.  Why social networks are different from other types of networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Michael Krauthammer,et al.  Probabilistic inference of molecular networks from noisy data sources , 2004, Bioinform..

[13]  Edda Leopold,et al.  On Semantic Spaces , 2005, LDV Forum.

[14]  Joshua B. Tenenbaum,et al.  The Large-Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth , 2001, Cogn. Sci..

[15]  Eytan Ruppin,et al.  Automatic Acquisition and Efficient Representation of Syntactic Structures , 2002, NIPS.

[16]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[17]  Matthias Dehmer,et al.  A Systems Biology approach for the classification of DNA Microarray Data , 2005 .

[18]  R. Milo,et al.  Subgraphs in random networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Alexander Mehler,et al.  Text Linkage in the Wiki Medium - A Comparative Study , 2006 .

[20]  Hinrich Schütze,et al.  Ambiguity resolution in language learning - computational and cognitive models , 1997, CSLI lecture notes series.

[21]  Burghard B. Rieger,et al.  Situation Semantics and Computational Linguistics: towards Informational Ecology , 1996 .

[22]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[23]  Michael Krauthammer,et al.  GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway data , 2004, J. Biomed. Informatics.

[24]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[25]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..