Sequence Complexity and Composition

Local compositional complexity is a numerical measure of how repetitive a sequence of symbols drawn from a finite alphabet is. Highly repetitive sequences are considered simple, whereas highly nonrepetitive sequences are considered complex.

Keywords: alphabet; local compositional complexity; pattern; sequence analysis; sequence annotation
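As a rough illustration of the definition, the sketch below scores each window of a sequence by the Shannon entropy of its symbol composition. This is an assumption made for illustration only: the text does not specify a formula, and the function names `shannon_entropy` and `local_complexity`, the window length, and the step size are all hypothetical choices rather than the author's method.

```python
from collections import Counter
from math import log2


def shannon_entropy(window: str) -> float:
    """Shannon entropy (in bits) of the symbol composition of one window.

    Assumed stand-in for a complexity score: 0 for a window made of a
    single repeated symbol, log2(alphabet size) when all symbols of the
    alphabet occur equally often.
    """
    n = len(window)
    counts = Counter(window)
    return -sum((c / n) * log2(c / n) for c in counts.values())


def local_complexity(seq: str, window: int = 8, step: int = 1) -> list[float]:
    """Slide a fixed-length window along seq and score each position.

    `window` and `step` are illustrative defaults, not values from the text.
    """
    return [
        shannon_entropy(seq[i:i + window])
        for i in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    # Low scores mark compositionally simple stretches, high scores complex ones.
    for score in local_complexity("AAAAAAAAACGTTGCA", window=8, step=4):
        print(round(score, 3))
```

Scoring single symbols only captures composition, so a periodic window such as `ACGTACGT` still receives the maximum score. Counting overlapping k-symbol words instead of single symbols would be one way to make such repeats register as simple.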
