Hidden Markov Chains and the Analysis of Genome Structure

Abstract In this paper, statistical methods based on a hidden Markov chain model are used to study the structure of some small complete genomes and a human genome segment. A variety of discrete compositional domains are discovered and their correlations with genome function are explored.

[1]  A. Bird CpG-rich islands and the function of DNA methylation , 1986, Nature.

[2]  R. Staden,et al.  Nucleotide sequence of bacteriophage G4 DNA , 1978, Nature.

[3]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[4]  S. Aota,et al.  Giant G+C% mosaic structures of the human genome found by arrangement of GenBank human DNA sequences according to genetic positions. , 1990, Genomics.

[5]  R. Katz On Some Criteria for Estimating the Order of a Markov Chain , 1981 .

[6]  D. A. Clayton,et al.  Sequence and gene organization of mouse mitochondrial DNA , 1981, Cell.

[7]  David R. Wolf,et al.  Base compositional structure of genomes. , 1992, Genomics.

[8]  G. Churchill Stochastic models for heterogeneous DNA sequences. , 1989, Bulletin of mathematical biology.

[9]  B. Roe,et al.  The complete nucleotide sequence of the Xenopus laevis mitochondrial genome. , 1985, The Journal of biological chemistry.

[10]  M. Waterman,et al.  Statistical characterization of nucleic acid sequence functional domains. , 1983, Nucleic acids research.

[11]  F. Sanger,et al.  Sequence and organization of the human mitochondrial genome , 1981, Nature.

[12]  S. Weissman,et al.  The genome of simian virus 40. , 1978, Science.

[13]  P. Pevzner,et al.  Linguistics of nucleotide sequences. II: Stationary words in genetic texts and the zonal structure of DNA. , 1989, Journal of biomolecular structure & dynamics.

[14]  G Bernardi,et al.  The mosaic genome of warm-blooded vertebrates. , 1985, Science.

[15]  F. Sanger,et al.  Complete sequence of bovine mitochondrial DNA. Conserved features of the mammalian mitochondrial genome. , 1982, Journal of molecular biology.

[16]  A D Hershey,et al.  Segmental distribution of nucleotides in the DNA of bacteriophage lambda. , 1968, Journal of molecular biology.

[17]  D Benton,et al.  GenBank: current status and future directions , 1990 .

[18]  R. Elton,et al.  Theoretical models for heterogeneity of base composition in DNA. , 1974, Journal of theoretical biology.

[19]  R. Reeves,et al.  Xrep, a plasmid-stimulating X chromosomal sequence bearing similarities to the BK virus replication origin and viral enhancers. , 1986, Nucleic acids research.