Introducing statistical dependencies and structural constraints in variable-length sequence models

In the field of natural language processing, as in many other domains, the efficiency of pattern recognition algorithms depends heavily on a proper description of the underlying structure of the data. However, this hidden structure is usually not known, and it has to be learned from examples. The multigram model [1, 2] was originally designed to extract variable-length regularities within streams of symbols, by describing the data as the concatenation of statistically independent sequences. Such a description seems especially appealing in the case of natural language corpora, since natural language syntactic regularities are clearly of variable length: sentences are composed of a variable number of syntagms, which in turn are made of a variable number of words, which contain a variable number of morphemes, and so on. However, previous experiments with this model [3] revealed the inadequacy of the independence assumption in the particular context of a grapheme-to-phoneme transcription task. In this paper, our goal is therefore twofold:
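To make the basic multigram idea concrete, the following sketch illustrates how a symbol stream can be parsed into statistically independent variable-length sequences. The multigram inventory and its probabilities are purely hypothetical, and the dynamic-programming search shown here is only one common way to find the most likely segmentation under the independence assumption; it is not the estimation procedure of [1, 2].

```python
import math

# Hypothetical multigram inventory: variable-length symbol sequences
# with illustrative probabilities. Under the independence assumption,
# P(segmentation) is the product of the probabilities of its sequences.
multigrams = {
    ("a",): 0.3,
    ("b",): 0.2,
    ("a", "b"): 0.4,
    ("b", "a"): 0.1,
}

def best_segmentation(stream, multigrams, max_len=3):
    """Viterbi-style dynamic programming: return the most likely parse
    of `stream` into independent multigram sequences (log domain)."""
    n = len(stream)
    # best[i] = (best log-probability of stream[:i], length of last sequence)
    best = [(-math.inf, None)] * (n + 1)
    best[0] = (0.0, None)
    for i in range(1, n + 1):
        for l in range(1, min(max_len, i) + 1):
            seq = tuple(stream[i - l:i])
            p = multigrams.get(seq)
            if p is None:
                continue
            score = best[i - l][0] + math.log(p)
            if score > best[i][0]:
                best[i] = (score, l)
    # Backtrack to recover the segmentation.
    segs, i = [], n
    while i > 0:
        l = best[i][1]
        if l is None:  # no parse covers the stream
            return None, -math.inf
        segs.append(tuple(stream[i - l:i]))
        i -= l
    segs.reverse()
    return segs, best[n][0]

segs, logp = best_segmentation(list("abba"), multigrams)
# segs == [("a", "b"), ("b", "a")], with probability 0.4 * 0.1 = 0.04
```

Because each sequence contributes its probability independently of its neighbours, the score of a parse factorizes completely; it is precisely this factorization that the experiments in [3] found too restrictive.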