THE generation of proper word boundaries is an important part of several problems in information processing. Specifically, the speech recognition problem is often described as the production of a phonemic transcript, followed by the assembly of phonemes into complete words.1,2,3,4 The automatic translation of certain natural or artificial languages, such as, for example, Chinese and Japanese to English,5,6,7 or English to Braille8 also requires the generation of words in the output language which may correspond either to several items of input, or to only part of an input item. The segmentation problem is often complicated by the fact that each item of input may be associated with several possible output correspondents, only one of which is acceptable in any given context. Frequently, the reduction of each set of multiple correspondents is at least partly dependent upon the proper recognition of word boundaries. The English phoneme sequence/aban/ might, for example, correspond to the indefinite article "a" followed by the noun "ban", or it might form a verb or noun prefix as in "abandon", or "abandonment". Similarly, the Chinese character (dzi), which may be translated as "self" when standing alone, may in combination with other characters be translated variously as "freedom", "self-defence", "ego", "originality", "naturally", "freely", "liberalism", and so on. The generation of syntactically well-formed sentences in the output language is a common requirement for the set of problems under consideration. Since the material being processed does not, however, consist of complete syntactic units, it is first necessary to generate the appropriate structural information before any method based on syntax can be used. Two principal techniques are therefore proposed for the recognition of
[1]
Murray E Sherry.
PREDICTIVE SYNTACTIC ANALYSIS
,
1961
.
[2]
G. E. Peterson,et al.
Segmentation Techniques in Speech Synthesis
,
1958
.
[3]
K. Stevens.
Toward a Model for Speech Recognition
,
1960
.
[4]
Gerard Salton,et al.
The automatic transcription of machine shorthand
,
1899,
IRE-AIEE-ACM '59 (Eastern).
[5]
George A. Miller,et al.
Human memory and the storage of information
,
1956,
IRE Trans. Inf. Theory.
[6]
A. Oettinger,et al.
CURRENT RESEARCH ON AUTOMATIC TRANSLATION AT HARVARD UNIVERSITY 1 AND PREDICTIVE SYNTACTIC ANALYSIS
,
2010
.
[7]
Carol C. Lochbaum.
Segmentation of Speech into Voiced Sounds, Unvoiced Sounds, and Silence
,
1960
.
[8]
Victor H. Yngve,et al.
A model and an hypothesis for language structure
,
1960
.
[9]
Ida Rhodes.
A new approach to the mechanical syntactic analysis of Russian
,
1961,
Mech. Transl. Comput. Linguistics.
[10]
E. K. Blum,et al.
Proceedings of the National Symposium on Machine Translation.
,
1965
.
[11]
S. Watanabe,et al.
English-Japanese machine translation
,
1959,
IFIP Congress.
[12]
A. F. Parker-Rhodes.
An electronic computer program for translating Chinese into English
,
1956,
Mech. Transl. Comput. Linguistics.