An Algorithm for Translating Chemical Names to Molecular Formulas.

An algorithm for translating directly frnm chemical names to molecular formulas is described, ‘fire validity nf the algcmithm was tested both manually and hy computer. Molecular formulas of several hundred randomly selected chemicals were calculated successfully, verifying the linguistic analyses and the logic of the canputer program. The algorithm fnr manual human translation consists of eight simple operations, The procedure enables non-chemists to compute mnleculsr formulas quickly without drawing structural diagrams. The m~hine translation routine is rspid and requires a program of less than 100fl instructims. ff the experimental dictionary were expanded to include low frequency morphemes, fonrrulaa for all chemical names could be handled. The problem of chemical nom rnclature is discussed in terms of the information requirements of chemists. The approach of the Iin guist to the problem of nomenclature is cnntraated with that of the chutriat, It is shown that there is only one language of chemical nomenclature though there exist many systems of nomenclature, The difficulties in syntactically analyzing Chemical Abstracts (C. A.) “omaclatum results from C. A. ‘s ambiguous use of morfiemes such as imino, not the use of so-cal led trivial nomenclature, The more systematic 1.U. P. A.C, nomenclature includes idiomatic exlmes sions but eliminates all homonymous expressions. The structural linguist tries to dessribe a language cumpactly. While this study does not not inclu& a complete grammatical description of chemical nomenclature, all of the basic facets of such a grammar have been studied. These Iingriistic studies include a morpbnlngical analysis