A New Algorithm for the Alignment of Phonetic Sequences

Alignment of phonetic sequences is a necessary step in many applications in computational phonology. After discussing various approaches to phonetic alignment, I present a new algorithm that combines a number of techniques developed for sequence comparison with a scoring scheme for computing phonetic similarity on the basis of multivalued features. The algorithm performs better on cognate alignment, in terms of accuracy and efficiency, than other algorithms reported in the literature.
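
As a rough illustration of the general idea (not the algorithm presented here), the sketch below pairs a standard dynamic-programming global alignment with a similarity score computed from weighted multivalued features. The feature inventory, feature values, weights, `MAX_SCORE`, and `GAP` are all hypothetical values chosen for the example.

```python
from typing import Dict, List, Tuple

# Hypothetical multivalued features on a 0-100 scale (illustrative only).
FEATURES: Dict[str, Dict[str, int]] = {
    "p": {"syllabic": 0, "place": 100, "manner": 100, "voice": 0},
    "b": {"syllabic": 0, "place": 100, "manner": 100, "voice": 100},
    "t": {"syllabic": 0, "place": 85, "manner": 100, "voice": 0},
    "d": {"syllabic": 0, "place": 85, "manner": 100, "voice": 100},
    "a": {"syllabic": 100, "place": 0, "manner": 0, "voice": 100},
}
# Hypothetical feature weights (salience), maximum score, and gap penalty.
WEIGHTS: Dict[str, int] = {"syllabic": 5, "place": 40, "manner": 50, "voice": 10}
MAX_SCORE = 35
GAP = -10


def similarity(a: str, b: str) -> int:
    """Score two segments: identical segments get MAX_SCORE, and each
    feature difference lowers the score in proportion to its weight."""
    diff = sum(w * abs(FEATURES[a][f] - FEATURES[b][f]) for f, w in WEIGHTS.items())
    return MAX_SCORE - diff // 100


def align(x: str, y: str) -> Tuple[int, List[Tuple[str, str]]]:
    """Global alignment by dynamic programming; '-' marks a gap."""
    n, m = len(x), len(y)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    back = [[""] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0], back[i][0] = i * GAP, "up"
    for j in range(1, m + 1):
        score[0][j], back[0][j] = j * GAP, "left"
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            candidates = [
                (score[i - 1][j - 1] + similarity(x[i - 1], y[j - 1]), "diag"),
                (score[i - 1][j] + GAP, "up"),
                (score[i][j - 1] + GAP, "left"),
            ]
            # On ties, max() keeps the first candidate, so substitution wins.
            score[i][j], back[i][j] = max(candidates, key=lambda c: c[0])
    # Follow the stored back-pointers from the bottom-right cell.
    pairs: List[Tuple[str, str]] = []
    i, j = n, m
    while i > 0 or j > 0:
        move = back[i][j]
        if move == "diag":
            pairs.append((x[i - 1], y[j - 1]))
            i, j = i - 1, j - 1
        elif move == "up":
            pairs.append((x[i - 1], "-"))
            i -= 1
        else:
            pairs.append(("-", y[j - 1]))
            j -= 1
    pairs.reverse()
    return score[n][m], pairs


if __name__ == "__main__":
    print(align("pat", "bad"))
    print(align("pat", "at"))
```

Because the substitution score is graded rather than binary, segments that differ only in voicing (such as `p` and `b`) are aligned with each other in preference to opening gaps, which is the behavior one wants when comparing cognates.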
