Indicators of Allophony and Phonemehood

Although we are only able to distinguish between a finite, small number of sound categories--i.e. a given language's phonemes--no two sounds are actually identical in the messages we receive. Given the pervasiveness of sound-altering processes across languages--and the fact that every language relies on its own set of phonemes--the question of the acquisition of allophonic rules by infants has received a considerable amount of attention in recent decades. How, for example, do English-learning infants discover that the word forms [kaet] and [kat] refer to the same animal species (i.e. cat), whereas [kaet] and [baet] (i.e. cat ~ bat) do not? What kind of cues may they rely on to learn that [siŋkiŋ] and [θiŋkiŋ] (i.e. sinking ~ thinking) can not refer to the same action? The work presented in this dissertation builds upon the line of computational studies initiated by Peperkamp et al. (2006), wherein research efforts have been concentrated on the definition of sound-to-sound dissimilarity measures indicating which sounds are realizations of the same phoneme. We show that solving Peperkamp et al.'s task does not yield a full answer to the problem of the discovery of phonemes, as formal and empirical limitations arise from its pairwise formulation. We proceed to circumvent these limitations, reducing the task of the acquisition of phonemes to a partitioning-clustering problem and using multidimensional scaling to allow for the use of individual phones as the elementary objects. The results of various classification and clustering experiments consistently indicate that effective indicators of allophony are not necessarily effective indicators of phonemehood. Altogether, the computational results we discuss suggest that allophony and phonemehood can only be discovered from acoustic, temporal, distributional, or lexical indicators when--on average--phonemes do not have many allophones in a quantized representation of the input.

[1]  Z. Harris From Phoneme to Morpheme , 1955 .

[2]  Erik D. Thiessen,et al.  When cues collide: use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. , 2003, Developmental psychology.

[3]  George Kingsley Zipf,et al.  The Psychobiology of Language , 2022 .

[4]  Michael R. Brent,et al.  An Efficient, Probabilistically Sound Algorithm for Segmentation and Word Discovery , 1999, Machine Learning.

[5]  W. T. Penzhorn,et al.  Experimental comparison on several distance measures for speech processing applications , 1988, COMSIG 88@m_Southern African Conference on Communications and Signal Processing. Proceedings.

[6]  Michael Cysouw,et al.  Comment on “Phonemic Diversity Supports a Serial Founder Effect Model of Language Expansion from Africa” , 2012, Science.

[7]  Myra Spiliopoulou,et al.  C-DBSCAN: Density-Based Clustering with Constraints , 2009, RSFDGrC.

[8]  Patrick Mair,et al.  Multidimensional Scaling Using Majorization: SMACOF in R , 2008 .

[9]  D. Steinley Validating Clusters with the Lower Bound for Sum-of-Squares Error , 2007 .

[10]  D. Sloane,et al.  An Introduction to Categorical Data Analysis , 1996 .

[11]  Alexander M. Bronstein,et al.  Fast Multidimensional Scaling using Vector Extrapolation , 2008 .

[12]  Howard J. Hamilton,et al.  DBRS: A Density-Based Spatial Clustering Method with Random Sampling , 2003, PAKDD.

[13]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[14]  T. Vance,et al.  Some Aspects of Rendaku in Japanese and Related Problems in Theoretical Issues in Japanese Linguistics. , 1980 .

[15]  Forrest W. Young,et al.  Nonmetric individual differences multidimensional scaling: An alternating least squares method with optimal scaling features , 1977 .

[16]  Asoo J. Vakharia,et al.  A comparative investigation of hierarchical clustering techniques and dissimilarity measures applied to the cell formation problem , 1995 .

[17]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[18]  Afra Alishahi,et al.  Computational Modeling of Human Language Acquisition , 2010, ACL.

[19]  Ewan Dunbar,et al.  A Single-Stage Approach to Learning Phonological Categories: Insights From Inuktitut , 2013, Cogn. Sci..

[20]  P. Kuhl Early language acquisition: cracking the speech code , 2004, Nature Reviews Neuroscience.

[21]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[22]  Zellig S. Harris,et al.  From Phoneme to Morpheme , 1955 .

[23]  Martin Kay,et al.  Regular Models of Phonological Rule Systems , 1994, CL.

[24]  Phipps Arabie,et al.  Combinatorial Data Analysis: Optimization by Dynamic Programming , 1987 .

[25]  Hans-Hermann Bock,et al.  Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data , 2000 .

[26]  John Goldsmith,et al.  Learning Phonological Categories , 2009 .

[27]  R N Aslin,et al.  Statistical Learning by 8-Month-Old Infants , 1996, Science.

[28]  Colin Phillips,et al.  The influence of meaning on the perception of speech sounds. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Stephen Warshall,et al.  A Theorem on Boolean Matrices , 1962, JACM.

[30]  李幼升,et al.  Ph , 1989 .

[31]  J Bertoncini,et al.  Viewing The Development of Speech Perception As An Innately Guided Learning Process , 1988, Language and speech.

[32]  F. Dell,et al.  Les règles et les sons : introduction à la phonologie générative , 1985 .

[33]  Robert Tibshirani,et al.  Estimating the number of clusters in a data set via the gap statistic , 2000 .

[34]  Brian D. Ripley,et al.  Modern applied statistics with S, 4th Edition , 2002, Statistics and computing.

[35]  T. A. Cartwright,et al.  Distributional regularity and phonotactic constraints are useful for segmentation , 1996, Cognition.

[36]  Kenneth Ward Church,et al.  - 1-What ’ s Wrong with Adding One ? , 1994 .

[37]  R. Schiffer Psychobiology of Language , 1986 .

[38]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[39]  Paul Boersma,et al.  Learning abstract phonological from auditory phonetic catergories: an integrated model for the acquisition of language-specific sound categories , 2003 .

[40]  Julia Hirschberg,et al.  V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure , 2007, EMNLP.

[41]  Franck Ramus,et al.  A psycholinguistic perspective on the acquisition of phonology , 2010 .

[42]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing , 2000 .

[43]  Luc Boruta,et al.  Combining Indicators of Allophony , 2011, ACL.

[44]  Daniel F. Pontillo,et al.  Comment on “Phonemic Diversity Supports a Serial Founder Effect Model of Language Expansion from Africa” , 2012, Science.

[45]  Guoyin Wang,et al.  Rough Sets, Fuzzy Sets, Data Mining and Granular Computing , 2011, Lecture Notes in Computer Science.

[46]  Frank Höppner,et al.  How Much True Structure Has Been Discovered? , 2009, MLDM.

[47]  C. Pardo,et al.  Combinación de métodos factoriales y de análisis de conglomerados en R: el paquete FactoClass , 2007 .

[48]  John R. Hershey,et al.  Approximating the Kullback Leibler Divergence Between Gaussian Mixture Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[49]  C. Horan Multidimensional scaling: Combining observations when individuals have different perceptual structures , 1969 .

[50]  J. Leeuw Convergence of the majorization method for multidimensional scaling , 1988 .

[51]  R. Duin,et al.  The dissimilarity representation for pattern recognition , a tutorial , 2009 .

[52]  Philip C. Woodland Speaker adaptation for continuous density HMMs: a review , 2001 .

[53]  Zeev Volkovich,et al.  On a Minimal Spanning Tree Approach in the Cluster Validation Problem , 2009, Informatica.

[54]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[55]  William Gregory Sakas,et al.  Psychocomputational Models of Human Language Acquisition , 2007 .

[56]  Ricard V. Solé,et al.  Least effort and the origins of scaling in human language , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[57]  Z. Harris,et al.  Methods in structural linguistics. , 1952 .

[58]  Michalis Vazirgiannis,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. On Clustering Validation Techniques , 2022 .

[59]  J. Wells British English pronunciation preferences: a changing scene , 1999, Journal of the International Phonetic Association.

[60]  P. Smolensky,et al.  Optimality Theory: Constraint Interaction in Generative Grammar , 2004 .

[61]  S. Dolnicar,et al.  An examination of indexes for determining the number of clusters in binary data sets , 2002, Psychometrika.

[62]  Robert P. W. Duin,et al.  The dissimilarity space: Bridging structural and statistical pattern recognition , 2012, Pattern Recognit. Lett..

[63]  Peter Sollich,et al.  Theory of Neural Information Processing Systems , 2005 .

[64]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[65]  Andrew McCallum,et al.  Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.

[66]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[67]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[68]  C. Anton Rytting,et al.  Segmenting words from natural speech: subsegmental variation in segmental cues. , 2010, Journal of child language.

[69]  Q. Atkinson Phonemic Diversity Supports a Serial Founder Effect Model of Language Expansion from Africa , 2011, Science.

[70]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[71]  Timothy J. Robinson,et al.  Linear Models With R , 2005, Technometrics.

[72]  E. Newport,et al.  Computation of Conditional Probability Statistics by 8-Month-Old Infants , 1998 .

[73]  Csaba Legány,et al.  Cluster validity measurement techniques , 2006 .

[74]  Mark Johnson,et al.  Using Adaptor Grammars to Identify Synergies in the Unsupervised Acquisition of Linguistic Structure , 2008, ACL.

[75]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[76]  E. Forgy Cluster analysis of multivariate data : efficiency versus interpretability of classifications , 1965 .

[77]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[78]  Mark Johnson,et al.  Unsupervised Word Segmentation for Sesotho Using Adaptor Grammars , 2008, SIGMORPHON.

[79]  J. Munkres ALGORITHMS FOR THE ASSIGNMENT AND TRANSIORTATION tROBLEMS* , 1957 .

[80]  DANA AVRAM LUPŞA,et al.  UNSUPERVISED SINGLE-LINK HIERARCHICAL CLUSTERING , 2005 .

[81]  Mark Steyvers,et al.  Online Learning Mechanisms for Bayesian Models of Word Segmentation , 2010 .

[82]  Morten H. Christiansen,et al.  Learning to Segment Speech Using Multiple Cues: A Connectionist Model , 1998 .

[83]  R. Miller,et al.  The Japanese Language , 1968 .

[84]  Timothy J. Vance,et al.  The sounds of Japanese , 2008 .

[85]  Daniel Gildea,et al.  Learning Bias and Phonological-Rule Induction , 1996, CL.

[86]  Jiawei Han,et al.  Efficient and Effective Clustering Methods for Spatial Data Mining , 1994, VLDB.

[87]  P. Mermelstein,et al.  Distance measures for speech recognition, psychological and instrumental , 1976 .

[88]  Ari Rappoport,et al.  The NVI Clustering Evaluation Measure , 2009, CoNLL.

[89]  Kim Plunkett,et al.  Theories of early language acquisition , 1997, Trends in Cognitive Sciences.

[90]  J. Faraway Extending the Linear Model with R: Generalized Linear, Mixed Effects and Nonparametric Regression Models , 2005 .

[91]  Paul Boersma,et al.  Gradual constraint-ranking learning algorithm predicts acquisition order , 1999 .

[92]  David A. van Leeuwen,et al.  Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[93]  William M. Austin Criteria for Phonetic Similarity , 1957 .

[94]  John Mount,et al.  The equivalence of logistic regression and maximum entropymodels , 2011 .

[95]  Shigeki Matsuda,et al.  Feature-dependent allophone clustering , 2000, INTERSPEECH.

[96]  Sharon Peperkamp,et al.  Testing the Robustness of Online Word Segmentation: Effects of Linguistic Diversity and Phonetic Variation , 2011, CMCL@ACL.

[97]  Mark Johnson,et al.  A Discovery Procedure for Certain Phonological Rules , 1984, ACL.

[98]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[99]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[100]  Rozenn Le Calvez Approche computationnelle de l'acquisition précoce des phonèmes , 2007 .

[101]  Jacob Benesty,et al.  Springer handbook of speech processing , 2007, Springer Handbooks.

[102]  E. Tarone,et al.  Think or Sink: Chinese Learners' Acquisition of the English Voiceless Interdental Fricative , 2009 .

[103]  Toni Giorgino,et al.  Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package , 2009 .

[104]  Timothy Gambell,et al.  Statistics Learning and Universal Grammar: Modeling Word Segmentation , 2004 .

[105]  Xiao-Li Meng,et al.  [Optimization Transfer Using Surrogate Objective Functions]: Discussion , 2000 .

[106]  Mark Tse Voice Transformation , 2003 .

[107]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[108]  Paul Boersma,et al.  Optimality theory and phonological acquisition , 2004 .

[109]  Jan de Leeuw,et al.  Modern Multidimensional Scaling: Theory and Applications (Second Edition) , 2005 .

[110]  Thomas L. Griffiths,et al.  Adaptor Grammars: A Framework for Specifying Compositional Nonparametric Bayesian Models , 2006, NIPS.

[111]  Michalis Vazirgiannis,et al.  Clustering validity assessment: finding the optimal partitioning of a data set , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[112]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[113]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[114]  J. Goldsmith,et al.  The handbook of phonological theory , 2011 .

[115]  J. Saffran Constraints on Statistical Language Learning , 2002 .

[116]  Sharon Peperkamp,et al.  Phonological Acquisition: Recent Attainments and New Challenges , 2003, Language and speech.

[117]  J. Nadal,et al.  The acquisition of allophonic rules: Statistical learning with linguistic constraints , 2006, Cognition.

[118]  Woo Hyeok Chang On the Relation between Phonetics and Phonology , 2012 .

[119]  J. Werker,et al.  Influences on infant speech processing: toward a new synthesis. , 1999, Annual review of psychology.

[120]  Ramiz M. Aliguliyev,et al.  Performance evaluation of density-based clustering methods , 2009, Inf. Sci..

[121]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[122]  Anand Venkataraman,et al.  A Statistical Model for Word Discovery in Transcribed Speech , 2001, CL.

[123]  安藤 広志,et al.  20世紀の名著名論:David Marr:Vision:a Computational Investigation into the Human Representation and Processing of Visual Information , 2005 .

[124]  R. Herrnstein,et al.  Natural concepts in pigeons. , 1976, Journal of experimental psychology. Animal behavior processes.

[125]  P. Rousseeuw,et al.  Explicit scale estimators with high breakdown point , 1992 .

[126]  Howard J. Hamilton,et al.  Density-Based Spatial Clustering in the Presence of Obstacles and Facilitators , 2004, PKDD.

[127]  Daniel Recasens,et al.  Coarticulation and Connected Speech Processes , 2010 .

[128]  Julien Ugon,et al.  Classes and clusters in data analysis , 2006, Eur. J. Oper. Res..

[129]  Etienne Barnard,et al.  Phone clustering using the Bhattacharyya distance , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[130]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[131]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[132]  K. Maekawa CORPUS OF SPONTANEOUS JAPANESE : ITS DESIGN AND EVALUATION , 2003 .

[133]  Armin W. Schulz,et al.  Interpretations of probability , 2003 .

[134]  Jafar Habibi,et al.  A Novel Method to Find Appropriate epsilon for DBSCAN , 2010, ACIIDS.

[135]  André Hardy,et al.  An examination of procedures for determining the number of clusters in a data set , 1994 .

[136]  Minho Kim,et al.  New indices for cluster validity assessment , 2005, Pattern Recognit. Lett..

[137]  Paul Boersma,et al.  A programme for bidirectional phonology and phonetics and their acquisition and evolution , 2011 .

[138]  A. Agresti An introduction to categorical data analysis , 1997 .

[139]  Pavel Matejka,et al.  Towards Lower Error Rates in Phoneme Recognition , 2004, TSD.

[140]  Sadaoki Furui,et al.  International Speech Communication Association , 2006 .

[141]  Robert P. W. Duin,et al.  One-Class LP Classifiers for Dissimilarity Representations , 2002, NIPS.

[142]  William A. Gale,et al.  Good-Turing Frequency Estimation Without Tears , 1995, J. Quant. Linguistics.

[144]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[145]  Brian Roark,et al.  Structural Zeros versus Sampling Zeros , 2005 .

[146]  J Makhoul,et al.  State of the art in continuous speech recognition. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[147]  Zellig S. Harris,et al.  Methods in structural linguistics. , 1952 .

[148]  Steven Abney,et al.  Linguistic Issues in Language Technology LiLT , 2011 .

[149]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. I. , 1962 .

[150]  Robert Daland,et al.  Learning Diphone-Based Segmentation , 2011, Cogn. Sci..

[151]  Francis Jack Smith,et al.  Extension of Zipf’s Law to Words and Phrases , 2002, COLING.

[152]  T. Griffiths,et al.  A Bayesian framework for word segmentation: Exploring the effects of context , 2009, Cognition.

[153]  James L. McClelland,et al.  Unsupervised learning of vowel categories from infant-directed speech , 2007, Proceedings of the National Academy of Sciences.

[154]  Hagai Aronowitz,et al.  A distance measure between GMMs based on the unscented transform and its application to speaker recognition , 2005, INTERSPEECH.

[155]  Boris Mirkin,et al.  Clustering For Data Mining: A Data Recovery Approach (Chapman & Hall/Crc Computer Science) , 2005 .

[156]  Marcin Gorawski,et al.  AEC Algorithm: A Heuristic Approach to Calculating Density-Based Clustering Eps Parameter , 2006, ADVIS.

[157]  Robert P. W. Duin,et al.  The Dissimilarity Representation for Pattern Recognition - Foundations and Applications , 2005, Series in Machine Perception and Artificial Intelligence.

[158]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[159]  B. Hayes,et al.  Phonological Acquisition in Optimality Theory: The Early Stages 1 , 1999 .

[160]  W. Torgerson Multidimensional scaling: I. Theory and method , 1952 .

[161]  A. Bronstein,et al.  A MULTIGRID APPROACH FOR MULTI-DIMENSIONAL SCALING∗ , 2004 .

[162]  Amy Weinberg,et al.  Architectures and Mechanisms for Language Processing , 1999, Computational Linguistics.

[163]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[164]  Eve V. Clark,et al.  Language and Conceptual Development series How language acquisition builds on cognitive development , 2004 .

[165]  B. Hayes,et al.  Phonological Acquisition in Optimality Theory: the Early Stages 1 Submitted for a Forthcoming Volume on Phonological Acquisition and Typology, Edited Phonological Acquisition in Optimality Theory: the Early Stages , 1999 .

[166]  Luc Boruta A note on the generation of allophonic rules , 2011 .

[167]  P. Groenen,et al.  The tunneling method for global optimization in multidimensional scaling , 1996 .

[168]  Elizabeth K. Johnson,et al.  Word Segmentation by 8-Month-Olds: When Speech Cues Count More Than Statistics , 2001 .

[169]  Michael R. Berthold,et al.  Learning in parallel universes , 2010, Data Mining and Knowledge Discovery.

[170]  P. Boersma Modeling phonological category learning , 2012 .

[171]  Hans-Peter Kriegel,et al.  Clustering Multi-represented Objects with Noise , 2004, PAKDD.

[172]  H. Akaike A new look at the statistical model identification , 1974 .

[173]  K. Brannen The Role of Perception in Differential Substitution , 2002, Canadian Journal of Linguistics/Revue canadienne de linguistique.

[174]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[175]  J. Stemberger,et al.  Optimality Theory , 2003 .

[176]  Walter Daelemans,et al.  Learnability in Optimality Theory , 2000, Linguistic Inquiry.

[177]  A. Gray,et al.  Distance measures for speech processing , 1976 .

[178]  Hans-Peter Kriegel,et al.  Density-Connected Subspace Clustering for High-Dimensional Data , 2004, SDM.

[179]  P. Rousseeuw,et al.  Alternatives to the Median Absolute Deviation , 1993 .

[180]  Joseph Hilbe,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2009 .

[181]  S. Dudoit,et al.  A prediction-based resampling method for estimating the number of clusters in a dataset , 2002, Genome Biology.

[182]  John Coleman,et al.  Phonological Representations: Their Names, Forms and Powers , 1998 .

[183]  M.-C. Su,et al.  A new cluster validity measure and its application to image compression , 2004, Pattern Analysis and Applications.

[184]  Ricard V. Solé,et al.  Two Regimes in the Frequency of Words and the Origins of Complex Lexicons: Zipf’s Law Revisited* , 2001, J. Quant. Linguistics.

[185]  Jae Hee Lee,et al.  Contribution of consonant versus vowel information to sentence intelligibility for young normal-hearing and elderly hearing-impaired listeners. , 2007, The Journal of the Acoustical Society of America.

[186]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[187]  Micha Elsner,et al.  Bootstrapping a Unified Model of Lexical and Phonetic Acquisition , 2012, ACL.

[188]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[189]  Hitoshi Isahara,et al.  Spontaneous Speech Corpus of Japanese , 2000, LREC.

[190]  P. Niyogi,et al.  Quantifying the functional load of phonemic oppositions, distinctive features, and suprasegmentals , 2006 .

[191]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[192]  S. Renals,et al.  Handbook of Phonetic Sciences , 2010 .

[193]  Sharon Peperkamp,et al.  Coping with phonological variation in early lexical acquisition , 2000 .

[194]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[195]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[196]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[197]  Jarek Gryz,et al.  Algorithms and analyses for maximal vector computation , 2007, The VLDB Journal.

[198]  Yishai Tobin Phonology as Human Behavior: Theoretical Implications and Clinical Applications , 1997 .

[199]  Éric Gaussier,et al.  A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation , 2005, ECIR.

[200]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[201]  Zrinka Jelaska,et al.  Prototypicality and the Concept of Phoneme , 2005 .

[202]  C. F. Hockett A Manual of Phonology , 2013 .

[203]  M. Meilă Comparing clusterings---an information based distance , 2007 .

[204]  R. Mooney,et al.  Impact of Similarity Measures on Web-page Clustering , 2000 .

[205]  Shuly Wintner Formal Language Theory , 2010 .