Learning words from sights and sounds: a computational model

[1]  Cecile T. L. Kuijpers,et al.  Cross-language word segmentation by 9-month-olds , 2000, Psychonomic bulletin & review.

[2]  Deb Roy,et al.  Learning from multimodal observations , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[3]  Deb Roy,et al.  Integration of speech and vision using mutual information , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[4]  Michael R. Brent,et al.  An Efficient, Probabilistically Sound Algorithm for Segmentation and Word Discovery , 1999, Machine Learning.

[5]  Alex Pentland,et al.  Learning words from natural audio-visual input , 1998, ICSLP.

[6]  Alex Pentland,et al.  Word learning in a multimodal environment , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[7]  Aaron F. Bobick,et al.  A State-Based Approach to the Representation and Recognition of Gesture , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Pat Langley,et al.  Machine Learning for Adaptive User Interfaces , 1997, KI.

[9]  J. Werker,et al.  Infants listen for more phonetic detail in speech perception than in word-learning tasks , 1997, Nature.

[10]  Catharine H. Echols,et al.  The perception of rhythmic units in speech by infants and adults. , 1997 .

[11]  R N Aslin,et al.  Statistical Learning by 8-Month-Old Infants , 1996, Science.

[12]  Carl de Marcken,et al.  Unsupervised language acquisition , 1996, ArXiv.

[13]  J. Morgan A Rhythmic Bias in Preverbal Speech Segmentation , 1996 .

[14]  T. A. Cartwright,et al.  Distributional regularity and phonotactic constraints are useful for segmentation , 1996, Cognition.

[15]  J. Siskind A computational study of cross-situational techniques for learning word-to-meaning mappings , 1996, Cognition.

[16]  Bernt Schiele,et al.  Probabilistic object recognition using multidimensional receptive field histograms , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[17]  Alex Pentland,et al.  Real-time self-calibrating stereo person tracking using 3-D shape estimation from blob features , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[18]  Michael J. Pazzani,et al.  Syskill & Webert: Identifying Interesting Web Sites , 1996, AAAI/IAAI, Vol. 1.

[19]  Michael J. Carey,et al.  Statistical models for topic identification using phoneme substrings , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[20]  Alex Pentland,et al.  Pfinder: real-time tracking of the human body , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[21]  Carl de Marcken,et al.  The Unsupervised Acquisition of a Lexicon from Continuous Speech , 1995, ArXiv.

[22]  Mari Ostendorf,et al.  A dynamical system model for recognizing intonation patterns , 1995, EUROSPEECH.

[23]  D. Pisoni,et al.  Infants' Recognition of the Sound Patterns of Their Own Names , 1995, Psychological science.

[24]  Vassilios Digalakis,et al.  Speaker adaptation using constrained estimation of Gaussian mixtures , 1995, IEEE Trans. Speech Audio Process..

[25]  Rajesh P. N. Rao,et al.  Object indexing using an iconic sparse distributed memory , 1995, Proceedings of IEEE International Conference on Computer Vision.

[26]  Thad Starner,et al.  Visual Recognition of American Sign Language Using Hidden Markov Models. , 1995 .

[27]  Hynek Hermansky,et al.  RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[28]  Anthony J. Robinson,et al.  An application of recurrent nets to phone probability estimation , 1994, IEEE Trans. Neural Networks.

[29]  C. Mervis,et al.  Early object labels: the case for a developmental lexical principles framework , 1994, Journal of Child Language.

[30]  L. A. Hermens,et al.  A machine-learning apprentice for the completion of repetitive forms , 1994, IEEE Expert.

[31]  J. Reznick,et al.  Developmental and stylistic variation in the composition of early vocabulary , 1994, Journal of Child Language.

[32]  Hervé Bourlard,et al.  Connectionist Speech Recognition: A Hybrid Approach , 1993 .

[33]  Henry Lieberman,et al.  Watch what I do: programming by demonstration , 1993 .

[34]  P. Jusczyk,et al.  Infants' preference for the predominant stress patterns of English words. , 1993, Child development.

[35]  Alexander H. Waibel,et al.  Improving connected letter recognition by lipreading , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[36]  Pascale Fung,et al.  The BBN/HARC spoken language understanding system , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[37]  Tom M. Mitchell,et al.  A Personal Learning Apprentice , 1992, AAAI.

[38]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[39]  K. Stevens,et al.  Linguistic experience alters phonetic perception in infants by 6 months of age. , 1992, Science.

[40]  Paul J. Werbos,et al.  Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[41]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[42]  A. Gorin On automated language acquisition , 1989 .

[43]  Jonathan Harrington,et al.  Word boundary detection in broad class and phoneme strings , 1989 .

[44]  S. Furui,et al.  Unsupervised speaker adaptation method based on hierarchical spectral clustering , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[45]  J. Hampton Women, Fire, and Dangerous Things , 1989 .

[46]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[47]  Victor Zue,et al.  The MIT SUMMIT Speech Recognition System: A Progress Report , 1989, HLT.

[48]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[49]  L. Gleitman,et al.  Language and Experience: Evidence from the Blind Child , 1988 .

[50]  Linda B. Smith,et al.  The importance of shape in early lexical learning , 1988 .

[51]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[52]  R. Baillargeon Object permanence in 3½- and 4½-month-old infants. , 1987 .

[53]  R. Schwartz,et al.  Rapid speaker adaptation using a probabilistic spectral mapping , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[54]  J. Huttenlocher,et al.  Early word meanings: The case of object names , 1987, Cognitive Psychology.

[55]  Steven Pinker,et al.  Language learnability and language development , 1985 .

[56]  J. Bohannon,et al.  Intonation Patterns in Child-Directed Speech: Mother-Father Differences. , 1984 .

[57]  Raj Reddy,et al.  Steps Toward Graceful Interaction in Spoken and Written Man-Machine Communication , 1983, Int. J. Man Mach. Stud..

[58]  L. Cohen,et al.  Infant perception of correlations among attributes. , 1983, Child development.

[59]  Amye Warren Sex differences in speech to children , 1982 .

[60]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[61]  Seymour Papert,et al.  Mindstorms: Children, Computers, and Powerful Ideas , 1981 .

[62]  L. Bloom,et al.  Language development and language disorders , 1979 .

[63]  I. Bushnell Modification of the externality effect in young infants. , 1979, Journal of experimental child psychology.

[64]  H. Benedict,et al.  Early lexical development: comprehension and production , 1979, Journal of Child Language.

[65]  C. A. Ferguson,et al.  Talking to Children: Language Input and Acquisition , 1979 .

[66]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[67]  John R. Anderson,et al.  Induction of Augmented Transition Networks , 1977, Cogn. Sci..

[68]  A. E. Milewski,et al.  Infants' discrimination of internal and external pattern elements. , 1976, Journal of experimental child psychology.

[69]  M. Bornstein,et al.  Color vision and hue categorization in young human infants. , 1976, Journal of experimental psychology. Human perception and performance.

[70]  E. Rosch Cognitive Representations of Semantic Categories. , 1975 .

[71]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[72]  J. Kulikowski,et al.  Orientational selectivity of grating and line detectors in human vision. , 1973, Vision research.

[73]  P. Kay,et al.  Basic Color Terms: Their Universality and Evolution , 1973 .

[74]  C. Snow Mothers' Speech to Children Learning Language. , 1972 .

[75]  M. Posner,et al.  On the genesis of abstract ideas. , 1968, Journal of experimental psychology.

[76]  Geoffrey H. Ball,et al.  ISODATA, A NOVEL METHOD OF DATA ANALYSIS AND PATTERN CLASSIFICATION , 1965 .

[77]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[78]  W. Pitts,et al.  What the Frog's Eye Tells the Frog's Brain , 1959, Proceedings of the IRE.

[79]  D. Roy,et al.  Teachable Interfaces for Individuals with Dysarthric Speech and Severe Physical Disabilities , 2000 .

[80]  James W. Davis,et al.  The Representation and Recognition of Action Using Temporal Templates , 1997, CVPR 1997.

[81]  Michael I. Jordan Serial Order: A Parallel Distributed Processing Approach , 1997 .

[82]  Jerome A. Feldman,et al.  When push comes to shove: a computational model of the role of motor control in the acquisition of action verbs , 1997 .

[83]  Deb Roy,et al.  Toco the toucan: a synthetic character guided by perception, emotion, and story , 1997, SIGGRAPH '97.

[84]  Richard Rose,et al.  Word Spotting from Continuous Speech Utterances , 1996 .

[85]  Stephanie Seneff,et al.  Transcription and Alignment of the TIMIT Database , 1996 .

[86]  Kuldip K. Paliwal,et al.  Automatic Speech and Speaker Recognition , 1996 .

[87]  Emanuele Trucco,et al.  Computer and Robot Vision , 1995 .

[88]  J. Mehler,et al.  The periodicity bias , 1993 .

[89]  P. Jusczyk From general to language-specific capacities: the WRAPSA Model of how speech perception develops , 1993 .

[90]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[91]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[92]  Stephen A. Dyer,et al.  Digital signal processing , 2018, 8th International Multitopic Conference, 2004. Proceedings of INMIC 2004..

[93]  Jeffrey Mark Siskind,et al.  Naive physics, event perception, lexical semantics, and language acquisition , 1992 .

[94]  Chris Sinha,et al.  Symbol Grounding or the Emergence of Symbols? Vocabulary Growth in Children and a Connectionist Net , 1992 .

[95]  A. Asadi,et al.  Automatic detection and modeling of new words in a large-vocabulary continuous speech recognition system , 1992 .

[96]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[97]  M. Ghiselin,et al.  Coevolution: Genes, Culture, and Human Diversity , 1991, Politics and the Life Sciences.

[98]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[99]  William Maguire,et al.  From Visual Structure to Perceptual Function , 1990 .

[100]  E. Markman Categorization and naming in children , 1989 .

[101]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[102]  Raj Reddy,et al.  Large-vocabulary speaker-independent continuous speech recognition: the sphinx system , 1988 .

[103]  K. F. Lee,et al.  Towards speaker-independent continuous speech recognition , 1988 .

[104]  G. Miller,et al.  Cognitive science. , 1981, Science.

[105]  P. D. Eimas,et al.  chapter 6 – Speech Perception in Early Infancy1 , 1975 .

[106]  Laurent Siklossy,et al.  Natural language learning by computer , 1968 .

[107]  Willard Van Orman Quine,et al.  Word and Object , 1960 .

[108]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[109]  Zellig S. Harris,et al.  Distributional Structure , 1954 .