A Pendulum Swung too Far

Today's students might be faced with a very different set of challenges from those of the 1990s in the not-too-distant future. What should they do when most of the low hanging fruit has been pretty much picked over? In the particular case of Machine Translation, the revival of statistical approaches (e.g., Brown et al. (1993)) started out with finite-state methods for pragmatic reasons, but gradually over time, researchers have become more and more receptive to the use of syntax to capture long-distance dependences, especially when there isn't very much parallel corpora, and for language pairs with very different word orders (e.g., translating between a subject-verb-object (SVO) language like English and a verb final language like Japanese). Going forward, we should expect Machine Translation research to make more and more use of richer and richer linguistic representations. So too, there will soon be a day when stress will become important for speech recognition. Since it isn't possible for textbooks in computational linguistics to cover all of these topics, we should work with colleagues in other departments to make sure that students receive an education that is broad enough to prepare them for all possible futures, or at least all probable futures.

[1]  J. R. Firth,et al.  A Synopsis of Linguistic Theory, 1930-1955 , 1957 .

[2]  J. Orbach Principles of Neurodynamics. Perceptrons and the Theory of Brain Mechanisms. , 1962 .

[3]  J. Pierce An introduction to information theory: symbols, signals & noise , 1980 .

[4]  Alaa A. Kharbouch,et al.  Three models for the description of language , 1956, IRE Trans. Inf. Theory.

[5]  Noam Chomsky,et al.  The Sound Pattern of English , 1968 .

[6]  Sergei Nirenburg,et al.  ALPAC: The (In)Famous Report , 2003 .

[7]  Kenneth Ward Church,et al.  Introduction to the Special Issue on Computational Linguistics Using Large Corpora , 1993, Comput. Linguistics.

[8]  J. R. Pierce Whither Speech Recognition?—II , 1970 .

[9]  Benjamin Kuipers,et al.  Computer power and human reason , 1976, SGAR.

[10]  C. Habel,et al.  Language , 1931, NeuroImage.

[11]  S. Ariel,et al.  Introduction to Theoretical Linguistics. , 1968 .

[12]  K. W. Church On memory limitations in natural language processing , 1982 .

[13]  John W. Tukey,et al.  Exploratory Data Analysis. , 1979 .

[14]  John Cocke,et al.  A Statistical Approach to Language Translation , 1988, COLING.

[15]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.

[16]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[17]  Daniel Jurafsky,et al.  Studying the History of Ideas Using Topic Models , 2008, EMNLP.

[18]  A. Samuel,et al.  Whither speech recognition? , 1969, The Journal of the Acoustical Society of America.

[19]  John Sinclair,et al.  Looking up : an account of the COBUILD Project in lexical computing and the development of the Collins COBUILD English Language Dictionary , 1987 .

[20]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[21]  Mark Steedman,et al.  Last Words: On Becoming a Discipline , 2008, CL.

[22]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[23]  J.G. Wilpon,et al.  Whither speech recognition: the next 25 years , 1993, IEEE Communications Magazine.

[24]  John R. Pierce,et al.  Language and Machines: Computers in Translation and Linguistics , 1966 .

[25]  G. A. Miller,et al.  Finitary models of language users , 1963 .

[26]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[27]  John Sinclair,et al.  Collins COBUILD English Language Dictionary , 1987 .

[28]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.