Linguistic Issues in Language Technology LiLT

Computational linguistics is not a specialization of lin- guistics at all; it is a branch of computer science. A large majority of computational linguists have degrees in computer science and positions in computer science departments. It was founded as an offshoot of an en- gineering discipline (machine translation), and has been subsequently shaped by its place within artificial intelligence, and by a heavy influx of theory and method from speech recognition (another engineering discipline) and machine learning. But computation is a means to an end; the essential feature is data collection, analysis, and prediction on the large scale. I will call it data-intensive experimental linguistics. I wish to explain how data-intensive linguistics differs from mainstream practice, why I consider it to be genuine linguistics, and why I believe that it enables fundamental advances in our understanding of language.

[1]  Alaa A. Kharbouch,et al.  Three models for the description of language , 1956, IRE Trans. Inf. Theory.

[2]  Bruce W. Ballard,et al.  Proceedings of the second conference on Applied natural language processing , 1988 .

[3]  Charles F. Hockett,et al.  A mathematical theory of communication , 1948, MOCO.

[4]  Fei Xia Multilingual Structural Projection across Interlinearized Text , 2007 .

[5]  M. Sawicki,et al.  Human Genome Project. , 1993, American journal of surgery.

[6]  David Yarowsky,et al.  Inducing Multilingual Text Analysis Tools via Robust Projection across Aligned Corpora , 2001, HLT.

[7]  Henry S. Tropp,et al.  Wiener, Norbert , 2003 .

[8]  A. Ross Structural Linguistics , 1953, Nature.

[9]  Steven P. Abney Stochastic Attribute-Value Grammars , 1996, CL.

[10]  Sandy Lovie Shannon, Claude E , 2005 .

[11]  R. Darnell Translation , 1873, The Indian medical gazette.

[12]  C. Habel,et al.  Language , 1931, NeuroImage.

[13]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[14]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[15]  Thesaurus Linguae Graecae Thesaurus linguae graecae , 1992 .

[16]  E. Robinson Cybernetics, or Control and Communication in the Animal and the Machine , 1963 .

[17]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[18]  Martin Kay,et al.  A Life of Language , 2005, Computational Linguistics.

[19]  Claude E. Shannon,et al.  Prediction and Entropy of Printed English , 1951 .

[20]  Roman Jakobson,et al.  Structure of Language and Its Mathematical Aspects , 1961 .

[21]  Steven J. DeRose,et al.  Grammatical Category Disambiguation by Statistical Optimization , 1988, CL.

[22]  Steven Abney,et al.  Statistical Methods and Linguistics , 2002 .

[23]  Peter Norvig,et al.  Artificial intelligence - a modern approach, 2nd Edition , 2003, Prentice Hall series in artificial intelligence.

[24]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.

[25]  Fei Xia,et al.  Multilingual Structural Projection across Interlinear Text , 2007, HLT-NAACL.

[26]  John A. Goldsmith,et al.  Unsupervised Learning of the Morphology of a Natural Language , 2001, CL.