Linguistic computing with UNIX tools

This chapter presents an outline of applications to language analysis that open up through the combined use of two simple yet powerful programming languages with particularly short descriptions: sed and awk. We shall demonstrate how these two UNIX1 tools can be used to implement small, useful and customized applications ranging from text-formatting and text-transforming to sophisticated linguistic computing. Thus, the user becomes independent of sometimes bulky software packages which may be difficult to customize for particular purposes.

[1]  Larry Wall,et al.  Programming Perl , 1991 .

[2]  Steven V. Earhart UNIX programmer's manual , 1986 .

[3]  Frank A. Smadja,et al.  Lexical Co-occurrence: The Missing Link , 1989 .

[4]  Alfred V. Aho,et al.  The awk programming language , 1988 .

[5]  J. Milton,et al.  Qualification and Certainty in L1 and L2 Students' Writing , 1997 .

[6]  John M. Swales,et al.  Genre Analysis: English in Academic and Research Settings , 1993 .

[7]  Stephen Wolfram,et al.  Mathematica: a system for doing mathematics by computer (2nd ed.) , 1991 .

[8]  Averil Coxhead A New Academic Word List , 2000 .

[9]  E. Schmidt,et al.  Lex—a lexical analyzer generator , 1990 .

[10]  Anthony Fox,et al.  Linguistic Reconstruction: An Introduction to Theory and Method , 1995 .

[11]  Christopher Butler Computers in linguistics , 1985 .

[12]  Étienne Brunet,et al.  Méthodes quantitatives et informatiques dans l'étude des textes , 1986 .

[13]  P. S. Gingrich,et al.  The writer's workbench: Computer aids for text analysis , 1982 .

[14]  Ron Post,et al.  121 Common Mistakes of Japanese Students of English. James H. M. Webb , 1992 .

[15]  G. Little,et al.  Centennial Usage Studies , 1994 .

[16]  Lothar M. Schmitt,et al.  Pedagogical aspects of a UNIX-based network management system for English instruction 1 Both authors , 1998 .

[17]  Frank Tuzi,et al.  The impact of e-feedback on the revisions of L2 writers in an academic writing course , 2004 .

[18]  Leslie Lamport,et al.  Latex : A Document Preparation System , 1985 .

[19]  Brian W. Kernighan,et al.  The UNIX™ programming environment , 1979, Softw. Pract. Exp..

[20]  Michael Hoey,et al.  Patterns of Lexis In Text , 1991 .

[21]  Willem Meijs,et al.  Book Reviews: Theory and Practice in Corpus Linguistics , 1991, CL.

[22]  William F. Clocksin,et al.  Programming in Prolog , 1981, Springer Berlin Heidelberg.

[23]  W. Greub Linear Algebra , 1981 .

[24]  Sylviane Granger,et al.  Learner English on Computer , 1998 .

[25]  Alfred V. Aho,et al.  Awk — a pattern scanning and processing language , 1979, Softw. Pract. Exp..

[26]  Brian W. Kernighan,et al.  The C Programming Language , 1978 .

[27]  Murray Hill,et al.  Yacc: Yet Another Compiler-Compiler , 1978 .

[28]  Daniel Marcu,et al.  A Machine Learning Approach for Identification Thesis and Conclusion Statements in Student Essays , 2003, Comput. Humanit..

[29]  Kiel Christianson A TEXT ANALYSIS OF THE ENGLISH DOUBLE GENITIVE , 1997 .

[30]  Clyde A. Warden EFL Business Writing Behaviors in Differing Feedback Environments , 2000 .