Learning and using formal language

Regular expressions are a convenient and simple notation for expressing the members of a regular language. This simplicity enables regular expressions to serve as models of more complex context-free programming languages. To examine techniques in teaching programming, we exposed first year students with no knowledge of regular expressions to two tasks. The tasks asked participants to identify occurrences of a given expression (matching) or to create an expression to describe a set of occurrences (creation). Matching and creation can be viewed as parallel to reading and writing, suggesting that as reading skills help to develop writing skills, then similarly, matching skills help to develop creation skills. We hypothesised that a practice effect exists and that exposure to the matching task before the creation task improves performance on the latter. However, in an experimental setting, this hypothesis is not supported. Practice on recognition tasks provided no measurable benefits and suggests that the learning of formal language differs significantly from that of natural language. To verify this finding in an alternative setting, we performed a second experiment to test the hypothesis that performance will be similar when locating errors in HTML and when creating HTML. This experiment revealed no significant performance differences. One explanation for these results is that novices lack a lexicon of concepts to which language constructs can be mapped, and therefore view formal languages as rule-based systems. Consequently, the skills used for learning natural language can not be applied. These findings also suggest that for formal languages there is a greater need for instructional interventions such as performance feedback.

[1]  A. Lunsford What We Know. And Don't Know. About Remedial Writing , 1978 .

[2]  Noam Chomsky,et al.  On Certain Formal Properties of Grammars , 1959, Inf. Control..

[3]  N. Kanwisher Repetition blindness: Type recognition without token individuation , 1987, Cognition.

[4]  Brad A. Myers,et al.  Improving user performance on Boolean queries , 2000, CHI Extended Abstracts.

[5]  Howard R. Turtle Natural language vs. Boolean query evaluation: a comparison of retrieval performance , 1994, SIGIR '94.

[6]  Andrew Dillon,et al.  Speed and Accuracy using Four Boolean Query Systems , 1999 .

[7]  M. Posner,et al.  On the genesis of abstract ideas. , 1968, Journal of experimental psychology.

[8]  Steven Pinker,et al.  Words and rules , 1998 .

[9]  Mariolina Rizzi Salvatori,et al.  Reading and Writing a Text: Correlations between Reading and Writing Patterns , 1983 .

[10]  Rocky Ross,et al.  Mental models , 2004, SIGA.

[11]  Charles L. A. Clarke,et al.  On the use of regular expressions for searching text , 1997, TOPL.

[12]  Pertti Vakkari,et al.  Cognition and changes of search terms and tactics during task performance: A longitudinal case study , 2000, RIAO.

[13]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[14]  Andrew Singer,et al.  The natural language of interactive systems , 1980, CACM.

[15]  Louis M. Gomez,et al.  No IFs, ANDs, or ORs: A Study of Database Querying , 1990, Int. J. Man Mach. Stud..