The Topes Format Editor and Parser

Validating and manipulating data in web applications is currently difficult and time-consuming, so we have developed an editor and a parser to simplify these tasks. The editor enables end-user programmers to create and debug reusable, flexible data formats without learning a complex new language. The parser uses these formats to turn strings into structured objects and to report its confidence that each string is a valid instance of the format. With this system, end-user programmers can create validation code that responds in a graduated way to slightly invalid data. We evaluate the system’s expressiveness by defining formats for commonly occurring web data.

This work was funded in part by the National Science Foundation under ITR grant CCF-0325273 (via the EUSES Consortium) and under ITR grants CCF-0438929 and CCF-0324861.
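
To make the idea of a confidence-reporting parser and a graduated response more concrete, the following is a minimal sketch in Python. It is not the Topes editor's or parser's actual implementation or API. The phone-number format, the names `ParseResult`, `parse_phone`, and `respond`, the confidence weights, and the thresholds are all assumptions chosen for illustration.

```python
import re
from dataclasses import dataclass, field


@dataclass
class ParseResult:
    """Structured view of a string plus a confidence in [0.0, 1.0]."""
    parts: dict = field(default_factory=dict)
    confidence: float = 0.0


# Hypothetical format: a ten-digit phone number such as "412-268-1234".
_PHONE = re.compile(r"^\(?(\d{3})\)?[-. ]?(\d{3})[-. ]?(\d{4})$")


def parse_phone(text: str) -> ParseResult:
    """Split a phone number into named parts and score its validity."""
    match = _PHONE.match(text.strip())
    if match is None:
        # The string does not fit the format at all.
        return ParseResult(parts={}, confidence=0.0)

    area, exchange, number = match.groups()
    confidence = 1.0
    # Soft constraints (illustrative weights): a questionable value lowers
    # confidence instead of causing outright rejection.
    if area[0] in "01":
        confidence *= 0.3
    if exchange == "555":
        confidence *= 0.5

    parts = {"area": area, "exchange": exchange, "number": number}
    return ParseResult(parts=parts, confidence=confidence)


def respond(result: ParseResult) -> str:
    """Graduated response: accept, ask the user to confirm, or reject."""
    if result.confidence >= 0.9:   # assumed threshold
        return "accept"
    if result.confidence > 0.0:
        return "warn"
    return "reject"


if __name__ == "__main__":
    for text in ["412-268-1234", "(012) 555-1234", "not a number"]:
        result = parse_phone(text)
        print(text, result.parts, result.confidence, respond(result))
```

The design point is the middle branch of `respond`: a string that fits the format but violates a soft constraint triggers a warning the user can override, where a purely binary validator would have to either accept it silently or reject it.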
