Carbon: Domain-Independent Automatic Web Form Filling

Web forms are the main input mechanism for users to supply data to web applications. Users fill out forms in order to, for example, sign up to social network applications or do advanced searches in search-based web applications. This process is highly repetitive and can be optimized by reusing the user's data across web forms. In this paper, we present a novel framework for domainindependent automatic form filling. The main task is to automatically fill out a correct value for each field in a new form, based on web forms the user has previously filled. The key innovation of our approach is that we are able to extract relevant metadata from the previously filled forms, semantically enrich it, and use it for aligning fields between web forms.

[1]  Pedro M. Domingos,et al.  Learning to match ontologies on the Semantic Web , 2003, The VLDB Journal.

[2]  Luciano Serafini,et al.  Semantic Coordination: A New Approach and an Application , 2003, SEMWEB.

[3]  Edleno Silva de Moura,et al.  Automatically filling form-based web interfaces with free text inputs , 2009, WWW '09.

[4]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[5]  John Mylopoulos,et al.  The Semantic Web - ISWC 2003 , 2003, Lecture Notes in Computer Science.

[6]  Pedro M. Domingos,et al.  Learning to map between ontologies on the semantic web , 2002, WWW '02.

[7]  Eero Hyvönen,et al.  Semantic Autocompletion , 2006, ASWC.

[8]  William E. Winkler,et al.  The State of Record Linkage and Current Research Problems , 1999 .

[9]  Fausto Giunchiglia,et al.  The Semantic Web - ASWC 2006, First Asian Semantic Web Conference, Beijing, China, September 3-7, 2006, Proceedings , 2006, ASWC.

[10]  Sriram Raghavan,et al.  Crawling the Hidden Web , 2001, VLDB.

[11]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[12]  Clement T. Yu,et al.  Automatic extraction of web search interfaces for interface schema integration , 2004, WWW Alt. '04.

[13]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .

[14]  Pedro M. Domingos,et al.  Learning to Match the Schemas of Data Sources: A Multistrategy Approach , 2003, Machine Learning.

[15]  Jens Lehmann,et al.  Discovering Unknown Connections - the DBpedia Relationship Finder , 2007, CSSW.

[16]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[17]  Ingmar Weber,et al.  Type less, find more: fast autocompletion search with a succinct index , 2006, SIGIR.

[18]  Mark A. Musen,et al.  PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment , 2000, AAAI/IAAI.

[19]  Juliana Freire,et al.  Learning to extract form labels , 2008, Proc. VLDB Endow..

[20]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .