Web automation applications are widely used for different purposes such as B2B integration and automated testing of web applications. Most current systems build the automatic web navigation component by using the APIs of conventional browsers. While this approach has its advantages, it suffers performance problems for intensive web automation tasks which require real time responses and/or a high degree of parallelism. In this paper, we outline a set of techniques to build a web navigation component able to efficiently execute web navigation sequences. These techniques detect what elements and scripts of the pages accessed during the navigation sequence are needed for the correct execution of the sequence (and, therefore, must be loaded and executed), and what parts of the pages can be discarded. The tests executed with real web sources show that the optimized navigation sequences run significantly faster and consume significantly less resources.
[1]
Sebastian G. Elbaum,et al.
Supporting end-users in the creation of dependable web clips
,
2007,
WWW '07.
[2]
Alberto Pan,et al.
Automated browsing in AJAX websites
,
2011,
Data Knowl. Eng..
[3]
Juliana Freire,et al.
Automating Web navigation with the WebVCR
,
2000,
Comput. Networks.
[4]
Ángel Viña,et al.
Semi-Automatic Wrapper Generation for Commercial Web Sources
,
2002,
Engineering Information Systems in the Internet Context.
[5]
I. V. Ramakrishnan,et al.
Computational aspects of resilient data extraction from semistructured sources (extended abstract)
,
2000,
PODS '00.
[6]
Rob Miller,et al.
Smart bookmarks: automatic retroactive macro recording on the web
,
2007,
UIST.