Automatic Judgment of Deep Web Query Interfaces

Traditional Web search engines work well for finding crawlable pages, but they ignore the tremendous amount of information hidden behind query forms in large searchable electronic databases. To obtain this dynamic information, query interfaces must first be extracted from the massive number of Web forms in order to find the entrances to the underlying datasets. This paper describes a technique for detecting query interfaces using naive Bayes classification and reports the test results.
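
As an illustration of the general idea only, the following is a minimal sketch of a multinomial naive Bayes classifier with Laplace smoothing that separates query interfaces from other Web forms. The feature tokens (such as `input:password` or `submit:search`), the class labels `query` and `other`, the class name `NaiveBayesFormClassifier`, and the toy training set are all assumptions made for this example; they are not the features, data, or implementation used in the paper.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesFormClassifier:
    """Multinomial naive Bayes over bag-of-token form features (hypothetical)."""

    def fit(self, samples, labels):
        self.class_counts = Counter(labels)       # number of forms per class
        self.token_counts = defaultdict(Counter)  # token frequencies per class
        self.vocab = set()
        for tokens, label in zip(samples, labels):
            self.token_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total = len(labels)
        return self

    def log_scores(self, tokens):
        """Return the unnormalized log posterior of each class."""
        scores = {}
        vocab_size = len(self.vocab)
        for label, n in self.class_counts.items():
            score = math.log(n / self.total)  # log prior
            denom = sum(self.token_counts[label].values()) + vocab_size
            for tok in tokens:
                if tok in self.vocab:  # tokens never seen in training are skipped
                    # Laplace (add-one) smoothed log likelihood
                    score += math.log((self.token_counts[label][tok] + 1) / denom)
            scores[label] = score
        return scores

    def predict(self, tokens):
        scores = self.log_scores(tokens)
        return max(scores, key=scores.get)


# Toy, hand-made training data (an assumption for illustration, not real results).
TRAIN = [
    (["input:text", "select", "submit:search"], "query"),
    (["input:text", "submit:search", "label:keyword"], "query"),
    (["select", "select", "input:text", "submit:find"], "query"),
    (["input:text", "input:password", "submit:login"], "other"),
    (["input:text", "input:email", "submit:subscribe"], "other"),
    (["input:password", "input:email", "submit:register"], "other"),
]

clf = NaiveBayesFormClassifier().fit(
    [tokens for tokens, _ in TRAIN],
    [label for _, label in TRAIN],
)

search_form = ["input:text", "select", "submit:search"]
login_form = ["input:text", "input:password", "submit:login"]
print(clf.predict(search_form), clf.predict(login_form))
```

In this sketch each form is reduced to a bag of tokens and the class with the highest smoothed log posterior is chosen. A real system would need a step that extracts such tokens from HTML forms and a labeled corpus far larger than this toy set.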