Monitoring scientific publications over the WWW

The World Wide Web has become an important medium for disseminating scientific publications. To make their research accessible to other researchers, most research institutions list their publications on an index page, which sometimes includes links to online versions of the publications. Because an index page is usually updated whenever new papers are published, researchers must check these pages frequently to learn of new publications. This manual monitoring process is tedious and time-consuming. In this paper, a publication monitoring system, known as PubWatcher, is proposed to automatically track Web publications from user-specified Web sites or pages. A publication extraction technique has been developed to extract the publication information listed in the index pages of the monitored Web sites and pages.