Unsupervised web event extraction framework

To acquire real event information published on the Internet effectively and easily, an unsupervised web event extraction framework is proposed. The framework first extracts events from table web pages by exploiting the parallel structure of the DOM. The events extracted from table web pages are then used as seeds to summarize the corresponding patterns from detail web pages, and the summarized patterns are used to extract further events from detail web pages. The framework was verified on a large number of websites, and its extraction results were compared with those of a common wrapper-generation algorithm. The results indicate that the framework is feasible and achieves better extraction quality on detail web pages than the wrapper-generation algorithm.
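The two stages described above can be illustrated with a minimal Python sketch. It is not the framework's actual implementation: the class `RowCollector` and the functions `extract_seed_events`, `induce_patterns`, and `apply_patterns` are hypothetical names, and the sketch rests on simplifying assumptions. "Parallel structure" is approximated as sibling `<tr>` rows that have the same number of cells as the header row, and a "pattern" is approximated as the literal text that precedes a seed value on a line of a detail page.

```python
from html.parser import HTMLParser


class RowCollector(HTMLParser):
    """Collect the text of each <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row currently being parsed
        self._cell = None  # text fragments of the cell currently being parsed

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and normalize whitespace inside the cell.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def extract_seed_events(html):
    """Stage 1 (assumed form): rows that parallel the header become events."""
    parser = RowCollector()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    # Keep only rows whose cell count matches the header (parallel rows).
    return [dict(zip(header, row)) for row in body if len(row) == len(header)]


def induce_patterns(seed, detail_text):
    """Stage 2a (assumed form): learn the text preceding each seed value."""
    patterns = {}
    for field, value in seed.items():
        for line in detail_text.splitlines():
            idx = line.find(value)
            if value and idx > 0:  # require a non-empty left context
                patterns[field] = line[:idx]
                break
    return patterns


def apply_patterns(patterns, detail_text):
    """Stage 2b (assumed form): extract fields from a new detail page."""
    event = {}
    for line in detail_text.splitlines():
        for field, prefix in patterns.items():
            if field not in event and line.startswith(prefix):
                event[field] = line[len(prefix):].strip()
    return event
```

In this sketch, a seed event taken from a table page is located in the text of its own detail page to learn the label prefixes, and those prefixes are then applied to other detail pages of the same site. A real system would need to operate on the DOM of the detail page and tolerate variation between pages; plain line prefixes are used here only to keep the example short.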