Auto-learning Web Information Extraction Based on XML

The Internet offers an explosive volume of information, and a great deal of important and useful knowledge lies within its abundant Web resources. Information explosion combined with knowledge deficiency is a serious problem for modern society, because locating the data a user actually cares about through a search engine is inconvenient. Automating Web information extraction could significantly improve the efficiency with which information is absorbed: it can discover and analyze targeted information, discard redundant data, and extract the information relevant to the user's knowledge domain. This article analyzes an XML-based methodology for Web information extraction, discusses the technologies involved in applying it, and establishes a Web information extraction model that realizes automatic extraction by automatically learning the extraction rules.
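
As an illustration only, the idea of learning an extraction rule from an example and reusing it on other pages can be sketched as follows. This is not the model established in the article: it assumes the page has already been converted to well-formed XML, it represents a rule as a simple tag path, and the names `learn_rule`, `extract`, `TRAIN_PAGE` and `NEW_PAGE` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample pages, assumed to be already well-formed XML.
TRAIN_PAGE = (
    "<html><body><div><h1>Widget A</h1>"
    "<p>Price:</p><span>19.99</span></div></body></html>"
)
NEW_PAGE = (
    "<html><body><div><h1>Widget B</h1>"
    "<p>Price:</p><span>24.50</span></div></body></html>"
)


def learn_rule(xml_text, example_value):
    """Learn a tag path (relative to the root) that leads to the first
    element whose text equals the user-supplied example value."""
    root = ET.fromstring(xml_text)

    def walk(node, path):
        # Depth-first search for the element holding the example value.
        if (node.text or "").strip() == example_value:
            return path
        for child in node:
            found = walk(child, path + [child.tag])
            if found is not None:
                return found
        return None

    path = walk(root, [])
    if path is None:
        raise ValueError("example value not found in document")
    return "./" + "/".join(path) if path else "."


def extract(xml_text, rule):
    """Apply a learned tag path to another page with the same layout."""
    root = ET.fromstring(xml_text)
    return [(el.text or "").strip() for el in root.findall(rule)]


rule = learn_rule(TRAIN_PAGE, "19.99")
values = extract(NEW_PAGE, rule)
```

Here one labelled example on the training page yields a path that is then applied unchanged to a second page. A tag path without positions or attributes matches every element on that path, so a practical rule would need further constraints.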