Research and design of HTML parser based on page segmentation
暂无分享,去创建一个
The technologies of Web page parser were introduced. And after making a best estimation of the merits and weakness of the existing methods, a more effective method for segmenting the HTML page in the news Web site was proposed. And then,a HTML Parser named TVPS was designed and realized based on the requirement of the projects. The experimental results show that the system has achieved great performance and meets the needs.