Automatic metadata generation based on neural network

Metadata technologies can allow the proper search and processing of Web pages, which has been used in many fields, such as e-commerce, library science, web search, and so on. In the approach, a novel framework is proposed to automatically generate Dublin Core metadata about web pages, what is finally expressed in XML. The framework is located on a web server, where can provide detailed information about web pages to generate the metadata. Moreover, the framework utilizes a combination of a well-trained neural network and traditional statistical methods to filter proper value for the metadata. Experimental results show that the proposed framework is preferable to the traditional statistical method.

[1]  Ali Selamat,et al.  Web page feature selection and classification using neural networks , 2004, Inf. Sci..

[2]  Charlotte Jenkins,et al.  Server-side automatic metadata generation using qualified Dublin Core and RDF , 2000, Proceedings 2000 Kyoto International Conference on Digital Libraries: Research and Practice.

[3]  Philippe Blache,et al.  A semantic vector space and features-based approach for automatic information filtering , 2004, Expert Syst. Appl..

[4]  Chih-Ming Chen,et al.  Incremental personalized Web page mining utilizing self-organizing HCMAC neural network , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[5]  Irena Koprinska,et al.  A neural network based approach to automated e-mail classification , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[6]  Soe-Tsyr Yuan,et al.  A personalized and integrative comparison-shopping engine and its applications , 2003, Decis. Support Syst..