MetaData Pro: Ontology-Based Metadata Processing for Web Resources

Metadata is the foundation of Semantic Web. The MetaData Pro project seeks to build a metadata processing platform for web resource. The system architecture has three key components– Metadata Extraction, Ontology Management, and Metadata Retrieval. The system can automatically extract metadata about web resource: if the web resource itself contains metadata, extracts them; otherwise, automatically generates the metadata for the resource according to Dublin Core by applying automatic keyword extraction and text summarization techniques. To manage the metadata, MetaData Pro integrates Protege to create domain ontology, makes use of HowNet to help ontology construction, and provides an ontology-based metadata retrieval.

[1]  Stefan Decker,et al.  Creating Semantic Web Contents with Protégé-2000 , 2001, IEEE Intell. Syst..

[2]  Witold Abramowicz,et al.  Knowledge Discovery for Business Information Systems , 2001 .

[3]  Timothy W. Finin,et al.  Information retrieval on the Semantic Web: Integrating inference and retrieval , 2003, SIGIR 2003.

[4]  Aldo Gangemi,et al.  Ontology Learning and Its Application to Automated Terminology Translation , 2003, IEEE Intell. Syst..

[5]  Paolo Tonella,et al.  Using keyword extraction for Web site clustering , 2003, Fifth IEEE International Workshop on Web Site Evolution, 2003. Theme: Architecture. Proceedings..

[6]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.