Application of the Dublin Core format for automatic metadata generation and extraction

This article describes a set of services and tools that information systems can use to obtain metadata collections in an automated fashion from online content or other electronic repositories. This multi-module software follows a service-oriented design based on the analysis of electronic resources published on the web or on local networks. Using several algorithms, it can generate and extract metadata elements from documents, whether they are explicitly declared or derived from an analysis of the document's content. The adopted model makes the system scalable and distributed, so it can be extended to recognize more formats, metadata types, and other sources.
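
To illustrate the difference between extracting declared metadata and generating it from content, here is a minimal sketch in Python. It is not the system's implementation: the class `DublinCoreExtractor` and the function `extract_dublin_core` are hypothetical names, and the sketch assumes that declared elements appear as HTML `meta` tags whose names start with `DC.` and that the `<title>` element is an acceptable fallback for a generated title.

```python
from html.parser import HTMLParser


class DublinCoreExtractor(HTMLParser):
    """Hypothetical helper: collects declared DC.* meta tags and <title> text."""

    def __init__(self):
        super().__init__()
        self.metadata = {}      # element name -> list of declared values
        self._in_title = False
        self._title_text = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta":
            name = (attrs.get("name") or "").lower()
            content = attrs.get("content")
            # Assumption: declared elements use the "DC." name prefix.
            if name.startswith("dc.") and content:
                self.metadata.setdefault(name[3:], []).append(content.strip())
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self._title_text.append(data)


def extract_dublin_core(html):
    """Return declared elements, plus a generated title if none was declared."""
    parser = DublinCoreExtractor()
    parser.feed(html)
    parser.close()
    metadata = dict(parser.metadata)
    title = "".join(parser._title_text).strip()
    # Generated element: fall back to the document's own content.
    if "title" not in metadata and title:
        metadata["title"] = [title]
    return metadata
```

Calling `extract_dublin_core` on a page that declares `DC.creator` but no `DC.title` would return the declared creator together with a title taken from the page's `<title>` element. A fuller system would add further analyzers per format and per element.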