The Digital Library and the Archiving System for Educational Institutes

At present, there are several formats that exist through which data is distributed among online stakeholders. An example of this is the XML, which like other such formats is helpful for traditional inquiry methods and for forming the foundation of query languages such as SPARQL and SQL. Information about primary representation demands a broader assistance for the languages where every piece of data from any resource can substantiate the original queries for searching. Such models are useful for XML based retrieval since several cooperative XML search engines have been developed already. These search engines perform semantic investigation of XML files with data surrounded by the important fields. Therefore, XML files are used to store and index data intended for competent retrieval. In this research, an attempt is made to fill this gap of customized representation and retrieval with a focus on the educational domain. An institute's repository of books, e-books, journals, articles and research theses has been used to retrieve results. A system has been proposed and developed to store the contents of Institute's Databank as an object of the Digital Library. A structured method has been proposed to organize all the data and a system has been developed which extracts meaningful information from the Data Bank. The information repository is established, and the entire data is represented in terms of a unit called Digital Object in the Digital Library. The single unit is represented by recording some quantitative data about it referred to as ‘Metadata'. The search is focused on extracting meaningful information from the repository by applying some filtration strategies to get relevant information, best matched with the query terms. At the end, a partitioning and parallelism focused architecture to archive the information for sharing, back-up and collaboration is also proposed. Comparison of the proposed scheme with state of the art schemes is provided in terms of computational complexity and recall measurement.