Metadata domain-knowledge driven search engine in "HyperManyMedia" E-learning resources

In this paper, we exploit the synergies between Information Retrieval and E-learning by describing the design of a system that applies Information Retrieval techniques to web-based E-learning. With the exponential growth of the web, the general-purpose character of web applications has started to diminish, while domain-specific and personal aspects have started to rise, e.g., personalized web pages, the use of a user's browsing and purchasing history, and topical (focused) search engines. The explosion in the amount of information on the web makes it difficult for online students to find specific information in a specific media format unless a prior analysis has been made. We present a metadata domain-driven search engine that indexes text, PowerPoint, audio, video, podcast, and vodcast lectures. These lectures are stored in a prototype "HyperManyMedia" web-based E-learning platform. Each lecture in this platform has been tagged with metadata using domain knowledge of these resources.
