This project addresses the main research problem in information retrieval and semantic search. It proposes the smart search theory as new theory based on hypothesis that semantic meanings of a document can be described by a set of
keywords. With two experiments designed and carried out in this project, the experiment result demonstrates positive evidence that meet the smart search theory.
In the theory proposed in this project, the smart search aims to determine a set of keywords for any web documents, by which the semantic meanings of the documents can be uniquely identified. Meanwhile, the size of the set of keywords is supposed to be small enough which can be easily managed. This is the fundamental assumption for creating the smart semantic search engine. In this project, the rationale of the assumption and the theory based on it will be discussed, as well as the processes of how the theory can be applied to the keyword allocation and the data model to be
generated. Then the design of the smart search engine will be proposed, in order to create a solution to the efficiency problem while searching among huge amount of increasing information published on the web.
To achieve high efficiency in web searching, statistical method is proved to be an effective way and it can be interpreted from the semantic level. Based on the frequency of joint keywords, the keyword list can be generated and linked to each other to form a meaning structure. A data model is built when a proper keyword list is achieved and the model is applied to the design of the smart search engine.
[1]
Gerhard Weikum,et al.
Probabilistic information retrieval approach for ranking of database query results
,
2006,
TODS.
[2]
R. Studer,et al.
Semantic Web Technologies: Trends and Research in Ontology-based Systems
,
2006
.
[3]
Jane Hunter,et al.
A Semantic Search Engine for the Storage Resource Broker
,
2006
.
[4]
Thomas Keays,et al.
Semantic Web for the Working Ontologist
,
2008
.
[5]
Ranasinghe W.M. Tharanga Dilruk.
Search Engine : An Effective tool for exploring the Internet
,
2006
.
[6]
Ben He,et al.
Terrier : A High Performance and Scalable Information Retrieval Platform
,
2022
.
[7]
Michael McGill,et al.
Introduction to Modern Information Retrieval
,
1983
.
[8]
Frank van Harmelen,et al.
A semantic web primer
,
2004
.
[9]
Ophir Frieder,et al.
Information Retrieval: Algorithms and Heuristics
,
1998
.
[10]
James A. Hendler,et al.
The Semantic Web" in Scientific American
,
2001
.
[11]
Timothy W. Finin,et al.
Swoogle: a search and metadata engine for the semantic web
,
2004,
CIKM '04.
[12]
Brian Jacobsen,et al.
Topic-specific search engine
,
2003
.