A Semantic Search Engine in the Cloud

Due to the ease of data production within the Internet era, knowledge workers are increasingly overwhelmed by information from multiple information sources and yet still find it hard to navigate and search for accessing the specific information required for the task at hand. This implies that knowledge worker productivity is reduced and that organizations may be making decisions on the basis of incomplete knowledge. Most search engines in use today strongly rely on keywords matching and on the ability of the user in the query expression. This leads to the retrieval of a large amount of irrelevant information with a direct impact on the user that spends a lot of time in browsing the results and/or to construct more complex queries to refine the search output. To overcome this limitation semantic-based solution are increasingly adopted. In this work we propose a general architecture that implements a semantic search engine in the cloud that exploits semantic technologies to retrieve and present the right information to the user. Our search engine is aimed at providing support in the task of document composition, suggesting to the user the adequate section that could be inserted within a document.

[1]  Valeria Vittorini,et al.  Solution Workflows for Model-Based Analysis of Complex Systems , 2012, IEEE Transactions on Automation Science and Engineering.

[2]  Alexander Schill,et al.  Automatic indexing of scanned documents: a layout-based approach , 2012, Electronic Imaging.

[3]  Max Silberztein,et al.  NooJ: a Linguistic Annotation System for Corpus Processing , 2005, HLT.

[4]  Flora Amato,et al.  Knowledge Representation and Management for E-Government Documents , 2008, E-Government, ICT Professionalism and Competences Service Science.

[5]  Thorsten Meinl,et al.  KNIME: The Konstanz Information Miner , 2007, GfKl.

[6]  Georg Gottlob,et al.  Ontology-based semantic search on the Web and its combination with the power of inductive reasoning , 2012, Annals of Mathematics and Artificial Intelligence.

[7]  Flora Amato,et al.  Building RDF Ontologies from Semi-Structured Legal Documents , 2008, 2008 International Conference on Complex, Intelligent and Software Intensive Systems.

[8]  Hamish Cunningham,et al.  GATE-a General Architecture for Text Engineering , 1996, COLING.

[9]  Flora Amato,et al.  A Semantic-based Document Processing Framework: A Security Perspective , 2011, 2011 International Conference on Complex, Intelligent, and Software Intensive Systems.

[10]  Flora Amato,et al.  A system for semantic retrieval and long-term preservation of multimedia documents in the e-government domain , 2009, Int. J. Web Grid Serv..

[11]  Carlo Strapparava,et al.  WordNet for Italian and Its Use for Lexical Deiscrimination , 1997, AI*IA.

[12]  Simonetta Vietri,et al.  Data Mining Modular Software System , 2010, SWWS.

[13]  Prabhakar Raghavan,et al.  Information retrieval algorithms: a survey , 1997, SODA '97.

[14]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[15]  Flora Amato,et al.  A semantic based methodology to classify and protect sensitive data in medical records , 2010, 2010 Sixth International Conference on Information Assurance and Security.

[16]  Evgeniy Gabrilovich,et al.  Concept-Based Information Retrieval Using Explicit Semantic Analysis , 2011, TOIS.

[17]  Mark A. Musen,et al.  Semantic Wiki Search , 2009, ESWC.

[18]  Dunja Mladenic,et al.  Challenges of Semantic Knowledge Management , 2009, Semantic Knowledge Management.

[19]  Heiner Stuckenschmidt,et al.  Ontology-Based Integration of Information - A Survey of Existing Approaches , 2001, OIS@IJCAI.