Performance of elasticsearch in cloud environment with nGram and non-nGram indexing

The fact that technology have changed the lives of human beings cannot be denied. It has drastically reduced the effort needed to perform a particular task and has increased the productivity and efficiency. Computers especially have been playing a very important role in almost all fields in today's world. They are used to store large amount of data in almost all sectors, be it business and industrial sectors, personal lives or any other. The research areas of science and technology uses computers to solve complex and critical problems. Information is the most important requirement of each individual. In this era of quick-growing and huge data, it has become increasingly illogical to analyse it with the help of traditional techniques or relational databases. New big data instruments, architectures and designs have come into existence to give better support to the requirements of organizations/institutions in analysing large data. Specifically, Elasticsearch, a full-text java based search engine, designed keeping cloud environment in mind solves issues of scalability, search in real time, and efficiency that relational databases were not able to address. In this paper, we present our own experience with Elasticsearch an open source, Apache Lucene based, full-text search engine that provides near real-time search ability, as well as a RESTful API for the ease of user in the field of research.

[1]  Clinton Gormley,et al.  Elasticsearch: The Definitive Guide , 2015 .

[2]  Yong Deng,et al.  Cloud storage and search for mass spatio-temporal data through Proxmox VE and Elasticsearch cluster , 2014, 2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems.

[3]  Jun Bai,et al.  Feasibility analysis of big log data real time search based on Hbase and ElasticSearch , 2013, 2013 Ninth International Conference on Natural Computation (ICNC).

[4]  Michael W. Godfrey,et al.  Mining modern repositories with elasticsearch , 2014, MSR 2014.