Enabling top-n file retrieval in cloud storage using hadoop distributed file system

A storage system in cloud is well thought-out as a very big scale storage system that has independent storage servers. The service that cloud storage provides is, that can store user's data from remote through network and other authenticated users can access the data easily. Hadoop distributed file system is used to store large files consistently and to retrieve those files at very high bandwidth to user applications. Hadoop splits the files into large blocks and distributes them amongst the nodes in the cluster. When we retrieve data from the cloud, it is very important that the computation and communication overhead should be reduced. To reduce the communication overhead the server should send only the top-n files based on the keyword when the user asks for the data files. Since the owner need not maintain the copy of the files, it is all the more necessary to make check on the files available and also check the originality of the files stored in the server periodically. In HDFS the computation is done in parallel so that the execution time is drastically reduced. In the proposed system for retrieving top-n files we use Hadoop Distributed File System, so that the search time and the communication overhead is greatly reduced.

[1]  Bo Zhang,et al.  An efficient public key encryption with conjunctive-subset keywords search , 2011, J. Netw. Comput. Appl..

[2]  J. Jospin Jeya,et al.  Multi Key Word Search and Trusted Auditing System to Verify the Integrity of Outsourced Data in Cloud Computing , 2014 .

[3]  Jun Zhang,et al.  Dirichlet Process Mixture Model for Document Clustering with Feature Partition , 2013, IEEE Transactions on Knowledge and Data Engineering.

[4]  Minglu Li,et al.  Toward Secure Multikeyword Top-k Retrieval over Encrypted Cloud Data , 2013, IEEE Transactions on Dependable and Secure Computing.

[5]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[6]  J. Jospin Jeya,et al.  EFFICIENT RANKED AND SECURE FILE RETRIEVAL IN CLOUD COMPUTING , 2014 .

[7]  Xi Chen,et al.  Efficient Proxy Re-encryption with Private Keyword Searching in Untrusted Storage , 2011, International Journal of Computer Network and Information Security.

[8]  Bruno Crispo,et al.  Enforcing Multi-user Access Policies to Encrypted Cloud Databases , 2011, 2011 IEEE International Symposium on Policies for Distributed Systems and Networks.