Neighbourhood systems based knowledge acquisition using MapReduce from Big Data over cloud computing

Cloud computing represents a paradigm shift - a transition from computing-as-a-product to computing-as-aservice. It can be applied to a range of areas, including ecommerce, health, education, communities, etc which are emerging as the important sectors in today's market. Day-byday more knowledge is added to the internet and is shared amongst the users in addition to the official use over the cloud. As a result the energy consumption by the networks is increasing and it needs to be managed. This usage can be brought into account for measuring and hence conserving the energy. The consumption is all together considered for the processing, storage and transport of the knowledge granules over the cloud. Since the data accessed in the cloud is “ondemand”, the prediction techniques like those using rough sets can be used to minimize the transfer of data over the cloud networks. The data over the cloud can be procured with the help of rough set based methods efficiently which can help in conserving the energy. Recently, the basic rough set theory has been extended to the notion of neighbourhood based rough sets where information systems with heterogeneous features are prevalent. In this paper, we propose a neighbourhood based rough set approach for knowledge acquisition using MapReduce from Big Data.

[1]  Jignesh M. Patel,et al.  A comparison of join algorithms for log processing in MaPreduce , 2010, SIGMOD Conference.

[2]  Yi Pan,et al.  Parallel rough set based knowledge acquisition using MapReduce from big data , 2012, BigMine '12.

[3]  Michael C. Schatz,et al.  Rapid parallel genome indexing with MapReduce , 2011, MapReduce '11.

[4]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[5]  Benjamin Moseley,et al.  Fast clustering using MapReduce , 2011, KDD.

[6]  Qinghua Hu,et al.  Mixed feature selection based on granulation and approximation , 2008, Knowl. Based Syst..

[7]  Christoforos E. Kozyrakis,et al.  Evaluating MapReduce for Multi-core and Multiprocessor Systems , 2007, 2007 IEEE 13th International Symposium on High Performance Computer Architecture.

[8]  Da Ruan,et al.  A parallel method for computing rough set approximations , 2012, Inf. Sci..

[9]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[10]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[11]  Qinghua Hu,et al.  Neighborhood rough set based heterogeneous feature subset selection , 2008, Inf. Sci..

[12]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[13]  Jimmy J. Lin,et al.  Design patterns for efficient graph algorithms in MapReduce , 2010, MLG '10.

[14]  Geoffrey C. Fox,et al.  Twister: a runtime for iterative MapReduce , 2010, HPDC '10.

[15]  Naga K. Govindaraju,et al.  Mars: A MapReduce Framework on graphics processors , 2008, 2008 International Conference on Parallel Architectures and Compilation Techniques (PACT).

[16]  Yi Liu,et al.  A novel architecture towards trusted E-commerce cloud , 2012, 2012 International Conference on E-Learning and E-Technologies in Education (ICEEE).

[17]  Zdzisław Pawlak,et al.  Rough sets and information systems , 1988 .