Towards a New Model of Storage and Access to Data in Big Data and Cloud Computing

Thetechnologicalrevolutionintegratingmultipleinformationsourcesandextensionofcomputer scienceindifferentsectorsledtotheexplosionofthedataquantities,whichreflectsthescalingofvolumes,numbersandtypes.Thesemassiveincreaseshaveresultedinthedevelopmentofnewlocation techniquesandaccesstodata.Thefinalstepsinthisevolutionhaveemergednewtechnologies:Cloud andBigData.ThereferenceimplementationoftheCloudsandBigDatastorageisincontestablythe HadoopDistributedFileSystem(HDFS).Thislatterisbasedontheseparationofmetadatatodatathat consistsinthecentralizationandisolationofthemetadataofstorageservers.Inthispaper,theauthors proposeanapproachtoimprovetheservicemetadataforHadooptomaintainconsistencywithout muchcompromisingperformanceandscalabilityofmetadatabysuggestingamixedsolutionbetween centralizationanddistributionofmetadatatoenhancetheperformanceandscalabilityofthemodel. KeywoRDS Big Data, Clouds of Storage, Hadoop, HDFS, MapReduce, Metadata

[1]  Ghalem Belalem,et al.  A Migration Approach for Fault Tolerance in Cloud Computing , 2014, Int. J. Grid High Perform. Comput..

[2]  Nilanjan Dey,et al.  Replication and Resubmission Based Adaptive Decision for Fault Tolerance in Real Time Cloud Computing: A New Approach , 2016, Int. J. Serv. Sci. Manag. Eng. Technol..

[3]  Ghalem Belalem,et al.  Lightweight coordinated checkpointing in cloud computing , 2014, J. High Speed Networks.

[4]  Alan L. Cox,et al.  The Hadoop distributed filesystem: Balancing portability and performance , 2010, 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS).

[5]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[6]  Hal R. Varian,et al.  Reprint: How Much Information? , 2000 .

[7]  Shyam Antony,et al.  Data Management Challenges in Cloud Computing Infrastructures , 2010, DNIS.

[8]  Divyakant Agrawal,et al.  Big data and cloud computing , 2010, Proc. VLDB Endow..

[9]  Michael Stonebraker,et al.  MapReduce and parallel DBMSs: friends or foes? , 2010, CACM.

[10]  Daniel J. Abadi,et al.  Data Management in the Cloud: Limitations and Opportunities , 2009, IEEE Data Eng. Bull..

[11]  Vishal Bhatnagar,et al.  Movie Analytics for Effective Recommendation System using Pig with Hadoop , 2016, Int. J. Rough Sets Data Anal..

[12]  Raouf Boutaba,et al.  Cloud computing: state-of-the-art and research challenges , 2010, Journal of Internet Services and Applications.

[13]  Abraham Silberschatz,et al.  HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads , 2009, Proc. VLDB Endow..

[14]  Divyakant Agrawal,et al.  Big data and cloud computing: current state and future opportunities , 2011, EDBT/ICDT '11.

[15]  Guy. Auteur du texte Chesnot,et al.  Big data et cloud : stockage et traitement de données du futur (2e éd.) Guy Chesnot , 2017 .

[16]  Kevin Curran,et al.  Cloud Computing Security , 2011, Int. J. Ambient Comput. Intell..

[17]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[18]  Sanjay P. Ahuja,et al.  State of Big Data Analysis in the Cloud , 2013, Netw. Commun. Technol..

[19]  Nathan Marz,et al.  Big Data: Principles and best practices of scalable realtime data systems , 2015 .

[20]  Asser N. Tantawi,et al.  On the Modeling and Management of Cloud Data Analytics , 2010 .

[21]  N. B. Venkateswarlu,et al.  A Novel Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance , 2016, Int. J. Ambient Comput. Intell..

[22]  Ghalem Belalem,et al.  Task scheduling strategy based on data replication in scientific Cloud workflows , 2016, Multiagent Grid Syst..