Processing Technology of Massive Human Health Data Based on Hadoop

With the development of science and medical industry, people pay more and more attention to their health status. And massive human health data are generated in this process. As an important component of the cloud computing technology, the open source framework Hadoop provides us with a platform for storing and processing massive data. For the bottleneck of the existing Hadoop framework to deal with the small files in human health data, this paper proposes two optimization strategy: index optimization and metadata prefetching. At the end of the paper, the simulation results show that the method has excellent performance.