The Distributed Storage System Based on MPP for Mass Data

The traditional database technology can't meet the rapidly growing information and the demands of extreme scalability, high availability and reliability for the mass data. In this paper, we have designed a distributed storage system based MPP(massively parallel processing) architecture on RDBMS to solve these problems. First, we present the main framework and the function architecture of the design. Then we do some experiments and performance tests. At last, we conclude that with the good scalability and massively parallel processing advantage, the system can solve the mass data storage problem. The idea of distributed storage system based MPP architecture in relational databases speeds up reading and writing, improving the shortcomings of traditional database.