Korea has a single National Health Insurance program and all citizens are covered under this program, accounting 97% of the popu-lation, approximately 50 million people. Claims submitted by Health care providers are reviewed by Health Insurance Review and Assessment (HIRA) for the reimbursement. HIRA database contains not only individual beneficiary's information, but also healthcare service information such as diagnosis, procedures, prescriptions and tests for them. HRA database has gained attention as impor-tance source for research due to its rich healthcare information and the demand of HIRA database has increased. Due to its tremen-dous size, however, researchers have had problems in accessing the database to conduct research. To meet this demand, we con-ducted a study to develop the inpatient sample data from HIRA database for research. This study has two purposes: 1) to determine a needed sample size; 2) to test reliability and validity of the sample data. We determined an adequate sample size to ensure repre-sentativeness and generality with additional consideration for convenience of calculation. The minimum sample size was 729,904 for the generality, and 488,861 for representativeness. After considering the convenience of calculation, our final sample size was 13% of the population, which was about 7.7 million beneficiaries. Age (5 years interval) and gender were used as stratification vari-ables for sampling. In order to examine whether this sample data appropriately reflect population, we tested the reliability and va-lidity of the sample data. From the sample data, we computed average expenditure of total claims per inpatient for 2011, frequency of top 30 disease, estimation of the number of stroke patients from the sample data, and then compared them to those from the population. Results confirmed reliability and validity of the sample data .Keywords: Administrative data; National Health Insurance; Claims data; Inpatient; Sampling; Generality; Reliability; Validity