The exploration of information extraction and analysis about science and technology policy in China

Purpose Science and technology policy plays an important role in promoting the development of economic and social development in China. At present, the research on science and technology policy is mainly focused on the basic theories and some quantitative research. The analyses for contents of massive science and technology policies are relatively less. This paper makes use of semantic technologies to extract and analyze the relatively important information from massive science and technology policies. The purpose of this paper is to facilitate users to quickly and effectively obtain valuable information from the massive science and technology policies. The key methods and study results are presented in the paper. The study results can provide references for further study and application in China. Design/methodology/approach The paper presented the analysis model and method for science and technology policy in China. The terms and sentences are the important information in the science and technology policy. The study adopted the technology of natural language processing to analyze the linguistics characteristics of terms and combined with statistical analyses to extract the terms from Chinese science and technology policy. Then, the authors designed an algorithm, calculated and analyzed the important sentences in Chinese science and technology policies. The experiments were run on the Java test platform. Findings This paper put forward the analysis model and method for science and technology policy in China. The study obtained the following conclusions: term extraction of science and technology policy: the paper analyzed characteristic of terms in Chinese science and technology policy and designed a method of extracting a term that was suitable for the science and technology policy. The calculation of important sentences for science and technology policy: the paper designed an algorithm and calculated the importance of the sentences to obtain valuable information from the massive science and technology policies. Research limitations/implications In our methods, there are some defects to be improved or solved in the future. For example, the precision of algorithm needs to be improved. The significance of this paper is to propose and use the analysis model to process Chinese science and technology policy; we can provide an auxiliary tool to help policy beneficiaries. Enterprises and individuals can be more effective to extraction and mining information from massive science and technology policy and find the target policy. Practical implications To verify the effectiveness of the method, the paper selected the real policies about the new energy vehicles as experimental data; at the same time, the paper added uncorrelated policies. It used the proposed analysis model of science and technology policy to calculate and find out the relatively important sentences. The results of study showed that the proposed method can obtain better performance. It verified the validity of this method. The model and method have been applied to actual retrieval system. Social implications The proposed model and method in the paper have been applied to actual retrieval system for users. Originality/value The paper proposed the new analysis model and method to analyze science and technology policies in China. The presented model and method are a new attempt. According to the experimental results, this exploration and study are valuable. In addition, the idea and method will give a good start for improving information services of massive science and technology policies in China.