Common data manipulations with R in biological researches.
暂无分享,去创建一个
R is a computer language and has been widely used in science community due to the powerful capability in data analysis and visualization; and these functions are mainly provided by the developed packages. Because every package has strict format definitions on the inputted data, it is always required to appropriately manipulate the original data in advance. Unfortunately, users, especially for the beginners, are always confused by the extreme flexibility with R in data manipulation. In the present paper, we roughly categorize the common manipulations with R for biological data into four classes, including overview of data, transformation, summarization, and reshaping. Subsequently, these manipulations are exemplified in a sample data of clinical records of diabetic patients. Our main purpose is to provide a better landscape on the data manipulation with R and hence facilitate the practical applications in biological researches.
[1] Sylvia Tippmann,et al. Programming tools: Adventures with R , 2014, Nature.
[2] Ross Ihaka,et al. Gentleman R: R: A language for data analysis and graphics , 1996 .