Dealing with Identifier Variables in Data Management and Analysis
暂无分享,去创建一个
Identifier variables are prominent in most data files and, more often than not, are essential to fully use the information in a Stata dataset. However, rendering them in the proper format and relevant number of digits appropriate for data management and statistical analysis might pose unnerving challenges to inexperienced or even veteran Stata users. To lessen these challenges, I provide some useful tips and guard against some pitfalls by featuring two official Stata routines: the string() function and its elaborated wrapper, the tostring command. I illustrate how to use these two routines to address the difficulties caused by identifier variables in managing and analyzing data from private institutions and U.S. government agencies.
[1] Nicholas J. Cox,et al. Speaking Stata: On Numbers and Strings , 2002 .
[2] Nicholas J. Cox. Stata Tip 33: Sweet Sixteen: Hexadecimal Formats and Precision Problems , 2006 .
[3] Jeremy B. Wernow,et al. Changing numeric variables to string , 2001 .
[4] Nicholas J. Cox. Speaking Stata: Fun and Fluency with Functions , 2011 .
[5] P. Wilner Jeanty. Managing the U.S. Census 2000 and World Development Indicators databases for statistical analysis in Stata , 2011 .