A smooth nonparametric approach to determining cut-points of a continuous scale

The problem of determining cut-points of a continuous scale according to an establish categorical scale is often encountered in practice for the purposes such as making diagnosis or treatment recommendation, determining study eligibility, or facilitating interpretations. A general analytic framework was recently proposed for assessing optimal cut-points defined based on some pre-specified criteria. However, the implementation of the existing nonparametric estimators under this framework and the associated inferences can be computationally intensive when more than a few cut-points need to be determined. To address this important issue, a smoothing-based modification of the current method is proposed and is found to substantially improve the computational speed as well as the asymptotic convergence rate. Moreover, a plug-in type variance estimation procedure is developed to further facilitate the computation. Extensive simulation studies confirm the theoretical results and demonstrate the computational benefits of the proposed method. The practical utility of the new approach is illustrated by an application to a mental health study.

[1]  Limin Peng,et al.  A general approach to categorizing a continuous scale according to an ordinal outcome. , 2016, Journal of statistical planning and inference.

[2]  D. Pollard,et al.  Simulation and the Asymptotics of Optimization Estimators , 1989 .

[3]  Gengsheng Qin,et al.  New Nonparametric Confidence Intervals for the Youden Index , 2012, Journal of biopharmaceutical statistics.

[4]  Glenn Heller Smoothed Rank Regression With Censored Data , 2007 .

[5]  Andrew L Baughman,et al.  Mixture model analysis for establishing a diagnostic cut‐off point for pertussis antibody levels , 2006, Statistics in medicine.

[6]  Douglas G. Altman,et al.  Categorizing Continuous Variables , 2005 .

[7]  B. Reiser,et al.  Estimation of the Youden Index and its Associated Cutoff Point , 2005, Biometrical journal. Biometrische Zeitschrift.

[8]  W. Zung A SELF-RATING DEPRESSION SCALE. , 1965, Archives of general psychiatry.

[9]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[10]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[11]  B. Bassler Letters to the Editor , 2011, Journal of microbiology & biology education.

[12]  Qi Li,et al.  Nonparametric Econometrics: Theory and Practice , 2006 .

[13]  Gengsheng Qin,et al.  Improved confidence intervals for the sensitivity at a fixed level of specificity of a continuous-scale diagnostic test. , 2005, Statistics in medicine.

[14]  R. Feynman,et al.  The Theory of a general quantum system interacting with a linear dissipative system , 1963 .

[15]  Carl Tim Kelley,et al.  Iterative methods for optimization , 1999, Frontiers in applied mathematics.

[16]  Enrique F Schisterman,et al.  Youden Index and the optimal threshold for markers with mass at zero , 2008, Statistics in medicine.

[17]  D. Sheehan,et al.  The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. , 1998, The Journal of clinical psychiatry.

[18]  R. Storn,et al.  Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[19]  James Ir,et al.  Estimation of the mixing proportion in a mixture of two normal distributions from simple, rapid measurements. , 1978 .

[20]  Dongliang Wang,et al.  Smoothed empirical likelihood for the Youden index , 2017, Comput. Stat. Data Anal..

[21]  W. Sauerbrei,et al.  Dangers of using "optimal" cutpoints in the evaluation of prognostic factors. , 1994, Journal of the National Cancer Institute.

[22]  B. Silverman,et al.  Weak and Strong Uniform Consistency of the Kernel Estimate of a Density and its Derivatives , 1978 .

[23]  Xinhua Liu,et al.  Classification accuracy and cut point selection , 2012, Statistics in medicine.

[24]  P. Hall,et al.  Bandwidth selection for the smoothing of distribution functions , 1998 .

[25]  Lili Tian,et al.  A new diagnostic accuracy measure and cut-point selection criterion , 2017, Statistical methods in medical research.

[26]  Anne-Laure Boulesteix,et al.  Maximally Selected Chi‐Square Statistics and Binary Splits of Nominal Variables , 2006, Biometrical journal. Biometrische Zeitschrift.

[27]  Lei Pang,et al.  Variance estimation in censored quantile regression via induced smoothing , 2012, Comput. Stat. Data Anal..

[28]  M. Pepe The Statistical Evaluation of Medical Tests for Classification and Prediction , 2003 .

[29]  N. Perkins,et al.  The inconsistency of "optimal" cutpoints obtained using two criteria based on the receiver operating characteristic curve. , 2006, American journal of epidemiology.

[30]  Yong Zhou,et al.  Smoothed Rank Regression for the Accelerated Failure Time Competing Risks Model with Missing Cause of Failure. , 2019, Statistica Sinica.

[31]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[32]  Berthold Lausen,et al.  Maximally selected rank statistics , 1992 .

[33]  Rebecca A. Betensky,et al.  Maximally Selected x2 Statistics for k× 2 Tables , 1999 .

[34]  B. M. Brown,et al.  Induced smoothing for rank regression with censored survival times , 2007, Statistics in medicine.

[35]  Lili Tian,et al.  Exact confidence interval estimation for the Youden index and its corresponding optimal cut-point , 2012, Comput. Stat. Data Anal..

[36]  M. Schumacher,et al.  Evaluating the effect of optimized cutoff values in the assessment of prognostic factors , 1996 .

[37]  D. Siegmund,et al.  Maximally Selected Chi Square Statistics , 1982 .

[38]  Madhu Mazumdar,et al.  Categorizing a prognostic variable: review of methods, code for easy implementation and applications to decision-making about cancer treatments. , 2000 .

[39]  Achim Zeileis,et al.  Generalized Maximally Selected Statistics , 2008, Biometrics.

[40]  K. S. Banerjee Generalized Inverse of Matrices and Its Applications , 1973 .

[41]  Bernard W. Silverman,et al.  Addendum to Weak and Strong Uniform Consistency of the Kernel Estimate of a Density and Its Derivatives , 1980 .

[42]  Yong Zhou,et al.  Composite Estimating Equation Method for the Accelerated Failure Time Model with Length‐biased Sampling Data , 2016 .

[43]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[44]  C Brownie,et al.  Selecting a screening cut-off point or diagnostic criterion for comparing prevalences of disease. , 1984, Biometrics.

[45]  Vikram Patel,et al.  Depression, chronic diseases, and decrements in health: results from the World Health Surveys , 2007, The Lancet.