Multistate recursively imputed survival trees for time-to-event data analysis: an application to AIDS and mortality post-HIV infection data

AbstractsBackgroundThis study aimed to introduce recursively imputed survival trees into multistate survival models (MSRIST) to analyze these types of data and to identify the prognostic factors influencing the disease progression in patients with intermediate events. The proposed method is fully nonparametric and can be used for estimating transition probabilities.MethodsA general algorithm was provided for analyzing multi-state data with a focus on the illness-death and progressive multi-state models. The model considered both beyond Markov and Non-Markov settings. We also proposed a multi-state random survival method (MSRSF) and compared their performance with the classical multi-state Cox model. We applied the proposed method to a dataset related to HIV/AIDS patients based on a retrospective cohort study extracted in Tehran from April 2004 to March 2014 consist of 2473 HIV-infected patients.ResultsThe results showed that MSRIST outperformed the classical multistate method using Cox Model and MSRSF in terms of integrated Brier score and concordance index over 500 repetitions. We also identified a set of important risk factors as well as their interactions on different states of HIV and AIDS progression.ConclusionsThere are different strategies for modelling the intermediate event. We adapted two newly developed data mining technique (RSF and RIST) for multistate models (MSRSF and MSRIST) to identify important risk factors in different stages of the diseases. The methods can capture any complex relationship between variables and can be used as a useful tool for identifying important risk factors in different states of this disease.

[1]  Hemant Ishwaran,et al.  Identifying Important Risk Factors for Survival in Patient With Systolic Heart Failure Using Random Survival Forests , 2011, Circulation. Cardiovascular quality and outcomes.

[2]  Anneke Grobler,et al.  Integration of antiretroviral therapy with tuberculosis treatment. , 2011, The New England journal of medicine.

[3]  Udaya B. Kogalur,et al.  Consistency of Random Survival Forests. , 2008, Statistics & probability letters.

[4]  M. Mohraz,et al.  Predictors of tuberculosis and human immunodeficiency virus co-infection: a case-control study , 2014, Epidemiology and health.

[5]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[6]  R. Bajpai,et al.  Demographic Correlates of Survival in Adult HIV Patients Registered at ART Centers in Andhra Pradesh, India: A Retrospective Cohort Study , 2014 .

[7]  F. Harrell,et al.  Prognostic/Clinical Prediction Models: Multivariable Prognostic Models: Issues in Developing Models, Evaluating Assumptions and Adequacy, and Measuring and Reducing Errors , 2005 .

[8]  S. Cole,et al.  Effect of tuberculosis on the survival of HIV-infected men in a country with low tuberculosis incidence , 2008, AIDS.

[9]  O Borgan,et al.  Covariate Adjustment of Event Histories Estimated from Markov Chains: The Additive Approach , 2001, Biometrics.

[10]  David R. Cox,et al.  The Theory of Stochastic Processes , 1967, The Mathematical Gazette.

[11]  Torsten Hothorn,et al.  Bagging survival trees , 2002, Statistics in medicine.

[12]  J. Kingman THE THEORY OF STOCHASTIC PROCESSES 2 , 1976 .

[13]  Hemant Ishwaran,et al.  Random survival forests for competing risks. , 2014, Biostatistics.

[14]  Jalal Poorolajal,et al.  Survival rate of AIDS disease and mortality in HIV-infected patients in Hamadan, Iran: a registry-based retrospective cohort study (1997–2011) , 2013, International journal of STD & AIDS.

[15]  Jalal Poorolajal,et al.  A comparative study of support vector machines and artificial neural networks for predicting precipitation in Iran , 2014, Theoretical and Applied Climatology.

[16]  S. Vong,et al.  Earlier versus later start of antiretroviral therapy in HIV-infected adults with tuberculosis. , 2011, The New England journal of medicine.

[17]  V.B.L.S. Lammens Estimating the prediction error in multistate models , 2014 .

[18]  R H Jones,et al.  Multi-state models and diabetic retinopathy. , 1995, Statistics in medicine.

[19]  Maryam FARHADIAN,et al.  Identifying Important Risk Factors for Survival in Kidney Graft Failure Patients Using Random Survival Forests , 2016, Iranian journal of public health.

[20]  Juan Eloy Ruiz-Castro,et al.  Non‐homogeneous Markov models in the analysis of survival after breast cancer , 2001 .

[21]  J. Kalbfleisch,et al.  The Analysis of Panel Data under a Markov Assumption , 1985 .

[22]  Michael R Kosorok,et al.  Recursively Imputed Survival Trees , 2012, Journal of the American Statistical Association.

[23]  B. Gazzard,et al.  CD4 Cell Count and the Risk of AIDS or Death in HIV-Infected Adults on Combination Antiretroviral Therapy with a Suppressed Viral Load: A Longitudinal Cohort Study from COHERE , 2012, PLoS medicine.

[24]  S Esbjerg,et al.  Multi-state models for bleeding episodes and mortality in liver cirrhosis. , 2000, Statistics in medicine.

[25]  Gerhard Tutz,et al.  A survival tree method for the analysis of discrete event times in clinical and epidemiological studies , 2016, Statistics in medicine.

[26]  Carmen Cadarso-Suárez,et al.  Multi-state models for the analysis of time-to-event data , 2009, Statistical methods in medical research.

[27]  H Putter,et al.  Tutorial in biostatistics: competing risks and multi‐state models , 2007, Statistics in medicine.

[28]  P. Andersen,et al.  Multistate models in survival analysis: a study of nephropathy and mortality in diabetes. , 1988, Statistics in medicine.

[29]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[30]  E. Waldman,et al.  Predictors of progression to AIDS after HIV infection diagnosis in the pre- and post-HAART eras in a Brazilian AIDS-free cohort. , 2014, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[31]  Niels Keiding,et al.  Non- and semi-parametric estimation of transition probabilities from censored observation of a non-homogeneous Markov process , 1991 .

[32]  J. McNeil,et al.  HIV treatment outcomes among people who inject drugs in Victoria, Australia , 2014, BMC Infectious Diseases.

[33]  C. Begg,et al.  A study of the use of the probability-of-being-in-response function as a summary of tumor response data. , 1982, Biometrics.

[34]  Hein Putter,et al.  mstate: An R Package for the Analysis of Competing Risks and Multi-State Models , 2011 .

[35]  Ren Johansen An Empirical Transition Matrix for Non-homogeneous Markov Chains Based on Censored Observations , 1978 .

[36]  S. Lawn,et al.  Antiretroviral Therapy for Control of the HIV-associated Tuberculosis Epidemic in Resource-Limited Settings , 2009, Clinics in chest medicine.

[37]  P. Bühlmann,et al.  Survival ensembles. , 2006, Biostatistics.

[38]  N. Keiding,et al.  Multi-state models for event history analysis , 2002, Statistical methods in medical research.

[39]  Sabine Van Huffel,et al.  Survival SVM: a practical scalable algorithm , 2008, ESANN.

[40]  John P. Klein,et al.  Additive hazards Markov regression models illustrated with bone marrow transplant data , 2005 .

[41]  John P. Klein,et al.  Identification by random forest method of HLA class I amino acid substitutions associated with lower survival at day 100 in unrelated donor hematopoietic cell transplantation , 2011, Bone Marrow Transplantation.

[42]  Stefan Dietrich,et al.  Investigation of the machine learning method Random Survival Forest as an exploratory analysis tool for the identification of variables associated with disease risks in complex survival data , 2016 .

[43]  Daniel Commenges,et al.  MKVPCI: a computer program for Markov models with piecewise constant intensities and covariates , 2001, Comput. Methods Programs Biomed..

[44]  P. G. Choe,et al.  Incidence and Risk Factors of Tuberculosis in Patients with Human Immunodeficiency Virus Infection , 2013, Journal of Korean medical science.

[45]  Martin Schumacher,et al.  Competing Risks and Multistate Models with R , 2011 .

[46]  Martin Schumacher,et al.  Multi-state model for studying an intermediate event using time-dependent covariates: application to breast cancer , 2013, BMC Medical Research Methodology.

[47]  J. Klein,et al.  MODELING MULTISTATE SURVIVAL ILLUSTRATED IN BONE MARROW TRANSPLANTATION , 2007 .

[48]  J P Klein,et al.  Plotting summary predictions in multistate survival models: probabilities of relapse and death in remission for bone marrow transplantation patients. , 1993, Statistics in medicine.