Under‐reported data analysis with INAR‐hidden Markov chains

In this work, we deal with correlated under-reported data through INAR(1)-hidden Markov chain models. These models are very flexible and can be identified through its autocorrelation function, which has a very simple form. A naïve method of parameter estimation is proposed, jointly with the maximum likelihood method based on a revised version of the forward algorithm. The most-probable unobserved time series is reconstructed by means of the Viterbi algorithm. Several examples of application in the field of public health are discussed illustrating the utility of the models. Copyright © 2016 John Wiley & Sons, Ltd.

[1]  Dimitris Karlis,et al.  Some properties of multivariate INAR(1) processes , 2013, Comput. Stat. Data Anal..

[2]  David R. Hunter,et al.  mixtools: An R Package for Analyzing Mixture Models , 2009 .

[3]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[4]  J. Sobel,et al.  Global Occurrence of Infant Botulism, 1976–2006 , 2008, Pediatrics.

[5]  Mohamed Alosh,et al.  FIRST‐ORDER INTEGER‐VALUED AUTOREGRESSIVE (INAR(1)) PROCESS , 1987 .

[6]  Christian H. Weiß,et al.  Thinning operations for modeling time series of counts—a survey , 2008 .

[7]  J. Rocourt,et al.  The present state of foodborne disease in OECD countries , 2003 .

[8]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[9]  Christian Gourieroux,et al.  Heterogeneous INAR(1) model with application to car insurance , 2004 .

[10]  S. Arendt,et al.  Reporting of Foodborne Illness by U.S. Consumers and Healthcare Professionals , 2013, International journal of environmental research and public health.

[11]  Tsun-Jen Cheng,et al.  Global Magnitude of Reported and Unreported Mesothelioma , 2011, Environmental health perspectives.

[12]  David Moriña,et al.  A statistical model for hospital admissions caused by seasonal diseases , 2011, Statistics in medicine.

[13]  Joseph C. Gardiner,et al.  How Much Work-Related Injury and Illness is Missed By the Current National Surveillance System? , 2006, Journal of occupational and environmental medicine.

[14]  C. Lai,et al.  First‐order integer valued AR processes with zero inflated poisson innovations , 2012 .

[15]  Magda Monteiro,et al.  Integer-valued autoregressive processes with periodic structure , 2010 .

[16]  Christian H. Weiß,et al.  Compound Poisson INAR(1) processes: Stochastic properties and testing for overdispersion , 2014, Comput. Stat. Data Anal..

[17]  Paul H. C. Eilers,et al.  Twenty years of P-splines , 2015 .

[18]  K. Straif,et al.  Peritoneal mesothelioma in Italy: Trends and geography of mortality and incidence. , 2015, American journal of industrial medicine.

[19]  Michael Höhle,et al.  Estimating the under-reporting of norovirus illness in Germany utilizing enhanced awareness of diarrhoea during a large outbreak of Shiga toxin-producing E. coli O104:H4 in 2011 – a time series analysis , 2014, BMC Infectious Diseases.

[20]  Robert C. Jung,et al.  Binomial thinning models for integer time series , 2006 .

[21]  Darshan B Roy,et al.  Primary Peritoneal Mesothelioma Resulting in Small Bowel Obstruction: A Case Report and Review of Literature , 2015, The American journal of case reports.

[22]  Y. Samant,et al.  Work‐related skin diseases in Norway may be underreported: data from 2000 to 2013 , 2015, Contact dermatitis.

[23]  F. Galateau-Sallé,et al.  Digestive cancers and occupational asbestos exposure: incidence study in a cohort of asbestos plant workers , 2015, Occupational and Environmental Medicine.

[24]  M Cardinal,et al.  On the application of integer-valued time series models for the analysis of disease incidence. , 1999, Statistics in medicine.

[25]  Rainer Winkelmann,et al.  Markov chain Monte Carlo analysis of underreported count data with an application to worker absenteeism , 1996 .

[26]  Thomas Stauffer Larsen,et al.  Nye sygdomsmarkører ved de kroniske myeloproliferative neoplasier , 2015 .

[27]  Christian H. Weiß,et al.  Thinning-based models in the analysis of integer-valued time series: a review , 2015 .

[28]  M. Kogevinas,et al.  Mesothelioma mortality in men: trends during 1977–2001 and projections for 2002–2016 in Spain , 2007, Occupational and Environmental Medicine.

[29]  Brendan McCabe,et al.  Forecasting discrete valued low count time series , 2004 .

[30]  Michael Höhle,et al.  Bayesian nowcasting during the STEC O104:H4 outbreak in Germany, 2011 , 2014, Biometrics.

[31]  D. Cox,et al.  A General Definition of Residuals , 1968 .

[32]  John Iskander,et al.  CDC Grand Rounds: Reducing the Burden of HPV-Associated Cancer and Disease , 2014, MMWR. Morbidity and mortality weekly report.

[33]  Atanu Biswas,et al.  Modelling and coherent forecasting of zero-inflated count time series , 2014 .