Dealing with data conflicts in statistical inference of population assessment models that integrate information from multiple diverse data sets

Abstract Contemporary fisheries stock assessments often use multiple diverse data sets to extract as much information as possible about biological and fishery processes. However, models are, by definition, simplifications of reality and, therefore, misspecified. Model misspecification can cause degradation of results when multiple data sets are analyzed simultaneously. The process, observation, and sampling components of the model must all be, at least, approximately correct to minimize bias. Unfortunately, even the basic processes that are usually considered well understood (e.g., growth and selectivity) are misspecified in most, if not all, stock assessments. These misspecified processes, in combination with use of composition data, result in biased estimates of absolute abundance and abundance trends, which are often evident as “data conflicts.” This is compounded by over-weighting of composition data in many assessments owing to misuse of data-weighting approaches. The ‘law of conflicting data’ states that since data are facts, conflicting data implies model misspecification, but must be interpreted in the context of random sampling error. Down-weighting (or dropping) conflicting data is not necessarily appropriate because it may not resolve the model misspecification. Model misspecification and process variation can be accounted for in the variance parameters of the likelihoods (sampling error), but it is unclear when, or even if, this is appropriate. The appropriate method to deal with data conflicts depends on whether it is caused by random sampling error, process variation, observation model misspecification, or misspecification of the system (dynamics) model. Diagnostic approaches are urgently needed to evaluate goodness of fit and to identify model misspecification. We recommend external estimation of the sampling error variance in likelihood functions, modelling process variation in integrated models, and internal estimation of the standard deviation of the process variation. The required statistical framework is computationally intensive, but practical approximations are available, computational algorithms are being improved, and computer power is increasing. We provide a framework for model development that identifies and corrects model misspecification and illustrate the framework, using simulated data.

[1]  FrancisR.I.C. Chris,et al.  Data weighting in statistical fisheries stock assessment models , 2011 .

[2]  Hiroshi Okamura,et al.  Data conflict caused by model mis-specification of selectivity in an integrated stock assessment model and its potential effects on stock status estimation , 2014 .

[3]  John R. Skalski,et al.  Variance estimation in integrated assessment models and its importance for hypothesis testing , 2007 .

[4]  Ian G. Taylor,et al.  A comparison of parametric, semi-parametric, and non-parametric approaches to selectivity in age-structured assessment models , 2014 .

[5]  R. Francis,et al.  Estimating fish growth for stock assessments using both age–length and tagging-increment data , 2016 .

[6]  Richard D. Methot,et al.  Can steepness of the stock–recruitment relationship be estimated in fishery stock assessment models? , 2012 .

[7]  Mark N. Maunder,et al.  Selectivity : theory , estimation , and application in fishery stock assessment models Workshop Series Report 1 June 2013 , 2013 .

[8]  Keisuke Satoh,et al.  Get the biology right, or use size-composition data at your own risk , 2017 .

[9]  David A. Fournier,et al.  A General Theory for Analyzing Catch at Age Data , 1982 .

[10]  Malcolm Haddon,et al.  Modelling and quantitative methods in fisheries , 2001 .

[11]  J. Thorson,et al.  Relative magnitude of cohort, age, and year effects on size at age of exploited marine fishes , 2016 .

[12]  C. Phillip Goodyear,et al.  Variability of Fishing Mortality by Age: Consequences for Maximum Sustainable Yield , 1996 .

[13]  Mark N. Maunder,et al.  The relationship between fishing methods, fisheries management and the estimation of maximum sustainable yield , 2002 .

[14]  Anders Nielsen,et al.  Estimation of time-varying selectivity in stock assessments using state-space models , 2014 .

[15]  David A. Fournier,et al.  A spatially disaggregated, length-based, age-structured population model of yellowfin tuna (Thunnus albacares) in the western and central Pacific Ocean , 2001 .

[16]  R. Sharma,et al.  Investigating the influence of length-frequency data on the stock assessment of Indian Ocean bigeye tuna , 2014 .

[17]  Effect of data weighting on the mature male biomass estimate for Alaskan golden king crab , 2017 .

[18]  Russell B. Millar,et al.  Bayesian state-space modeling of age-structured data: fitting a model is just the beginning , 2000 .

[19]  C. Goodyear,et al.  FITTING SURPLUS-PRODUCTION MODELS WITH MISSING CATCH DATA USING ASPIC: EVALUATION WITH SIMULATED DATA ON ATLANTIC BLUE MARLIN , 2001 .

[20]  James N. Ianelli,et al.  Bayesian stock assessment using catch-age data and the sampling - importance resampling algorithm , 1997 .

[21]  R. Day,et al.  Accounting for cohort-specific variable growth in fisheries stock assessments: A case study from south-eastern Australia , 2013 .

[22]  R.I.C. Chris Francis,et al.  Growth in age-structured stock assessment models , 2016 .

[23]  André E. Punt,et al.  Growth: Theory, estimation, and application in fishery stock assessment models , 2016 .

[24]  Daniel R. Goethel,et al.  Incorporating Spatial Structure in Stock Assessment: Movement Modeling in Marine Fish Population Dynamics , 2011 .

[25]  David B. Sampson,et al.  Using areas-as-fleets selectivity to model spatial fishing: Asymptotic curves are unlikely under equilibrium conditions , 2014 .

[26]  Richard D. Methot,et al.  Adjusting for bias due to variability of estimated recruitments in fishery assessment models , 2011 .

[27]  James N. Ianelli,et al.  Chapter 2: Assessment of the Pacific Cod Stock in the Eastern Bering Sea and Aleutian Islands Area , 2006 .

[28]  M. Maunder,et al.  Evaluation of using random-at-length observations and an equilibrium approximation of the population age structure in fitting the von Bertalanffy growth function , 2016 .

[29]  Richard D. Methot,et al.  A Simulation-Based Method to Determine Model Misspecification: Examples using Natural Mortality and Population Dynamics Models , 2011 .

[30]  Richard D. Methot,et al.  Estimating Natural Mortality in Stock Assessment Applications , 2011 .

[31]  André E. Punt,et al.  Review of integrated size-structured models for stock assessment of hard-to-age crustacean and mollusc species , 2013 .

[33]  André E. Punt,et al.  Use of multiple selectivity patterns as a proxy for spatial structure , 2014 .

[34]  André E. Punt,et al.  Some insights into data weighting in integrated stock assessments , 2017 .

[35]  Mark N. Maunder,et al.  Is down-weighting composition data adequate for dealing with model misspecification, or do we need to fix the model? , 2017 .

[36]  Kevin R. Piner,et al.  Can diagnostic tests help identify model misspecification in integrated stock assessments , 2017 .

[37]  B SampsonDavid,et al.  A spatial model for fishery age-selection at the population level , 2011 .

[38]  Steven J. D. Martell,et al.  A historical review of selectivity approaches and retrospective patterns in the Pacific halibut stock assessment , 2014 .

[39]  M. Maunder,et al.  Influence of model misspecification, temporal changes, and data weighting in stock assessment models: Application to swordfish (Xiphias gladius) in the Indian Ocean , 2015 .

[40]  Samuel B. Truesdell,et al.  Estimating multinomial effective sample size in catch-at-age and catch-at-size models , 2017 .

[41]  R. Methot,et al.  Use of likelihood profiling over a global scaling parameter to structure the population dynamics model: An example using blue marlin in the Pacific Ocean , 2014 .

[42]  Richard D. Methot,et al.  Stock synthesis: A biological and statistical framework for fish stock assessment and fishery management , 2013 .

[43]  M. Maunder,et al.  Effects of age-based movement on the estimation of growth assuming random-at-age or random-at-length data. , 2017, Journal of fish biology.

[44]  D. Howell,et al.  An overview of Gadget, the Globally applicable Area-Disaggregated General Ecosystem Toolbox , 2004 .

[45]  R. Hilborn,et al.  Fisheries stock assessment and decision analysis: the Bayesian approach , 1997, Reviews in Fish Biology and Fisheries.

[46]  Kevin R. Piner,et al.  Contemporary fisheries stock assessment: many issues still remain , 2015 .

[47]  Terrance J. Quinn,et al.  RUMINATIONS ON THE DEVELOPMENT AND FUTURE OF POPULATION DYNAMICS MODELS IN FISHERIES , 2003 .

[48]  Terrance J. Quinn,et al.  Quantitative Fish Dynamics , 1999 .

[49]  Ray Hilborn,et al.  Analysis of contradictory data sources in fish stock assessment , 1993 .

[50]  Yong Chen,et al.  Developing robust frequentist and Bayesian fish stock assessment methods , 2003 .

[51]  Mark N. Maunder,et al.  Estimation of recruitment in catch-at-age models , 2003 .

[52]  M. Maunder,et al.  Selectivity's distortion of the production function and its influence on management advice from surplus production models , 2014 .

[53]  Searching for M: Is there more information about natural mortality in stock assessments than we realize? , 2017 .

[54]  Cole C. Monnahan,et al.  Implications of process error in selectivity for approaches to weighting compositional data in fisheries stock assessments , 2017 .

[55]  Richard D. Methot,et al.  Model-based estimates of effective sample size in stock assessment models using the Dirichlet-multinomial distribution , 2017 .

[56]  James N. Ianelli,et al.  Computers in Fisheries Population Dynamics , 2009 .

[57]  Byron J. T. Morgan,et al.  Goodness‐of‐fit of integrated population models using calibrated simulation , 2014 .

[58]  A. Punt,et al.  Estimating growth within size-structured fishery stock assessments: What is the state of the art and what does the future look like? , 2016 .

[59]  Estimation of growth within Stock Synthesis models: Management implications when using length-composition data , 2016 .

[60]  Hans J. Skaug,et al.  Automatic approximation of the marginal likelihood in non-Gaussian hierarchical models , 2006, Comput. Stat. Data Anal..

[61]  U. H. Thygesen,et al.  Estimating spatio-temporal dynamics of size-structured populations , 2014 .

[62]  Ian J. Stewart,et al.  Towards defining good practices for modeling time-varying selectivity , 2014 .

[63]  R.I.C. Chris Francis Revisiting data weighting in fisheries stock assessment models , 2017 .

[64]  C. Walters,et al.  Quantitative fisheries stock assessment: Choice, dynamics and uncertainty , 2004, Reviews in Fish Biology and Fisheries.

[65]  Mark N. Maunder,et al.  Review and evaluation of likelihood functions for composition data in stock-assessment models: Estimating the effective sample size , 2011 .

[66]  John Sibert,et al.  AD Model Builder: using automatic differentiation for statistical inference of highly parameterized complex nonlinear models , 2012, Optim. Methods Softw..

[67]  Sean C. Anderson,et al.  Looking in the rear-view mirror: bias and retrospective patterns in integrated, age-structured stock assessment models , 2015 .

[68]  J. Renwick,et al.  Quantifying annual variation in catchability for commercial and research fishing , 2003 .

[69]  Ian G. Taylor,et al.  An empirical weight-at-age approach reduces estimation bias compared to modeling parametric growth in integrated, statistical stock assessment models when growth is time varying , 2016 .

[70]  K. Piner,et al.  Evidence that the Migration of the Northern Subpopulation of Pacific Sardine (Sardinops sagax) off the West Coast of the United States Is Age-Based , 2016, PloS one.

[71]  M. Maunder,et al.  Evaluation of virgin recruitment profiling as a diagnostic for selectivity curve structure in integrated stock assessment models , 2014 .

[72]  D. Butterworth,et al.  Does selectivity matter? A fisheries management perspective , 2014 .

[73]  Paul J. Starr,et al.  Bayesian assessment of the SNA1 snapper (Pagrus auratus) stock on the north‐east coast of New Zealand , 2001 .

[74]  J. Thorson,et al.  Random effect estimation of time-varying factors in Stock Synthesis , 2014 .

[75]  R. Mohn,et al.  The retrospective problem in sequential population analysis: An investigation using cod fishery and simulated data , 1999 .

[76]  R.I.C. Chris Francis Replacing the multinomial in stock assessment models: A first step , 2014 .

[77]  Mark N. Maunder,et al.  Is it time to discard the Schaefer model from the stock assessment scientist's toolbox? , 2003 .

[78]  M. Maunder,et al.  A general framework for integrating environmental time series into stock assessment models: model description, simulation testing, and example , 2003 .

[79]  P. A. Larkin An Epitaph for the Concept of Maximum Sustained Yield , 1977 .

[80]  Grant G. Thompson,et al.  Confounding of Gear Selectivity and the Natural Mortality Rate in Cases where the Former is a Nonmonotone Function of Age , 1994 .

[81]  F. Carvalho,et al.  Incorporating specific change points in catchability in fisheries stock assessment models: An alternative approach applied to the blue shark (Prionace glauca) stock in the south Atlantic Ocean , 2014 .

[82]  Mark Nicholas Maunder Integration of tagging and population dynamics models in fisheries stock assessment , 1998 .

[83]  K. Lorenzen Toward a new paradigm for growth modeling in fisheries stock assessments: Embracing plasticity and its consequences , 2016 .

[84]  D. Sampson Fishery selection and its relevance to stock assessment and fishery management , 2014 .

[85]  André E. Punt,et al.  A review of integrated analysis in fisheries stock assessment , 2013 .

[86]  Mark N. Maunder,et al.  Including parameter uncertainty in forward projections of computationally intensive statistical population dynamic models , 2006 .

[87]  Richard D. Methot,et al.  Estimating natural mortality within a fisheries stock assessment model: An evaluation using simulation analysis based on twelve stock assessments , 2011 .

[88]  David A. Fournier,et al.  MULTIFAN-CL: a length-based, age-structured model for fisheries stock assessment, with application to South Pacific albacore, Thunnus alalunga , 1998 .