Parameter Identifiability and Redundancy: Theoretical Considerations

Background Models for complex biological systems may involve a large number of parameters. It may well be that some of these parameters cannot be derived from observed data via regression techniques. Such parameters are said to be unidentifiable, the remaining parameters being identifiable. Closely related to this idea is that of redundancy, that a set of parameters can be expressed in terms of some smaller set. Before data is analysed it is critical to determine which model parameters are identifiable or redundant to avoid ill-defined and poorly convergent regression. Methodology/Principal Findings In this paper we outline general considerations on parameter identifiability, and introduce the notion of weak local identifiability and gradient weak local identifiability. These are based on local properties of the likelihood, in particular the rank of the Hessian matrix. We relate these to the notions of parameter identifiability and redundancy previously introduced by Rothenberg (Econometrica 39 (1971) 577–591) and Catchpole and Morgan (Biometrika 84 (1997) 187–196). Within the widely used exponential family, parameter irredundancy, local identifiability, gradient weak local identifiability and weak local identifiability are shown to be largely equivalent. We consider applications to a recently developed class of cancer models of Little and Wright (Math Biosciences 183 (2003) 111–134) and Little et al. (J Theoret Biol 254 (2008) 229–238) that generalize a large number of other recently used quasi-biological cancer models. Conclusions/Significance We have shown that the previously developed concepts of parameter local identifiability and redundancy are closely related to the apparently weaker properties of weak local identifiability and gradient weak local identifiability—within the widely used exponential family these concepts largely coincide.

[1]  Modern algebraic theories , 1927 .

[2]  Carlos Alberto de Bragança Pereira,et al.  ON IDENTIFIABILITY OF PARAMETRIC STATISTICAL MODELS , 1994 .

[3]  Jean-Dominique Lebreton,et al.  Parameter Identifiability and Model Selection in Capture‐Recapture Models: A Numerical Approach , 1998 .

[4]  G. Picci Some Connections Between the Theory of Sufficient Statistics and the Identifiability Problem , 1977 .

[5]  P. Vineis,et al.  A stochastic carcinogenesis model incorporating multiple types of genomic instability fitted to colon cancer data. , 2008, Journal of theoretical biology.

[6]  M. Little,et al.  Are two mutations sufficient to cause cancer? Some generalizations of the two-mutation model of carcinogenesis of Moolgavkar, Venzon, and Knudson, and of the multistage model of Armitage and Doll. , 1995, Biometrics.

[7]  Byron J. T. Morgan,et al.  Solving problems in parameter redundancy using computer algebra , 2002 .

[8]  Martin A. Nowak,et al.  The role of chromosomal instability in tumor initiation , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[9]  John A. Nelder,et al.  Generalized linear models. 2nd ed. , 1993 .

[10]  W. Rudin Principles of mathematical analysis , 1964 .

[11]  P. Armitage,et al.  The age distribution of cancer and a multi-stage theory of carcinogenesis , 1954, British Journal of Cancer.

[12]  T. Rothenberg Identification in Parametric Models , 1971 .

[13]  W F Heidenreich,et al.  Some Properties of the Hazard Function of the Two‐Mutation Clonal Expansion Model , 1997, Risk analysis : an official publication of the Society for Risk Analysis.

[14]  M. Little,et al.  A stochastic carcinogenesis model incorporating genomic instability fitted to colon cancer data. , 2003, Mathematical biosciences.

[15]  W. Heidenreich,et al.  Parameter Identifiability and Redundancy in a General Class of Stochastic Carcinogenesis Models , 2009, PloS one.

[16]  Byron J. T. Morgan,et al.  Detecting parameter redundancy , 1997 .

[17]  Maria Pia Saccomani,et al.  DAISY: A new software tool to test global identifiability of biological and physiological systems , 2007, Comput. Methods Programs Biomed..

[18]  J. H. Schuenemeyer,et al.  Generalized Linear Models (2nd ed.) , 1992 .

[19]  Byron J. T. Morgan,et al.  Methods for investigating parameter redundancy , 2004 .

[20]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[21]  S. Moolgavkar,et al.  Two-event models for carcinogenesis: incidence curves for childhood and adult tumors☆ , 1979 .

[22]  M. Little,et al.  Stochastic modelling of colon cancer: is there a role for genomic instability? , 2006, Carcinogenesis.

[23]  W. Heidenreich On the parameters of the clonal expansion model , 1996, Radiation and environmental biophysics.

[24]  J A Jacquez,et al.  Parameter estimation: local identifiability of parameters. , 1990, The American journal of physiology.

[25]  C. Cobelli,et al.  Global identifiability of linear compartmental models-a computer algebra algorithm , 1998, IEEE Transactions on Biomedical Engineering.