Source-Sink Reconstruction Through Regularized Multicomponent Regression Analysis—With Application to Assessing Whether North Sea Cod Larvae Contributed to Local Fjord Cod in Skagerrak

The problem of reconstructing the source-sink dynamics arises in many biological systems. Our research is motivated by marine applications where newborns are passively dispersed by ocean currents from several potential spawning sources to settle in various nursery regions that collectively constitute the sink. The reconstruction of the sparse source-sink linkage pattern, that is, to identify which sources contribute to which regions in the sink, is a challenging task in marine ecology. We derive a constrained nonlinear multicomponent regression model for source-sink reconstruction, which is capable of simultaneously selecting important linkages from the sources to the sink regions and making inference about the unobserved spawning activities at the sources. A sparsity-inducing and nonnegativity-constrained regularization approach is developed for model estimation, and theoretically we show that our estimator enjoys the oracle properties. The empirical performance of the method is investigated via simulation studies mimicking real ecological applications. We examine the transport hypothesis that Atlantic cod larvae were transported by sea currents from the North Sea to a few exposed coastal fjords along the Norwegian Skagerrak. Our findings of the spawning date distribution is consistent with results from previous studies, and the proposed approach for the first time provides valid statistical support for the larval drift conjecture. Supplementary materials for this article are available online.

[1]  H. Leeb,et al.  Sparse Estimators and the Oracle Property, or the Return of Hodges' Estimator , 2007, 0704.1466.

[2]  N. Stenseth,et al.  MODELING PULSE DISTURBANCE IMPACT ON COD POPULATION DYNAMICS: THE 1988 ALGAL BLOOM OF SKAGERRAK, NORWAY , 2003 .

[3]  Wenjiang J. Fu,et al.  Asymptotics for lasso-type estimators , 2000 .

[4]  Jianhua Z. Huang,et al.  Biclustering via Sparse Singular Value Decomposition , 2010, Biometrics.

[5]  R. Tibshirani,et al.  On the “degrees of freedom” of the lasso , 2007, 0712.0881.

[6]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[7]  A. Izenman Reduced-rank regression for the multivariate linear model , 1975 .

[8]  Neil H. Timm,et al.  Multivariate Reduced-Rank Regression , 1999, Technometrics.

[9]  Gene H. Golub,et al.  Generalized cross-validation as a method for choosing a good ridge parameter , 1979, Milestones in Matrix Computation.

[10]  Alan J. Izenman,et al.  Multivariate Reduced-Rank Regression , 2011, International Encyclopedia of Statistical Science.

[11]  N. Stenseth,et al.  NATURAL MORTALITY AND FISHING MORTALITY IN A COASTAL COD POPULATION: A RELEASE–RECAPTURE EXPERIMENT , 2001 .

[12]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[13]  Marte O. Kittilsen,et al.  ASSESSING THE EFFECTIVENESS OF RELEASING COD LARVAE FOR STOCK IMPROVEMENT WITH MONITORING DATA , 2003 .

[14]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[15]  Jinde Wang Asymptotics of least-squares estimators for constrained nonlinear regression , 1996 .

[16]  B. M. Pötscher,et al.  MODEL SELECTION AND INFERENCE: FACTS AND FICTION , 2005, Econometric Theory.

[17]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[18]  G. Reinsel,et al.  Multivariate Reduced-Rank Regression: Theory and Applications , 1998 .

[19]  T. W. Anderson Estimating Linear Restrictions on Regression Coefficients for Multivariate Normal Distributions , 1951 .

[20]  Benedikt M. Potscher,et al.  On the distribution of the adaptive LASSO estimator , 2008, 0801.4627.

[21]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[22]  K. DonaldW. Generalized Method of Moments Estimation When a Parameter Is on a Boundary , 1999 .

[23]  Kathrin Klamroth,et al.  Biconvex sets and optimization with biconvex functions: a survey and extensions , 2007, Math. Methods Oper. Res..

[24]  Kung-Sik Chan,et al.  Reduced rank regression via adaptive nuclear norm penalization. , 2012, Biometrika.

[25]  S. Lahiri,et al.  Bootstrapping Lasso Estimators , 2011 .

[26]  Hao Helen Zhang,et al.  ON THE ADAPTIVE ELASTIC-NET WITH A DIVERGING NUMBER OF PARAMETERS. , 2009, Annals of statistics.

[27]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[28]  Somratna Lertmaharith Degree of freedom , 2010 .

[29]  N. Stenseth,et al.  Estimating a Smooth Common Transfer Function with a Panel of Time Series - Inflow of Larvae Cod as an Example , 2012 .

[30]  N. Stenseth,et al.  Transport of North Sea cod larvae into the Skagerrak coastal populations , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[31]  Hannes Leeb,et al.  Performance Limits for Estimators of the Risk or Distribution of Shrinkage-Type Estimators, and Some General Lower Risk-Bound Results , 2002 .

[32]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[33]  Kung-Sik Chan,et al.  Reduced rank stochastic regression with a sparse singular value decomposition , 2012 .

[34]  ModelsThomas W. Yee Reduced-rank Vector Generalized Linear Models , 2000 .

[35]  Andrej Pázman,et al.  Nonlinear Regression , 2019, Handbook of Regression Analysis With Applications in R.

[36]  N. Stenseth,et al.  Ecological and genetic impact of Atlantic cod larval drift in the Skagerrak , 2006, Proceedings of the Royal Society B: Biological Sciences.