Test–retest reliability measures for curve data: an overview with recommendations and supplementary code

The purpose of this paper is to provide an overview of available methods for reliability investigations when the outcome of interest is a curve. Curve data, or functional data, is commonly collected in biomechanical research in order to better understand different aspects of human movement. Using recent statistical developments, curve data can be analysed in its most detailed form, as functions. However, an overview of appropriate statistical methods for assessing reliability of curve data is lacking. A review of contemporary literature of reliability measures for curve data within the fields of biomechanics and statistics identified the following methods: coefficient of multiple correlation, functional limits of agreement, measures of distance and similarity, and integrated pointwise indices (an extension of univariate reliability measures to curve data, inclusive of Pearson correlation, intraclass correlation, and standard error of measurement). These methods are briefly presented, implemented (R-code available as supplementary material) and evaluated on simulated data to highlight advantages and disadvantages of the methods. Among the identified methods, the integrated intraclass correlation and standard error of measurement are recommended. These methods are straightforward to implement, enable results over the domain, and consider variation between individuals, which the other methods partly neglect.

[1]  R. Chopard,et al.  The Key Roles of Negative Pressure Breathing and Exercise in the Development of Interstitial Pulmonary Edema in Professional Male SCUBA Divers , 2018, Sports Medicine - Open.

[2]  A. Hrõbjartsson,et al.  Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed. , 2011, Journal of clinical epidemiology.

[3]  Joyce P Trost,et al.  Measurement and management of errors in quantitative gait data. , 2004, Gait & posture.

[4]  J. Richards,et al.  Knee joint coordination during single-leg landing in different directions , 2018, Sports biomechanics.

[5]  H. Müller,et al.  Dynamical Correlation for Multivariate Longitudinal Data , 2005 .

[6]  M. Stokes,et al.  Reliability of assessment tools in rehabilitation: an illustration of appropriate statistical analyses , 1998, Clinical rehabilitation.

[7]  Morgan Sangeux,et al.  Quantifying sources of variability in gait analysis. , 2017, Gait & posture.

[8]  Dara Meldrum,et al.  Test-retest reliability of three dimensional gait analysis: including a novel approach to visualising agreement of gait cycle waveforms with Bland and Altman plots. , 2014, Gait & posture.

[9]  Andrew Franklyn-Miller,et al.  Agreement between Inertia and Optical Based Motion Capture during the VU-Return-to-Play- Field-Test , 2020, Sensors.

[10]  Rick L. Lawrence,et al.  Combining functions and the closure principle for performing follow-up tests in functional analysis of variance , 2013, Comput. Stat. Data Anal..

[11]  Steffi L. Colyer,et al.  A Review of the Evolution of Vision-Based Motion Analysis and the Integration of Advanced Computer Vision Methods Towards Developing a Markerless System , 2018, Sports Medicine - Open.

[12]  A. Opheim,et al.  Evaluating the properties of the coefficient of multiple correlation (CMC) for kinematic gait data. , 2012, Journal of biomechanics.

[13]  H. K. Ramakrishnan,et al.  Repeatability of kinematic, kinetic, and electromyographic data in normal adult gait , 1989, Journal of orthopaedic research : official publication of the Orthopaedic Research Society.

[14]  S. Mathiassen,et al.  Motor variability in occupational health and performance. , 2012, Clinical biomechanics.

[15]  A. Cereatti,et al.  Assessment of Waveform Similarity in Clinical Gait Data: The Linear Fit Method , 2014, BioMed research international.

[16]  R. Trevethan,et al.  Intraclass correlation coefficients: clearing the air, extending some cautions, and making some requests , 2017, Health Services and Outcomes Research Methodology.

[17]  Terry K Koo,et al.  A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. , 2016, Journal Chiropractic Medicine.

[18]  H. Graham,et al.  The gait standard deviation, a single measure of kinematic variability. , 2016, Gait & posture.

[19]  Hans Kainz,et al.  Reliability of four models for clinical gait analysis. , 2017, Gait & posture.

[20]  Kieran Moran,et al.  Comparison of discrete-point vs. dimensionality-reduction techniques for describing performance-related aspects of maximal vertical jumping. , 2014, Journal of biomechanics.

[21]  Christine D. Pollard,et al.  Altered Lower Extremity Movement Variability in Female Soccer Players During Side-Step Cutting After Anterior Cruciate Ligament Reconstruction , 2015, The American journal of sports medicine.

[22]  A.J. Harrison,et al.  Functional data analysis of joint coordination in the development of vertical jump performance , 2007, Sports biomechanics.

[23]  C. Terwee,et al.  When to use agreement versus reliability measures. , 2006, Journal of clinical epidemiology.

[24]  T. Hewett,et al.  Reliability of landing 3D motion analysis: implications for longitudinal analyses. , 2007, Medicine and science in sports and exercise.

[25]  J. Marron,et al.  Statistics of time warpings and phase variations , 2014 .

[26]  Joy Conway,et al.  Professional articlesReliability: What is it, and how is it measured?Funding , 2000 .

[27]  C. Häger,et al.  Curve analyses reveal altered knee, hip, and trunk kinematics during drop-jumps long after anterior cruciate ligament rupture. , 2018, The Knee.

[28]  Siobhán Strike,et al.  Landmark registering waveform data improves the ability to predict performance measures. , 2018, Journal of biomechanics.

[29]  Conny Draper,et al.  Bivariate functional principal components analysis: considerations for use with multivariate movement signatures in sports biomechanics , 2019, Sports biomechanics.

[30]  K. McGraw,et al.  Forming inferences about some intraclass correlation coefficients. , 1996 .

[31]  J. Røislien,et al.  Functional limits of agreement: a method for assessing agreement between measurements of gait curves. , 2012, Gait & posture.

[32]  Hsing,et al.  Functional Data Analysis , 2015 .

[33]  A. Berchtold Test–retest: Agreement or reliability? , 2016 .

[34]  Conny Draper,et al.  Considerations for the use of functional principal components analysis in sports biomechanics: examples from on-water rowing , 2019, Sports biomechanics.

[35]  Andrew Harrison,et al.  Functional data analysis of knee joint kinematics in the vertical jump , 2006, Sports biomechanics.

[36]  Tom Chau,et al.  Managing variability in the summary and comparison of gait data , 2005, Journal of NeuroEngineering and Rehabilitation.

[37]  Julien Jacques,et al.  Functional data clustering: a survey , 2013, Advances in Data Analysis and Classification.

[38]  Andrew J Harrison,et al.  Functional data analysis of running kinematics in chronic Achilles tendon injury. , 2008, Medicine and science in sports and exercise.

[39]  R. Mizner,et al.  Changes in quadriceps and hamstring cocontraction following landing instruction in patients with anterior cruciate ligament reconstruction. , 2015, The Journal of orthopaedic and sports physical therapy.

[40]  J. Mort,et al.  Effect of age on the abundance and fragmentation of link protein of the human intervertebral disc , 1989, Journal of orthopaedic research : official publication of the Orthopaedic Research Society.

[41]  W G Hopkins,et al.  Measures of Reliability in Sports Medicine and Science , 2000, Sports medicine.

[42]  Jos Vanrenterghem,et al.  How reliable are knee kinematics and kinetics during side-cutting manoeuvres? , 2015, Gait & posture.

[43]  J. Weir Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM. , 2005, Journal of strength and conditioning research.

[44]  Simone Vantini,et al.  K-mean Alignment for Curve Clustering , 2010, Comput. Stat. Data Anal..

[45]  Clare E. Milner,et al.  Test-retest reliability of knee biomechanics during stop jump landings. , 2011, Journal of biomechanics.

[46]  Jay Hertel,et al.  Differences in hip-knee joint coupling during gait after anterior cruciate ligament reconstruction. , 2016, Clinical biomechanics.

[47]  Stephen T. Holgate,et al.  Reliability: What is it, and how is it measured? , 2000 .

[48]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[49]  Alessia Pini,et al.  One-leg hop kinematics 20 years following anterior cruciate ligament rupture: Data revisited using functional data analysis. , 2015, Clinical biomechanics.

[50]  G Atkinson,et al.  Statistical Methods For Assessing Measurement Error (Reliability) in Variables Relevant to Sports Medicine , 1998, Sports medicine.

[51]  D. Altman,et al.  STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT , 1986, The Lancet.

[52]  Helle Sørensen,et al.  An introduction with medical applications to functional data analysis , 2013, Statistics in medicine.

[53]  Jos Vanrenterghem,et al.  Vector field statistical analysis of kinematic and force trajectories. , 2013, Journal of biomechanics.

[54]  M. Morris,et al.  The reliability of three-dimensional kinematic gait measurements: a systematic review. , 2009, Gait & posture.

[55]  C. Häger,et al.  A novel standardised side hop test reliably evaluates landing mechanics for anterior cruciate ligament reconstructed persons and controls , 2018, Sports biomechanics.

[56]  George E Gorton,et al.  Assessment of the kinematic variability among 12 motion analysis laboratories. , 2009, Gait & posture.