Data matrix centering is an ever-present yet under-examined aspect of data analysis. Functional data analysis (FDA) often operates with a default of centering such that the vectors in one dimension have mean zero. We find that centering along the other dimension identifies a novel useful mode of variation beyond those familiar in FDA. We explore ambiguities in both matrix orientation and nomenclature. Differences between centerings and their potential interaction can be easily misunderstood. We propose a unified framework and new terminology for centering operations. We clearly demonstrate the intuition behind and consequences of each centering choice with informative graphics. We also propose a new direction energy hypothesis test as part of a series of diagnostics for determining which choice of centering is best for a data set. We explore the application of these diagnostics in several FDA settings.
[1]
J. S. Marron,et al.
Angle-based joint and individual variation explained
,
2017,
J. Multivar. Anal..
[2]
Zhengyuan Zhu,et al.
Singular Value Decomposition and Its Visualization
,
2007
.
[3]
H. Hotelling.
Relations Between Two Sets of Variates
,
1936
.
[4]
J Steve Marron,et al.
Overview of object oriented data analysis
,
2014,
Biometrical journal. Biometrische Zeitschrift.
[5]
Christopher R. Cabanski,et al.
SigFuge: single gene clustering of RNA-seq reveals differential isoform usage among cancer samples
,
2014,
Nucleic acids research.
[6]
Roman Rosipal,et al.
Overview and Recent Advances in Partial Least Squares
,
2005,
SLSFS.
[7]
Steven J. M. Jones,et al.
Comprehensive Molecular Portraits of Invasive Lobular Breast Cancer
,
2015,
Cell.