Variation Sharing: A Novel Numeric Solution to the Path Bias Underestimation Problem of PLS-Based SEM

The most fundamental problem currently associated with structural equation modeling employing the partial least squares method is that it does not properly account for measurement error, which often leads to path coefficient estimates that asymptotically converge to values of lower magnitude than the true values. This attenuation phenomenon affects applications in the field of business data analytics; and is in fact a characteristic of composite-based models in general, where latent variables are modeled as exact linear combinations of their indicators. The underestimation is often of around 10% per path in models that meet generally accepted measurement quality assessment criteria. The authors propose a numeric solution to this problem, which they call the factor-based partial least squares regression (FPLSR) algorithm, whereby variation lost in composites is restored in proportion to measurement error and amount of attenuation. Six variations of the solution are developed based on different reliability measures, and contrasted in Monte Carlo simulations. The authors' solution is nonparametric and seems to perform generally well with small samples and severely non-normal data.

[1]  Catherine E. Connelly,et al.  Understanding Nonmalicious Security Violations in the Workplace: A Composite Behavior Model , 2011, J. Manag. Inf. Syst..

[2]  Jan-Bernd Lohmöller,et al.  Latent Variable Path Modeling with Partial Least Squares , 1989 .

[3]  Anil K. Bera,et al.  Efficient tests for normality, homoscedasticity and serial independence of regression residuals , 1980 .

[4]  William Lewis,et al.  Does PLS Have Advantages for Small Sample Size or Non-Normal Data? , 2012, MIS Q..

[5]  S. Wold,et al.  Some recent developments in PLS modeling , 2001 .

[6]  R. Peterson,et al.  On the relationship between coefficient alpha and composite reliability. , 2013, The Journal of applied psychology.

[7]  Wynne W. Chin,et al.  Generating Non-normal Data for Simulation of Structural Equation Models Using Mattson's Method , 2002 .

[8]  Anil K. Bera,et al.  Efficient tests for normality, homoscedasticity and serial independence of regression residuals: Monte Carlo Evidence , 1981 .

[9]  L. Marlow,et al.  Validation of a measure of knowledge about human papillomavirus (HPV) using item response theory and classical test theory. , 2013, Preventive medicine.

[10]  S. Wilson,et al.  Market access, population density, and socioeconomic development explain diversity and functional group biomass of coral reef fish assemblages , 2012 .

[11]  S. Mattson,et al.  How to Generate Non-normal Data for Simulation of Structural Equation Models. , 1997, Multivariate behavioral research.

[12]  Marko Sarstedt,et al.  PLS-SEM: Indeed a Silver Bullet , 2011 .

[13]  Saggi Nevo,et al.  Organizational members' use of social networking sites and job performance: An exploratory study , 2013, Inf. Technol. People.

[14]  T. Dijkstra,et al.  Consistent Partial Least Squares for Nonlinear Structural Equation Models , 2013, Psychometrika.

[15]  Robert C. MacCallum,et al.  Representing sources of error in the common-factor model: Implications for theory and practice. , 1991 .

[16]  R. P. McDonald,et al.  Path Analysis with Composite Variables. , 1996, Multivariate behavioral research.

[17]  Michel Tenenhaus,et al.  PLS path modeling , 2005, Comput. Stat. Data Anal..

[18]  Rex B. Kline,et al.  Principles and Practice of Structural Equation Modeling , 1998 .

[19]  Ned Kock,et al.  Non-Normality Propagation among Latent Variables and Indicators in PLS-SEM Simulations , 2016 .

[20]  L. Cronbach Coefficient alpha and the internal structure of tests , 1951 .

[21]  Todd C. Headrick Statistical Simulation: Power Method Polynomials and Other Transformations , 2009 .

[22]  Ned Kock,et al.  PLS-based SEM Algorithms: The Good Neighbor Assumption, Collinearity, and Nonlinearity , 2015 .

[23]  George M. Marakas,et al.  Measurement of composite reliability in research using partial least squares: some issues and an alternative approach , 2013, DATB.

[24]  Ned Kock,et al.  Lateral Collinearity and Misleading Results in Variance-Based SEM: An Illustration and Recommendations , 2012, J. Assoc. Inf. Syst..

[25]  Ned Kock,et al.  Advanced Mediating Effects Tests, Multi-Group Analyses, and Measurement Model Assessments in PLS-Based SEM , 2014, Int. J. e Collab..

[26]  Milton Mayfield,et al.  The Role of Leader Motivating Language in Employee Absenteeism , 2009 .

[27]  J. Cazier,et al.  Data Analytics in the Hardwood Industry: The Impact of Automation and Optimization on Profits, Quality, and the Environment , 2014 .

[28]  J. Mayfield,et al.  THE EFFECTS OF LEADER MOTIVATING LANGUAGE ON SUBORDINATE PERFORMANCE AND SATISFACTION , 1998 .

[29]  P. Hackl,et al.  Robustness of partial least-squares method for estimating latent variable quality structures , 1999 .

[30]  T. Moon The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[31]  H. Wold Causal flows with latent variables: Partings of the ways in the light of NIPALS modelling , 1974 .

[32]  Christian P. Robert,et al.  Monte Carlo Statistical Methods , 2005, Springer Texts in Statistics.

[33]  J. Gastwirth,et al.  A robust modification of the Jarque–Bera test of normality , 2008 .

[34]  W. Reinartz,et al.  An Empirical Comparison of the Efficacy of Covariance-Based and Variance-Based SEM , 2009 .

[35]  Ned Kock,et al.  Using WarpPLS in E-collaboration Studies: An Overview of Five Main Analysis Steps , 2010, Int. J. e Collab..

[36]  Klaas Sijtsma,et al.  On the Use, the Misuse, and the Very Limited Usefulness of Cronbach’s Alpha , 2008, Psychometrika.

[37]  Jianjun Shi,et al.  A Systematic Approach for Business Data Analytics With a Real Case Study , 2015 .

[38]  Kenneth A. Bollen,et al.  Monte Carlo Experiments: Design and Implementation , 2001 .

[39]  Young M. Lee,et al.  Analytics for Smarter Buildings , 2014 .

[40]  R. Hambleton,et al.  Fundamentals of Item Response Theory , 1991 .

[41]  Hoda Ahmed Abdelhafez Big Data Technologies and Analytics: A Review of Emerging Solutions , 2014 .

[42]  Stephen M. Smith,et al.  Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm , 2001, IEEE Transactions on Medical Imaging.

[43]  Todd C. Headrick Fast fifth-order polynomial transforms for generating univariate and multivariate nonnormal distributions , 2002 .

[44]  Wenjing Huang PLSe: Efficient Estimators and Tests for Partial Least Square , 2013 .