In preparation for multivariate analysis, an exploratory study has been undertaken to investigate the relative position, separability, homogeneity and shape of three major disease groups, using data from a clinical chemical routine package. The data set consists of 46 hepatology patients, 50 nephrology patients and 46 cardiology patients, and the measured blood levels include 20 common clinical chemical routine assays. Missing value problems were avoided by deleting some of the variables and objects. A univariate analysis was used as the basis ofa rescaling of the data. Bivariate (pairwise) plots of some major assays each show limited separation. The set of three such plots of the three major principal components reveals more distinction between the groups than was offered by univariate analysis. Three-dimensional extensions of these techniques allow better insight than any of the two-dimensional plots, but these three-dimensional versions require more plots for complete interpretation. Non-linear mapping of the data is the best way of retaining the distances and a fairly good separation is achieved in the plot. The plot is less informative about shape and relative position of the classes. Representation of the data as pictures of faces does not offer additional information and visual clustering is worse than in any of the techniques mentioned. During the analysis many assumed properties of the data are confirmed and a good starting pointfor multivariate classification is attained. Easy visual detection of outliers is offered by all techniques. Unfortunately, valuable information is lost in this data set by deleting some incomplete variables.
[1]
M. Braga,et al.
Exploratory Data Analysis
,
2018,
Encyclopedia of Social Network Analysis and Mining. 2nd Ed..
[2]
Brian Everitt,et al.
Graphical Techniques for Multivariate Data.
,
1978
.
[3]
Bruce R. Kowalski,et al.
Chemometrics, mathematics and statistics in chemistry
,
1984
.
[4]
Kurt Varmuza,et al.
Pattern recognition in chemistry
,
1980
.
[5]
John M. Chambers,et al.
Graphical Methods for Data Analysis
,
1983
.
[6]
Peter C. Wang,et al.
Graphical Representation of Multivariate Data
,
1978
.
[7]
B. Ripley,et al.
Pattern Recognition
,
1968,
Nature.