Dimensionality Reduction and Subspace Clustering in Mixed Reality for Condition Monitoring of High-Dimensional Production Data

Visual analytics is becoming increasingly important in light of big data and related scenarios. Following this trend, the field of immersive analytics has advanced in several directions, as it can provide sophisticated visual data analytics while preserving user-friendliness. Furthermore, recent hardware developments such as smart glasses, as well as achievements in virtual-reality applications, have fueled immersive analytics solutions. Notably, such solutions can be very effective when applied to high-dimensional datasets. Building on this advantage, the work at hand applies immersive analytics to a high-dimensional production dataset to improve the digital support of daily work tasks. More specifically, a mixed-reality implementation is presented that supports manufacturers as well as data scientists in comprehensively analyzing machine data. As a particular goal, the prototype is intended to simplify the analysis of manufacturing data through the use of dimensionality reduction effects. To this end, five aspects are reported in this paper. First, it is shown how dimensionality reduction effects can be represented by clusters. Second, it is presented how the information loss resulting from the reduction is addressed. Third, the graphical interface of the developed prototype is illustrated, which provides (1) a correlation coefficient graph, (2) a plot of the information loss, and (3) a 3D particle system. In addition, a voice recognition feature of the prototype is shown, which was considered promising for selecting or deselecting the data variables that users are interested in when analyzing the data. Fourth, based on a machine learning library, it is shown how the prototype reduces the computational resources required when using smart glasses. The main idea is based on a recommendation approach as well as the use of subspace clustering. Fifth, results from a practical setting are presented, in which the prototype was shown to domain experts. The experts reported that such a tool is helpful for the daily analysis of machine data. Moreover, they reported that such a system can be used to train machine operators more effectively. As a general outcome of this work, the presented approach may constitute a helpful solution for industry as well as for other domains such as medicine.
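To make two of the quantities named above more concrete, the following minimal sketch computes the edges of a correlation coefficient graph and the information loss of a linear dimensionality reduction. It is an illustration under stated assumptions, not the prototype's implementation: the abstract does not name the reduction technique, so a PCA-style projection is assumed, with the information loss taken as the share of total variance that the retained components do not explain. The function names, the correlation threshold, and the toy data are hypothetical.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long value lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def correlation_edges(columns, threshold=0.8):
    """Edges of a correlation graph: variable pairs with |r| >= threshold.

    The threshold is an illustrative assumption, not a value from the paper.
    """
    names = list(columns)
    edges = []
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            r = pearson(columns[a], columns[b])
            if abs(r) >= threshold:
                edges.append((a, b, r))
    return edges


def covariance_matrix(columns):
    """Sample covariance matrix (n - 1 denominator) of the given variables."""
    names = list(columns)
    n = len(columns[names[0]])
    centered = []
    for name in names:
        mean = sum(columns[name]) / n
        centered.append([v - mean for v in columns[name]])
    return [[sum(p * q for p, q in zip(ci, cj)) / (n - 1) for cj in centered]
            for ci in centered]


def top_eigenvalues(matrix, k, iterations=500, seed=0):
    """Largest k eigenvalues of a symmetric matrix (power iteration + deflation)."""
    rng = random.Random(seed)
    m = [row[:] for row in matrix]
    d = len(m)
    eigenvalues = []
    for _ in range(k):
        v = [rng.gauss(0.0, 1.0) for _ in range(d)]
        for _ in range(iterations):
            w = [sum(m[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm == 0.0:
                break
            v = [x / norm for x in w]
        mv = [sum(m[i][j] * v[j] for j in range(d)) for i in range(d)]
        vv = sum(x * x for x in v)
        lam = sum(a * b for a, b in zip(v, mv)) / vv  # Rayleigh quotient
        eigenvalues.append(lam)
        # Deflation: remove the component that was just found.
        for i in range(d):
            for j in range(d):
                m[i][j] -= lam * v[i] * v[j] / vv
    return eigenvalues


def information_loss(columns, k):
    """Share of total variance NOT captured by the top-k principal components."""
    cov = covariance_matrix(columns)
    total = sum(cov[i][i] for i in range(len(cov)))
    return 1.0 - sum(top_eigenvalues(cov, k)) / total


# Hypothetical toy data: "b" is a scaled copy of "a"; "c" is uncorrelated.
toy = {
    "a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "b": [2.0, 4.0, 6.0, 8.0, 10.0],
    "c": [1.0, -1.0, 1.0, -1.0, 1.0],
}
print(correlation_edges(toy))
print(information_loss(toy, k=1), information_loss(toy, k=2))
```

On the toy data only the pair ("a", "b") forms an edge, and keeping two components loses essentially nothing because "b" carries no variance beyond that of "a". In practice a numerical library would replace the hand-written eigenvalue routine; plain Python is used here only to keep the sketch free of dependencies.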
