SOCR Motion Charts: An Efficient, Open-Source, Interactive and Dynamic Applet for Visualizing Longitudinal Multivariate Data

The amount, complexity and provenance of data have dramatically increased in the past five years. Visualization of observed and simulated data is a critical component of any social, environmental, biomedical or scientific quest. Dynamic, exploratory and interactive visualization of multivariate data, without preprocessing by dimensionality reduction, remains a nearly insurmountable challenge. The Statistics Online Computational Resource (www.SOCR.ucla.edu) provides portable online aids for probability and statistics education, technology-based instruction and statistical computing. We have developed a new Java-based infrastructure, SOCR Motion Charts, for discovery-based exploratory analysis of multivariate data. This interactive data visualization tool enables the visualization of high-dimensional longitudinal data. SOCR Motion Charts allows mapping of ordinal, nominal and quantitative variables onto time, 2D axes, size, colors, glyphs and appearance characteristics, which facilitates the interactive display of multidimensional data. We validated this new visualization paradigm using several publicly available multivariate datasets including Ice-Thickness, Housing Prices, Consumer Price Index, and California Ozone Data. SOCR Motion Charts is designed using object-oriented programming, implemented as a Java Web-applet and is available to the entire community on the web at www.socr.ucla.edu/SOCR_MotionCharts. It can be used as an instructional tool for rendering and interrogating high-dimensional data in the classroom, as well as a research tool for exploratory data analysis.

[1]  Arthur W. Toga,et al.  LONI Visualization Environment , 2006, Journal of Digital Imaging.

[2]  Ivo D Dinov,et al.  SOCR: Statistics Online Computational Resource. , 2006, Journal of statistical software.

[3]  Ivo D Dinov,et al.  Statistics Online Computational Resource for Education , 2009, Teaching statistics.

[4]  Charles Safran,et al.  Toward a national framework for the secondary use of health data: an American Medical Informatics Association White Paper. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[5]  M. V. van Strijen,et al.  Diagnosis and management of subsegmental pulmonary embolism , 2006, Journal of thrombosis and haemostasis : JTH.

[6]  Ivo D. Dinov,et al.  Law of Large Numbers: The Theory, Applications and Technology-Based Education , 2009, Journal of statistics education : an international journal on the teaching and learning of statistics.

[7]  Armin Grossenbacher The globalisation of statistical content , 2008 .

[8]  John Vermylen Visualizing Energy Data Using Web-Based Applications , 2008 .

[9]  Russell K. Schutt,et al.  Research Methods in Education , 2011 .

[10]  Richard D. Smith,et al.  Advances in proteomics data analysis and display using an accurate mass and time tag approach. , 2006, Mass spectrometry reviews.

[11]  Ivo D. Dinov,et al.  Pedagogical utilization and assessment of the statistic online computational resource in introductory probability and statistics courses , 2008, Comput. Educ..