HealthAnalytic: A concept application for customizable visualization and analysis of health informatics datasets

A large amount of health statistics of the Indian population has recently become available in the public domain owing to the National Data Sharing and Accessibility Policy (NDSAP). This statistical data is beneficial if it can be used to obtain meaningful inferences from data-driven models. Analysis and visualization of such data has been a challenge and has become an imperative requirement for executive decision making. The challenge could be attributed to the fact that the set of features that an attribute is dependent on is subjective and data-dependent. Hence, there is a need to provide a pervasive and flexible data analytics framework that would allow this data to drive decision making. We propose a concept for the design and development of a fully customizable data analysis framework on the Android platform which allows users to visualize the effect of any combination of attributes drawn from these datasets in predicting a particular attribute (label). We use feature scoring techniques to achieve this, and also provide an additional option to view data correlation. The intended application, called HealthAnalytic, is beneficial for executive level decision-making, as it provides visualization via data-driven models along with the pervasiveness of a mobile application. The initial version of the application uses existing methods, however newer scalable methods can be incorporated in the future.

[1]  Xianggui Qu,et al.  Multivariate Data Analysis , 2007, Technometrics.

[2]  Chih-Jen Lin,et al.  Feature Ranking Using Linear SVM , 2008, WCCI Causation and Prediction Challenge.

[3]  Cyrus Shahabi,et al.  Feature subset selection and feature ranking for multivariate time series , 2005, IEEE Transactions on Knowledge and Data Engineering.

[4]  Christian Callegari,et al.  Advances in Computing, Communications and Informatics (ICACCI) , 2015 .

[5]  Sanjiva Weerawarana,et al.  Unraveling the Web services web: an introduction to SOAP, WSDL, and UDDI , 2002, IEEE Internet Computing.

[6]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[7]  Douglas Crockford,et al.  The application/json Media Type for JavaScript Object Notation (JSON) , 2006, RFC.

[8]  Haim Levkowitz,et al.  Cloud and Mobile Web-Based Graphics and Visualization , 2012, 2012 25th SIBGRAPI Conference on Graphics, Patterns and Images Tutorials.

[9]  Juan Miguel García-Gómez,et al.  Mobile Clinical Decision Support Systems and Applications: A Literature and Commercial Review , 2013, Journal of Medical Systems.

[10]  Yonggang Wen,et al.  Toward Scalable Systems for Big Data Analytics: A Technology Tutorial , 2014, IEEE Access.

[11]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[12]  D. Altman,et al.  Statistics notes: Calculating correlation coefficients with repeated observations: Part 1—correlation within subjects , 1995 .

[13]  A. Meyer-Bäse Feature Selection and Extraction , 2004 .

[14]  Jacob Benesty,et al.  Pearson Correlation Coefficient , 2009 .

[15]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[16]  Peter Groves,et al.  The 'big data' revolution in healthcare: Accelerating value and innovation , 2016 .

[17]  Ben Shneiderman,et al.  Community Health Map: A geospatial and multivariate data visualization tool for public health datasets , 2012, Gov. Inf. Q..

[18]  Philip Sedgwick,et al.  Pearson’s correlation coefficient , 2012, BMJ : British Medical Journal.

[19]  David W. Aha,et al.  A Comparative Evaluation of Sequential Feature Selection Algorithms , 1995, AISTATS.

[20]  Diana Peh,et al.  BIRT: A Field Guide to Reporting (The Eclipse Series) , 2006 .

[21]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[22]  Deo Shao,et al.  A Proposal of a Mobile Health Data Collection and Reporting System for the Developing World , 2012 .

[23]  D. Basak,et al.  Support Vector Regression , 2008 .