KNOWLEDGE DISCOVERY FROM DYNAMIC, DISPARATE DATA

Data has now become the fourth pillar of the discovery process, joining experimentation, theory, and simulations. Examples can be found in many fields of application such as healthcare, medicine, economics, science, disaster recovery, national security, and others. Today the data landscape is very rich. Data can be generated from experiments (e.g. science facilities), output from simulations (e.g. high performance computers), provided by humans (e.g. social media), or collected from sensors (e.g. satellite data). Bringing all types of disparate data to bear on a particular discovery process or mission outcome is one of the new frontiers of science and technology. In this paper, we provide an overview of our approach to putting together a knowledge discovery portfolio and provide a few recent and current examples across applications of national interest.

[1]  John R. Goodall,et al.  Visual analysis of code security , 2010, VizSec '10.

[2]  Raghul Gunasekaran,et al.  Big Data Platforms as a Service: Challenges and Approach , 2012, HotCloud.

[3]  Stacy J. Prowell,et al.  Automated vulnerability detection for compiled smart grid software , 2012, 2012 IEEE PES Innovative Smart Grid Technologies (ISGT).

[4]  Glenn O. Allgood,et al.  Capacity utilization study for aviation security cargo inspection queuing system , 2010, Defense + Commercial Sensing.

[5]  Mallikarjun Shankar,et al.  Addressing extensibility requirements for health informatics data , 2012, SIGH.

[6]  Chad A. Steed,et al.  A New Collaborative Tool for Visually Understanding National Health Indicators , 2012 .

[7]  Kalyan S. Perumalla,et al.  Switching to High Gear: Opportunities for Grand-Scale Real-Time Parallel Simulations , 2009, 2009 13th IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications.

[8]  Yoonjin Lee,et al.  Construction of extremal self-dual codes over F2+uF2 with an automorphism of odd order , 2012, Finite Fields Their Appl..

[9]  James J. Nutaro,et al.  Calculating Frequency at Loads in Simulations of Electro-Mechanical Transients , 2012, IEEE Transactions on Smart Grid.

[10]  W. Grice Arbitrarily complete Bell-state measurement using only linear optical elements , 2011 .

[11]  Justin M. Beaver,et al.  Host-Based Data Exfiltration Detection via System Call Sequences , 2011 .

[12]  Tadayoshi Kohno,et al.  A Review of the Security of Insulin Pump Infusion Systems , 2011, Journal of diabetes science and technology.

[13]  Sreenivas R. Sukumar,et al.  Value-of-information based middleware for autonomic querying of distributed sensor databases , 2010, Defense + Commercial Sensing.

[14]  B. Bhaduri,et al.  LandScan USA: a high-resolution geospatial and temporal modeling approach for population distribution and dynamics , 2007 .

[15]  Garry Robins,et al.  An introduction to exponential random graph (p*) models for social networks , 2007, Soc. Networks.

[16]  Ali R. Hurson,et al.  TF-ICF: A New Term Weighting Scheme for Clustering Dynamic Data Streams , 2006, 2006 5th International Conference on Machine Learning and Applications (ICMLA'06).

[17]  Thomas E. Potok,et al.  A Text Analysis Approach to Motivate Knowledge Sharing via Microsoft SharePoint , 2012, 2012 45th Hawaii International Conference on System Sciences.

[18]  Varun Chandola,et al.  A scalable gaussian process analysis algorithm for biomass monitoring , 2011, Stat. Anal. Data Min..

[19]  John R. Goodall,et al.  VIAssist: Visual analytics for cyber defense , 2009, 2009 IEEE Conference on Technologies for Homeland Security.

[20]  Yi Fang,et al.  Incremental Anomaly Detection Approach for Characterizing Unusual Profiles , 2008, KDD Workshop on Knowledge Discovery from Sensor Data.

[21]  Denis Gillet,et al.  Trust-based rating prediction for recommendation in Web 2.0 collaborative learning social software , 2010, 2010 9th International Conference on Information Technology Based Higher Education and Training (ITHET).

[22]  Rahul Ramachandran,et al.  Lightweight Advertising and Scalable Discovery of Earth Science Datasets, Services, Workflows and Events Using Broadcast Feeds , 2011 .

[23]  Sudip K. Seal,et al.  Discrete event modeling and massively parallel execution of epidemic outbreak phenomena , 2012, Simul..

[24]  Richard M. Fujimoto,et al.  Network Simulation , 2006, Network Simulation.

[25]  Budhendra L. Bhaduri,et al.  Visualizing Energy Resources Dynamically on Earth , 2008, 2008 IEEE Power and Energy Society General Meeting - Conversion and Delivery of Electrical Energy in the 21st Century.

[26]  Thomas E. Potok,et al.  A Simple Distributed Particle Swarm Optimization for Dynamic and Noisy Environments , 2008, NICSO.

[27]  T. J. Jankun-Kelly,et al.  Guided analysis of hurricane trends using statistical processes integrated with interactive parallel coordinates , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.