XLab: Early Indications & Warnings from Open Source Data with Application to Biological Threat

XLab is an early warning system that addresses a broad range of national security threats using a flexible, rapidly reconfigurable architecture. XLab enables intelligence analysts to visualize, explore, and query a knowledge base constructed from multiple data sources, guided by subject matter expertise codified in threat model graphs. This paper describes a novel system prototype that addresses threats arising from biological weapons of mass destruction. The prototype applies knowledge extraction analytics—including link estimation, entity disambiguation, and event detection—to build a knowledge base of 40 million entities and 140 million relationships from open sources. Exact and inexact subgraph matching analytics enable analysts to search the knowledge base for instances of modeled threats. The paper introduces new methods for inexact matching that accommodate threat models with temporal and geospatial patterns. System performance is demonstrated using several simplified threat models and an embedded scenario.

[1]  László Babai,et al.  Graph isomorphism in quasipolynomial time [extended abstract] , 2015, STOC.

[2]  Hanghang Tong,et al.  FIRST: Fast Interactive Attributed Subgraph Matching , 2017, KDD.

[3]  Gregory D. Koblentz,et al.  Predicting Peril or the Peril of Prediction? Assessing the Risk of CBRN Terrorism , 2011 .

[4]  Bruce L. Golden,et al.  Early detection of bioterrorism: Monitoring disease using an agent-based model , 2014, Proceedings of the Winter Simulation Conference 2014.

[5]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[6]  Olga Simek,et al.  Named Entity Recognition in 140 Characters or Less , 2016, #Microposts.

[7]  Reynold Xin,et al.  GraphFrames: an integrated API for mixing graph and relational queries , 2016, GRADES '16.

[8]  Hanghang Tong,et al.  MAGE: Matching approximate patterns in richly-attributed graphs , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[9]  Alessia Saggese,et al.  Introducing VF3: A New Algorithm for Subgraph Isomorphism , 2017, GbRPR.

[10]  Charu C. Aggarwal,et al.  NeMa: Fast Graph Search with Label Similarity , 2013, Proc. VLDB Endow..

[11]  Kedar P. Sambhoos Graph matching applications in high level information fusion , 2007 .

[12]  Kedar Sambhoos,et al.  Using Dynamic Graph Matching and Gravity Models for Early Detection of Bioterrorist Attacks , 2009 .

[13]  Rakesh Nagi,et al.  A fuzzy graph matching approach in intelligence analysis and maintenance of continuous situational awareness , 2014, Inf. Fusion.

[14]  Jeong-Hoon Lee,et al.  An In-depth Comparison of Subgraph Isomorphism Algorithms in Graph Databases , 2012, Proc. VLDB Endow..

[15]  Alan N. Steinberg,et al.  A Model for Threat Assessment , 2016 .

[16]  Lin Li,et al.  A comparison of techniques for name matching. , 2012 .

[17]  Christos Faloutsos,et al.  Fast best-effort pattern matching in large attributed graphs , 2007, KDD '07.

[18]  Jeanne Guillemin The Soviet Biological Weapons Program: A History , 2013 .