Digging into Human Rights Violations: phrase mining and trigram visualization

Unidentified victims, perpetrators, and details of human rights violations are camouflaged by the scale of collections of witness reports. This project responds to that problem by developing an approach for identifying the appearance of often unnamed individuals across a corpus by integrating semi-supervised machine-learning based phrase mining, network visualization, and an event model from human rights reporting known as “Who Did What to Whom.” This natural language processing method facilitates the retrieval of cross-document narratives of victims -and perpetrators -of population level events.

[1]  Nahum D. Gershon,et al.  Visualization of fuzzy data using generalized animation , 1992, Proceedings Visualization '92.

[2]  W. Walker,et al.  Defining Uncertainty: A Conceptual Basis for Uncertainty Management in Model-Based Decision Support , 2003 .

[3]  James R. Glass,et al.  Iterative language model estimation: efficient data structure & algorithms , 2008, INTERSPEECH.

[4]  Alex T. Pang,et al.  Glyphs for Visualizing Uncertainty in Vector Fields , 1996, IEEE Trans. Vis. Comput. Graph..

[5]  Tong Zhang,et al.  A Robust Risk Minimization based Named Entity Recognition System , 2003, CoNLL.

[6]  Dan Klein,et al.  Fast Exact Inference with a Factored Model for Natural Language Parsing , 2002, NIPS.

[7]  Dan Roth,et al.  Design Challenges and Misconceptions in Named Entity Recognition , 2009, CoNLL.

[8]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[9]  Mark A. Przybocki,et al.  Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction , 2008, LREC.

[10]  Suresh K. Lodha,et al.  LISTEN: sounding uncertainty visualization , 1996, Proceedings of Seventh Annual IEEE Visualization '96.

[11]  Alex T. Pang,et al.  Approaches to uncertainty visualization , 1996, The Visual Computer.

[12]  Penny Rheingans,et al.  Point-based probabilistic surfaces to show surface uncertainty , 2004, IEEE Transactions on Visualization and Computer Graphics.

[13]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[14]  Mark Gahegan,et al.  Visualizing Geospatial Information Uncertainty: What We Know and What We Need to Know , 2005 .

[15]  Mark T. Maybury,et al.  Automatic Summarization , 2002, Computational Linguistics.

[16]  Bongshin Lee,et al.  Revealing Uncertainty for Information Visualization , 2010, Inf. Vis..

[17]  William W. Cohen,et al.  Learning to match and cluster large high-dimensional data sets for data integration , 2002, KDD.

[18]  Angel X. Chang,et al.  SUTime: A library for recognizing and normalizing time expressions , 2012, LREC.

[19]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[20]  Alex T. Pang,et al.  UFLOW: visualizing uncertainty in fluid flow , 1996, Proceedings of Seventh Annual IEEE Visualization '96.

[21]  Patrick Ball,et al.  The Bosnian Book of Dead: Assessment of the Database (Full Report) , 2007 .