Uncovering and Managing the Impact of Methodological Choices for the Computational Construction of Socio-Technical Networks from Texts

Socio-technical networks are ubiquitous and impact society on many dimensions. As individuals become socialized into those networks, they alternately internalize network behavior or transform network behavior through their participation. Frequently the functioning of networks involves communication within the network or processing of communication and information originating outside the network. Such communication and information data are often available as unstructured, natural language text data. Often in prior work, text data are analyzed separately from relational data, or are reduced to the fact and frequency of the flow of information between nodes. The latter approach acknowledges that information exchange has taken place, but disregards the content of the text data. However, we know that by not considering the substance of communication and information, we are limited in our ability to understand the effects of language use in networks, including the interplay and co-evolution of information and network structure and behavior. Thus, we expect that in bringing together text data and relational data, we will be able to make substantial advances in network analysis. A complicating factor is that sometimes the structure and behavior of networks are encoded in the text data itself. In these cases, network data needs to be extracted from text data. I propose to develop, apply and evaluate a set of computational methods that facilitate the joint analysis of relational data and the content of text data. In working towards this goal, I use an interdisciplinary and computationally rigorous approach that combines theory and models from social science and socio-linguistics with methods from natural language processing and machine learning that are based on probabilistic graphical models. The datasets used for this work are the Enron email data, data about research funding, and a dataset about the Sudan. The anticipated contributions include: - Provide and evaluate methods that will be integrated into the publicly available software products AutoMap and ORA. - Clean and normalize public datasets that contain relational data and text data in order to ensure that each node represents one unique social entity and no entity is represented by more than one node. The overall goal with this thesis is to provide methods that support users in collecting rich network data that allow for meaningful and actionable analysis.

[1]  Sunita Sarawagi,et al.  Information Extraction , 2008 .

[2]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[3]  Lada A. Adamic,et al.  Internet: Growth dynamics of the World-Wide Web , 1999, Nature.

[4]  Christopher Winship Thoughts about roles and relations: An old document revisited , 1988 .

[5]  J. Milroy,et al.  Linguistic change, social network and speaker innovation , 1985, Journal of Linguistics.

[6]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[7]  Ulrik Brandes,et al.  Network Analysis: Methodological Foundations , 2010 .

[8]  Lesley Milroy,et al.  Language and social networks , 1980 .

[9]  Allan Collins,et al.  A spreading-activation theory of semantic processing , 1975 .

[10]  Peter S. Bearman,et al.  Becoming a Nazi: A model for narrative networks☆ , 2000 .

[11]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[12]  L MercerRobert,et al.  Class-based n-gram models of natural language , 1992 .

[13]  H. Bernard,et al.  Data Management and Analysis Methods , 2000 .

[14]  J. Kruskal The Relationship between Multidimensional Scaling and Clustering , 1977 .

[15]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[16]  Dennis V. Pereira,et al.  Automatic Lexicon Generation for Unsupervised Part-of-Speech Tagging Using Only Unannotated Text , 1999 .

[17]  Ronald S. Burt,et al.  Interorganization Contagion in Corporate Philanthropy , 1991 .

[18]  James D. Herbsleb,et al.  Construction of association networks from communication in teams working on complex projects , 2011, Stat. Anal. Data Min..

[19]  Andrew McCallum,et al.  Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression , 2008, UAI.

[20]  Kathleen M. Carley Designing organizational structures to cope with communication breakdowns: a simulation model , 1991 .

[21]  Marvin Minsky,et al.  A framework for representing knowledge , 1974 .

[22]  D. Krackhardt Graph theoretical dimensions of informal organizations , 1994 .

[23]  William A. Woods,et al.  What's in a Link: Foundations for Semantic Networks , 1975 .

[24]  Dawn Iacobucci,et al.  Social Contagion and Social Structure , 2008 .

[25]  Jonathon N. Cummings,et al.  Collaborative Research Across Disciplinary and Organizational Boundaries , 2005 .

[26]  Heeyoung Lee,et al.  A Multi-Pass Sieve for Coreference Resolution , 2010, EMNLP.

[27]  Ajay Mehra The Development of Social Network Analysis: A Study in the Sociology of Science , 2005 .

[28]  P. V. Marsden,et al.  NETWORK DATA AND MEASUREMENT , 1990 .

[29]  Terrill L. Frantz,et al.  Communication Networks from the Enron Email Corpus “It's Always About the People. Enron is no Different” , 2005, Comput. Math. Organ. Theory.

[30]  Kathleen M. Carley,et al.  Exploration of communication networks from the Enron email corpus , 2005 .

[31]  Lada A. Adamic,et al.  Tracking information epidemics in blogspace , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[32]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[33]  M. Ross Quillian,et al.  Retrieval time from semantic memory , 1969 .

[34]  Jaideep Srivastava,et al.  Dark Gold: Statistical Properties of Clandestine Networks in Massively Multiplayer Online Games , 2010, 2010 IEEE Second International Conference on Social Computing.

[35]  Angelica BACIVAROV,et al.  A Neuro-Classification Model for Socio-Technical Systems , 2009 .

[36]  Kathleen M. Carley,et al.  AutoMap User's Guide , 2006 .

[37]  James F. Allen,et al.  What's in a Semantic Network? , 1982, ACL.

[38]  Jon Patrick,et al.  Identifying Interpersonal Distance using Systemic Features , 2003, Computing Attitude and Affect in Text.

[39]  Andrew McCallum,et al.  Joint Group and Topic Discovery from Relations and Text , 2006, SNA@ICML.

[40]  Ann Lewins,et al.  Using Software in Qualitative Research: A Step-by-Step Guide , 2007 .

[41]  Loet Leydesdorff,et al.  Network Structure, Self-Organization and the Growth of International Collaboration in Science.Research Policy, 34(10), 2005, 1608-1618. , 2005, 0911.4299.

[42]  H. White,et al.  “Structural Equivalence of Individuals in Social Networks” , 2022, The SAGE Encyclopedia of Research Design.

[43]  D. Krackhardt Assessing the political landscape: Structure, cognition, and power in organizations. , 1990 .

[44]  Stuart C. Shapiro,et al.  A Net Structure for Semantic Information Storage, Deduction and Retrieval , 1971, IJCAI.

[45]  Duncan J. Watts The accidental influentials , 2011 .

[46]  Kathleen M. Carley,et al.  Network Analysis Software , 2011 .

[47]  Uffe Kock Wiil,et al.  Detecting Social Polarization and Radicalization , 2011 .

[48]  Philip A. Schrodt,et al.  MACHINE CODING OF EVENT DATA USING REGIONAL AND INTERNATIONAL SOURCES , 1994 .

[49]  C. Shapiro,et al.  Technology Adoption in the Presence of Network Externalities , 1986, Journal of Political Economy.

[50]  Wei Li,et al.  Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons , 2003, CoNLL.

[51]  E. F. Tjong Kim Sang,et al.  Proceedings of CoNLL-2009 , 2009, ACL 2009.

[52]  Ben Taskar,et al.  Statistical Relational Learning for Natural Language Information Extraction , 2007 .

[53]  John F. Sowa,et al.  Conceptual Structures: Information Processing in Mind and Machine , 1983 .

[54]  Dmitry Zelenko,et al.  Kernel methods for relation extraction , 2003 .

[55]  Ian McAllister,et al.  Bandwagon, Underdog, or Projection? Opinion Polls and Electoral Choice in Britain, 1979-1987 , 1991, The Journal of Politics.

[56]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[57]  C. McCarty,et al.  COMPARING FOUR DIFFERENT METHODS FOR MEASURING PERSONAL SOCIAL NETWORKS , 1990 .

[58]  Tong Zhang,et al.  Named Entity Recognition through Classifier Combination , 2003, CoNLL.

[59]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[60]  Kathleen M. Carley Formalizing the Social Expert's Knowledge , 1988 .

[61]  Richard A. Lobban,et al.  Alienation, Urbanisation, and Social Networks in the Sudan , 1975, The Journal of Modern African Studies.

[62]  Carl W. Roberts,et al.  3. A Generic Semantic Grammar for Quantitative Text Analysis: Applications to East and West Berlin Radio News Content from 1979 , 1997 .

[63]  Terrill L. Frantz,et al.  An Automated Methodology for Conducting a Social Network Study of a University Faculty , 2005 .

[64]  Kathleen M. Carley,et al.  Toward Automated Definition Acquisition From Operations Law , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[65]  Cliff Lampe,et al.  A familiar face(book): profile elements as signals in an online social network , 2007, CHI.

[66]  H. Simon,et al.  ON A CLASS OF SKEW DISTRIBUTION FUNCTIONS , 1955 .

[67]  Charles A. McClelland,et al.  The Management and Analysis of International Event Data: A Computerized System for Monitoring and Projecting Event Flows. , 1971 .

[68]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[69]  Jeffrey C. Johnson,et al.  Network Visualization: The "Bush Team" in Reuters News Ticker 9/11-11/15/01 , 2004, J. Soc. Struct..

[70]  Jeroen Groenendijk,et al.  Formal methods in the study of language , 1983 .

[71]  Philip A. Schrodt,et al.  Monitoring conflict using automated coding of newswire reports: a comparison of five geographical regions , 2001 .

[72]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[73]  David B. Skillicorn,et al.  Knowledge Discovery for Counterterrorism and Law Enforcement , 2008 .

[74]  B. Erickson Secret Societies and Social Structure , 1981 .

[75]  Razvan C. Bunescu,et al.  Learning for information extraction: from named entity recognition and disambiguation to relation extraction , 2007 .

[76]  Peter R. Monge,et al.  Theories of Communication Networks , 2003 .

[77]  Christine D. Piatko,et al.  Named Entity Recognition using Hundreds of Thousands of Features , 2003, CoNLL.

[78]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[79]  Dan Klein,et al.  Named Entity Recognition with Character-Level Models , 2003, CoNLL.

[80]  L. Getoor,et al.  1 Global Inference for Entity and Relation Identification via a Linear Programming Formulation , 2007 .

[81]  J. Mitchell,et al.  The Concept and Use of Social Networks , 1969 .

[82]  Heather Fry,et al.  A user’s guide , 2003 .

[83]  A. Hämmerli,et al.  Conflict and Cooperation in an Actors' Network of Chechnya Based on Event Data , 2006 .

[84]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[85]  K. Reitz,et al.  Graph and Semigroup Homomorphisms on Networks of Relations , 1983 .

[86]  A. Strauss,et al.  The discovery of grounded theory: strategies for qualitative research aldine de gruyter , 1968 .

[87]  V Latora,et al.  Efficient behavior of small-world networks. , 2001, Physical review letters.

[88]  Macartan Humphreys,et al.  Natural Resources, Conflict, and Conflict Resolution , 2005 .

[89]  Joseph D. Novak,et al.  Learning How to Learn , 1984 .

[90]  Thomas G. Dietterich Machine Learning for Sequential Data: A Review , 2002, SSPR/SPR.

[91]  Y. Altun,et al.  Named-Entity Recognition in Novel Domains with External Lexical Knowledge , 2005 .

[92]  P. Arabie,et al.  An algorithm for clustering relational data with applications to social network analysis and comparison with multidimensional scaling , 1975 .

[93]  I. A. Richards,et al.  The Meaning of Meaning: a Study of the Influence of Language upon Thought and of the Science of Symbolism , 1923, Nature.

[94]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[95]  Ricky Leung Network Position, Research Funding and Interdisciplinary Collaboration among Nanotechnology Scientists: An Application of Social Network Analysis , 2007 .

[96]  Marya L. Doerfel What Constitutes Semantic Network Analysis? A Comparison of Research and Methodologies' , 2003 .

[97]  Lise Getoor,et al.  Collective entity resolution in relational data , 2007, TKDD.

[98]  Marc Sageman,et al.  Understanding terror networks. , 2004, International journal of emergency mental health.

[99]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[100]  Nancy J. Cooke,et al.  1 CONVERGING APPROACHES TO AUTOMATED COMMUNICATIONS-BASED ASSESSMENT OF TEAM SITUATION AWARENESS , 2007 .

[101]  Kees van Deemter,et al.  On Coreferring: Coreference in MUC and Related Annotation Schemes , 2000, CL.

[102]  S. Nadel The Theory of Social Structure , 1957 .

[103]  S. Wasserman,et al.  Models and Methods in Social Network Analysis , 2005 .

[104]  S. Boorman,et al.  Social Structure from Multiple Networks. I. Blockmodels of Roles and Positions , 1976, American Journal of Sociology.

[105]  Andrew E. Smith,et al.  Evaluation of unsupervised semantic mapping of natural language with Leximancer concept mapping , 2006, Behavior research methods.

[106]  Rohit J. Kate,et al.  Comparative experiments on learning information extractors for proteins and their interactions , 2005, Artif. Intell. Medicine.

[107]  Kathleen M. Carley,et al.  Toward an interoperable dynamic network analysis toolkit , 2007, Decis. Support Syst..

[108]  Yan Zhao,et al.  Analyzing Actors and Their Discussion Topics by Semantic Social Network Analysis , 2006, Tenth International Conference on Information Visualisation (IV'06).

[109]  Christos Faloutsos,et al.  Graph evolution: Densification and shrinking diameters , 2006, TKDD.

[110]  F. Auerbach Das Gesetz der Bevölkerungskonzentration. , 1913 .

[111]  Kathleen M. Carley,et al.  An Integrated Approach to the Collection and Analysis of Network Data , 2004 .

[112]  N. M. Morris,et al.  On Looking into the Black Box: Prospects and Limits in the Search for Mental Models , 1986 .

[113]  Kathleen M. Carley,et al.  Relationale Methoden in der Erforschung, Ermittlung und Prävention von Kriminalität , 2010, Handbuch Netzwerkforschung.

[114]  Carole M. McNamee Using Both Sides of the Brain: Experiences that Integrate Art and Talk Therapy Through Scribble Drawings , 2004 .

[115]  W. H. van Atteveldt,et al.  Semantic Network Analysis: Techniques for Extracting, Representing, and Querying Media Content , 2008 .

[116]  Jon M. Kleinberg,et al.  Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.

[117]  James D. Herbsleb,et al.  Communication networks in geographically distributed software development , 2008, CSCW.

[118]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[119]  H. Bernard,et al.  Handbook of Methods in Cultural Anthropology , 2000 .

[120]  Andrew P. Feldstein Brand Communities in a World of Knowledge-based Products and Common Property , 2007 .

[121]  Philip A. Schrodt AUTOMATED CODING OF INTERNATIONAL EVENT DATA USING SPARSE PARSING TECHNIQUES , 2000 .

[122]  W. Bainbridge The Scientific Research Potential of Virtual Worlds , 2007, Science.

[123]  Razvan Bunescu and Raymond J. Mooney Statistical Relational Learning for Natural Language Information Extraction , 2007 .

[124]  Gary King,et al.  An Automated Information Extraction Tool for International Conflict Data with Performance as Good as Human Coders: A Rare Events Evaluation Design , 2003, International Organization.

[125]  Ralph Grishman,et al.  Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition , 1998, VLC@COLING/ACL.

[126]  Camille Roth,et al.  Social and semantic coevolution in knowledge networks , 2010, Soc. Networks.

[127]  J. Boster,et al.  Social roles and the evolution of networks in extreme and isolated environments , 2003, The Journal of mathematical sociology.

[128]  R. Burt The Social Capital of Opinion Leaders , 1999 .

[129]  David M. Blei,et al.  Connections between the lines: augmenting social networks with text , 2009, KDD.

[130]  Ronald A. Howard,et al.  Knowledge Maps , 1989 .

[131]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[132]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[133]  R. Kraut,et al.  Varieties of Social Influence: the Role of Utility and Norms in the Success of a New Communication Medium , 1998 .

[134]  William W. Cohen,et al.  Exploiting dictionaries in named entity extraction: combining semi-Markov extraction processes and data integration methods , 2004, KDD.

[135]  Frank Biocca,et al.  Building bridges across fields, universities, and countries: Successfully funding communication research through interdisciplinary collaboration , 2002 .

[136]  S. Borgatti,et al.  Regular equivalence: general theory , 1994 .

[137]  Joe Bond,et al.  Integrated Data for Events Analysis (IDEA): An Event Typology for Automated Events Data Development , 2003 .

[138]  Rosina L. Lippi-Green Social network integration and language change in progress in a rural alpine village , 1989, Language in Society.

[139]  Kathleen M. Carley,et al.  Revealing Social Structure from Texts: Meta-Matrix Text Analysis as a Novel Method for Network Text Analysis , 2005 .

[140]  Kathleen M. Carley,et al.  Dynamic Social Network Modeling and Analysis: Workshop Summary and Papers , 2004 .

[141]  P. Lazarsfeld,et al.  Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[142]  Miriam R. L. Petruck FRAME SEMANTICS , 1996 .

[143]  Kathleen M. Carley,et al.  AutoMap User's Guide 2011 , 2011 .

[144]  William W. Cohen,et al.  Node Clustering in Graphs: An Empirical Study , 2010 .

[145]  Kathleen M. Carley,et al.  ORA User's Guide 2011 , 2011 .

[146]  Dafna Shahaf,et al.  Connecting the dots between news articles , 2011, IJCAI 2011.

[147]  Tina Eliassi-Rad,et al.  Visual Analysis of Large Heterogeneous Social Networks by Semantic and Structural Abstraction , 2006 .

[148]  Mark Weiser,et al.  TEXTNET: a network-based approach to text handling , 1986, TOIS.

[149]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[150]  Kathleen M. Carley,et al.  A Methodology for Integrating Network Theory and Topic Modeling and its Application to Innovation Diffusion , 2010, 2010 IEEE Second International Conference on Social Computing.

[151]  Carl W. Roberts,et al.  Text analysis for the social sciences : methods for drawing statistical inferences from texts and transcripts , 1997 .

[152]  Andrew McCallum,et al.  Information Extraction , 2005, ACM Queue.

[153]  David Krackhardt,et al.  Cognitive social structures , 1987 .

[154]  Kathleen M. Carley Smart Agents and Organizations of the Future , 2001 .

[155]  David L. Alderson,et al.  OR FORUM - Catching the "Network Science" Bug: Insight and Opportunity for the Operations Researcher , 2008, Oper. Res..

[156]  Kathleen M. Carley,et al.  Conditional random fields for entity extraction and ontological text coding , 2008 .

[157]  T. Snijders The statistical evaluation of social network dynamics , 2001 .

[158]  SommervilleIan,et al.  Socio-technical systems , 2011 .

[159]  H. Bernard,et al.  Text Analysis: Qualitative and Quantitative Methods , 1998 .

[160]  L. Sailer Structural equivalence: Meaning and definition, computation and application , 1978 .

[161]  Roel Popping,et al.  Knowledge Graphs and Network Text Analysis , 2003 .

[162]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[163]  F. G. Crookshank,et al.  The meaning of meaning : a study of the influence of language upon thought and of the science of symbolism , 1924 .

[164]  Kathleen M. Carley,et al.  The communication infrastructure during the learning process in web based collaborative learning systems , 2011, WebSci '11.

[165]  Claire Cardie,et al.  Reconcile: A Coreference Resolution Research Platform , 2010 .

[166]  Frank M. Bass,et al.  A New Product Growth for Model Consumer Durables , 2004, Manag. Sci..

[167]  Caroline Haythornthwaite,et al.  Analyzing Networked Learning Texts , 2008 .

[168]  E. Rogers Diffusion of Innovations , 1962 .

[169]  Katherine Faust,et al.  Comparing Social Networks: Size, Density, and Local Structure , 2006 .

[170]  Rahul Gupta,et al.  Domain adaptation of information extraction models , 2009, SGMD.

[171]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[172]  Aili Malm,et al.  Social Network and Distance Correlates of Criminal Associates Involved in Illicit Drug Production , 2008 .

[173]  T. Newcomb The acquaintance process , 1961 .

[174]  Marya L. Doerfel,et al.  A Semantic Network Analysis of the International Communication Association , 1999 .

[175]  Ross M. Miller,et al.  What went wrong at Enron : everyone's guide to the largest bankruptcy in U.S. history , 2002 .

[176]  John A. Barnden,et al.  Semantic Networks , 1998, Encyclopedia of Social Network Analysis and Mining.

[177]  Anthony J. G. Hey,et al.  ViewpointA "smart" cyberinfrastructure for research , 2009, Commun. ACM.

[178]  Stuart C. Shapiro,et al.  Encyclopedia of artificial intelligence, vols. 1 and 2 (2nd ed.) , 1992 .

[179]  Katherine Giuffre Mental Maps: Social Networks and the Language of Critical Reviews , 2001 .

[180]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[181]  Arthur B. Markman,et al.  Knowledge Representation , 1998 .

[182]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[183]  Kathleen M. Carley,et al.  A PCANS Model of Structure in Organizations , 1998 .

[184]  Roberto Franzosi,et al.  From Words to Numbers: A Generalized and Linguistics-Based Coding Procedure for Collecting Textual Data , 1989 .

[185]  P. Klerks The Network Paradigm Applied to Criminal Organisations: Theoretical nitpicking or a relevant doctrine for investigators? Recent developments in the Netherlands , 2001 .

[186]  Douglas H. Harris,et al.  The Application of Link Analysis to Police Intelligence , 1975 .

[187]  W. Baker,et al.  THE SOCIAL ORGANIZATION OF CONSPIRACY: ILLEGAL NETWORKS IN THE HEAVY ELECTRICAL EQUIPMENT INDUSTRY* , 1993 .

[188]  Oren Etzioni,et al.  Relational Web Search , 2006 .

[189]  S. Borgatti The Key Player Problem , 2002 .

[190]  Kathleen M. Carley,et al.  Semantic Connectivity: An Approach for Analyzing Symbols in Semantic Networks , 1993 .

[191]  J. Sarnecki Delinquent Networks: Youth Co-Offending in Stockholm , 2001 .

[192]  James S. Boster,et al.  Social position and shared knowledge: Actors' perceptions of status, role, and social structure , 1987 .

[193]  Jerry R. Hobbs Coherence and Coreference , 1979, Cogn. Sci..

[194]  Robin Cowan,et al.  The Joint Dynamics of Networks and Knowledge , 2003 .

[195]  Valdis E. Krebs,et al.  Mapping Networks of Terrorist Cells , 2001 .

[196]  Kathleen M. Carley,et al.  Netintel: A Database for Manipulation of Rich Social Network Data , 2005 .

[197]  Kathleen M. Carley Coding Choices for Textual Analysis: A Comparison of Content Analysis and Map Analysis , 1993 .

[198]  Mark Steyvers,et al.  Topics in semantic representation. , 2007, Psychological review.

[199]  John C. Paolillo The Virtual Speech Community: Social Network and Language Variation on IRC , 1999, J. Comput. Mediat. Commun..

[200]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[201]  Claude E. Shannon,et al.  The mathematical theory of communication , 1950 .

[202]  R. Fazio,et al.  Social network integration , 2011 .

[203]  M. Tushman Special Boundary Roles in the Innovation Process. , 1977 .

[204]  Susan Fitzmaurice Coalitions and the Investigation of Social Influence in Linguistic History , 2000 .

[205]  Tom Richards,et al.  An intellectual history of NUD*IST and NVivo , 2002 .

[206]  Kathleen M. Carley Extracting culture through textual analysis , 1994 .

[207]  James D. Herbsleb,et al.  Socio-technical congruence: a framework for assessing the impact of technical and work dependencies on software development productivity , 2008, ESEM '08.

[208]  Edmond Chow,et al.  Knowledge Representation Issues in Semantic Graphs for Relationship Detection , 2005, AAAI Spring Symposium: AI Technologies for Homeland Security.

[209]  S. Kauffman At Home in the Universe: The Search for the Laws of Self-Organization and Complexity , 1995 .

[210]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[211]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[212]  Ralph Grishman,et al.  A Maximum Entropy Approach to Named Entity Recognition , 1999 .

[213]  Terrill L. Frantz,et al.  Robustness of centrality measures under uncertainty: Examining the role of network topology , 2009, Comput. Math. Organ. Theory.

[214]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[215]  William W. Cohen,et al.  A Comparison of String Metrics for Matching Names and Records , 2003 .

[216]  Fernando Pereira,et al.  Shallow Parsing with Conditional Random Fields , 2003, NAACL.

[217]  J. L. Snell,et al.  An Anatomy of Kinship: Mathematical Models for Structures of Cumulated Roles. , 1965 .

[218]  Kathleen M. Carley,et al.  On the robustness of centrality measures under conditions of imperfect data , 2006, Soc. Networks.

[219]  D. Krackhardt Simmelian Ties: Super Strong and Sticky , 1998 .

[220]  Steven R. Corman,et al.  Studying Complex Discursive Systems: Centering Resonance Analysis of Communication. , 2002 .

[221]  Thomas L. Griffiths,et al.  Probabilistic author-topic models for information discovery , 2004, KDD.

[222]  Jan Kleinnijenhuis,et al.  A Theory of Evaluative Discourse: Towards a Graph Theory of Journalistic Texts , 1986 .

[223]  Kathleen M. Carley,et al.  He says, she says. Pat says, Tricia says. How much reference resolution matters for entity extraction, relation extraction, and social network analysis , 2009, 2009 IEEE Symposium on Computational Intelligence for Security and Defense Applications.

[224]  Adam Zagorecki,et al.  Interorganizational Information Exchange and Efficiency: Organizational Performance in Emergency Environments , 2010, J. Artif. Soc. Soc. Simul..

[225]  Dan Roth,et al.  Probabilistic Reasoning for Entity & Relation Recognition , 2002, COLING.

[226]  Sergey Brin,et al.  Extracting Patterns and Relations from the World Wide Web , 1998, WebDB.

[227]  R. Hartley,et al.  Semantic networks: visualizations of knowledge , 1997, Trends in Cognitive Sciences.

[228]  S. Mohammed,et al.  Team Mental Model: Construct or Metaphor? , 1994 .

[229]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[230]  Eduard H. Hovy Parsimonious and Profligate Approaches to the Question of Discourse Structure Relations , 1990, INLG.

[231]  Robin Cowan,et al.  Heterogenous agents, interactions and economic performance , 2003 .

[232]  Norman P. Hummon,et al.  Connectivity in a citation network: The development of DNA theory☆ , 1989 .

[233]  Heinz Ulrich Hoppe,et al.  Combining social network analysis with semantic relations to support the evolution of a scientific community , 2007, CSCL.

[234]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[235]  W. Powers Report of Investigation by the Special Investigative Committee of the Board of Directors of Enron Co , 2002 .

[236]  Karlene H. Roberts,et al.  Some correlations of communication roles in organizations. , 1979 .

[237]  Mark E. J. Newman,et al.  Structure and Dynamics of Networks , 2009 .

[238]  Charles J. Fillmore,et al.  THE CASE FOR CASE. , 1967 .

[239]  Sunita Sarawagi,et al.  Domain Adaptation of Conditional Probability Models Via Feature Subsetting , 2007, PKDD.

[240]  Pascal Denis,et al.  Joint Determination of Anaphoricity and Coreference Resolution using Integer Programming , 2007, NAACL.

[241]  P. Bourdieu,et al.  Language and Symbolic Power , 1991 .

[242]  Kathleen M. Carley Extracting team mental models through textual analysis , 1997 .

[243]  S. Boorman,et al.  Social Structure from Multiple Networks. II. Role Structures , 1976, American Journal of Sociology.

[244]  A. Reiss, Co-Offending and Criminal Careers , 1988, Crime and Justice.

[245]  Suzanne Bakken,et al.  Description of a method to support public health information management: Organizational network analysis , 2007, J. Biomed. Informatics.

[246]  R. Merton Social Theory and Social Structure , 1958 .

[247]  J. Mitchell,et al.  Social Networks in Urban Situations: Analyses of Personal Relationships in Central African Towns. , 1970 .

[248]  J. Mohr Measuring Meaning Structures , 1998 .

[249]  J. Novak The Theory Underlying Concept Maps and How To Construct Them , 2004 .

[250]  B. Ryan The diffusion of hybrid seed corn in two Iowa communities , 1943 .

[251]  Ronald S. Burt,et al.  Positions in Networks , 1976 .

[252]  M. Mandel,et al.  Local Roles and Social Networks , 1983 .

[253]  Dan Roth,et al.  Design Challenges and Misconceptions in Named Entity Recognition , 2009, CoNLL.

[254]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[255]  Kathleen M. Carley,et al.  Extracting, Representing, and Analyzing Mental Models , 1992 .

[256]  Ronald S. Burt,et al.  Network Time Series from Archival Records , 1977 .

[257]  Candace L. Sidner,et al.  Towards a computational theory of definite anaphora comprehension in English discourse , 1979 .

[258]  James A. Danowski,et al.  CRISIS EFFECTS ON INTRAORGANIZATIONAL COMPUTER-BASED COMMUNICATION , 1985 .

[259]  Kathleen M. Carley,et al.  The Words of Warcraft: relational text analysis of quests in an MMORPG , 2009, DiGRA Conference.

[260]  Yannick Versley,et al.  BART: A Modular Toolkit for Coreference Resolution , 2008, ACL.

[261]  Richard Tobin,et al.  Datasets for generic relation extraction* , 2011, Natural Language Engineering.

[262]  William Richards,et al.  An Improved Conceptually-Based Method for Analysis of Communication Network Structure of Large Complex Organizations. , 1971 .

[263]  Kate Ehrlich,et al.  Searching for experts in the enterprise: combining text and social network analysis , 2007, GROUP.

[264]  Janyce Wiebe,et al.  Computing Attitude and Affect in Text: Theory and Applications , 2005, The Information Retrieval Series.

[265]  Samuel Leinhardt,et al.  Social Networks: A Developing Paradigm , 1977 .

[266]  Nancy J. Cooke,et al.  Measuring Situational Awareness through Analysis of Communications: A Preliminary Exercise , 2006 .

[267]  Caroline Haythornthwaite,et al.  Exploring Multiplexity: Social Network Structures in a Computer-Supported Distance Learning Class , 2001, Inf. Soc..

[268]  Detlef Schoder,et al.  Web Science 2.0: Identifying Trends through Semantic Social Network Analysis , 2008, 2009 International Conference on Computational Science and Engineering.

[269]  John J. Gumperz,et al.  Discourse strategies: Social network and language shift , 1982 .

[270]  James A. Anderson Associative networks , 1998 .

[271]  Kathleen M. Carley,et al.  Rapid modeling and analyzing networks extracted from pre-structured news articles , 2012, Comput. Math. Organ. Theory.

[272]  John Scott What is social network analysis , 2010 .

[273]  Chong Wang,et al.  Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.

[274]  James D. Herbsleb,et al.  Identification of coordination requirements: implications for the Design of collaboration and awareness tools , 2006, CSCW '06.

[275]  Philip A. Schrodt,et al.  The CAMEO (Conflict and Mediation Event Observations) Actor Coding Framework , 2008 .

[276]  Yannick Versley,et al.  SemEval-2010 Task 1: Coreference Resolution in Multiple Languages , 2009, *SEMEVAL.

[277]  Gerald D. Feldman,et al.  Networks of Nazi Persecution: Bureaucracy, Business and the Organization of the Holocaust , 2004 .

[278]  Andrew McCallum,et al.  Joint deduplication of multiple record types in relational data , 2005, CIKM '05.

[279]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[280]  Hugo Horta,et al.  Does competitive research funding encourage diversity in higher education , 2008 .

[281]  Richard M. Schwartz,et al.  Nymble: a High-Performance Learning Name-finder , 1997, ANLP.

[282]  Tom A. B. Snijders,et al.  Social Network Analysis , 2011, International Encyclopedia of Statistical Science.

[283]  Dragomir R. Radev,et al.  Book Review: Graph-Based Natural Language Processing and Information Retrieval by Rada Mihalcea and Dragomir Radev , 2011, CL.

[284]  Dan Roth,et al.  Understanding the Value of Features for Coreference Resolution , 2008, EMNLP.

[285]  Stephen C. Hayne,et al.  Visualization and Analysis of Social Networks of Research Funding , 2011, 2011 44th Hawaii International Conference on System Sciences.

[286]  Danyel Fisher,et al.  Visualizing the Signatures of Social Roles in Online Discussion Groups , 2007, J. Soc. Struct..

[287]  Michel Marcoccia On-line polylogues: conversation structure and participation framework in internet newsgroups , 2004 .

[288]  Joshua S. Goldstein A Conflict-Cooperation Scale for WEIS Events Data , 1992 .

[289]  Kathleen M. Carley,et al.  The interaction of size and density with graph-level indices , 1999, Soc. Networks.

[290]  Scott Miller,et al.  A Novel Use of Statistical Parsing to Extract Information from Text , 2000, ANLP.

[291]  Loren Fox,et al.  Enron: The Rise and Fall , 2002 .

[292]  Doug Downey,et al.  Web-scale information extraction in knowitall: (preliminary results) , 2004, WWW '04.

[293]  Noah E. Friedkin,et al.  The development of structure in random networks: an analysis of the effects of increasing network density on five measures of structure , 1981 .

[294]  Julia Melkers,et al.  Evaluating the Improved Research Capacity of EPSCoR States: R&D Funding and Collaborative Networks in the NSF EPSCoR Program , 2009 .

[295]  Richard M. Schwartz,et al.  An Algorithm that Learns What's in a Name , 1999, Machine Learning.

[296]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[297]  Mark A. Przybocki,et al.  The Automatic Content Extraction (ACE) Program – Tasks, Data, and Evaluation , 2004, LREC.

[298]  Kathleen M. Carley,et al.  AutoMap 1.2 : extract, analyze, represent, and compare mental models from texts , 2004 .

[299]  D. Bobrow,et al.  Representation and Understanding: Studies in Cognitive Science , 1975 .

[300]  Ronald E. Rice,et al.  The NEGOPY network analysis program , 1981 .