Measuring Temporal and Contextual Proximity: Big Text-Data Analytics in Concept Maps

Despite being important, time and context have yet to be formally incorporated into the process of visually representing the temporal and contextual proximity between keywords in a concept map. In response to the context and time challenges, this study improves automated conventional concept mapping by measuring the temporal and contextual distance between pairs of co-occurring concepts. After generating a conventional concept map, it is temporally and contextually augmented in this work by applying an unsupervised temporal trend detection algorithm and a novel measure of contextual proximity. This proposed approach is demonstrated and validated without loss of generality for a spectrum of information technologies, showing that the resulting assessments of temporal and contextual proximity are highly correlated with subjective assessments of experts. The contribution of this work is emphasized and magnified against the current growing attention to big data analytics in general and to big text-data analytics in particular.

[1]  Daniel J. Power,et al.  Data science: supporting decision-making , 2016, J. Decis. Syst..

[2]  Jan vom Brocke,et al.  Utilizing big data analytics for information systems research: challenges, promises and guidelines , 2016, Eur. J. Inf. Syst..

[3]  Nava Pliskin,et al.  Improving similarity measures of relatedness proximity: Toward augmented concept maps , 2015, J. Informetrics.

[4]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[5]  Rajkumar Buyya,et al.  Big Data computing and clouds: Trends and future directions , 2013, J. Parallel Distributed Comput..

[6]  Anita Greenhill,et al.  Technological Forecasting and Social Change Special Section: Creative prototyping , 2014 .

[7]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[8]  Casey G. Cegielski,et al.  Developing a Big Data-Enabled Transformation Model in Healthcare: A Practice Based View , 2014, ICIS.

[9]  Florian Stahl,et al.  Marketplaces for data: an initial survey , 2013, SGMD.

[10]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[11]  Wei Chen,et al.  Extracting hot spots of topics from time-stamped documents , 2011, Data Knowl. Eng..

[12]  Bettina Berendt,et al.  From bursty patterns to bursty facts: The effectiveness of temporal text mining for news , 2010, ECAI.

[13]  Lyle Ungar,et al.  Discovery of significant emerging trends , 2010, KDD.

[14]  Pei-Chun Lee,et al.  Mapping knowledge structure by keyword co-occurrence: a first look at journal papers in Technology Foresight , 2010, Scientometrics.

[15]  Ed C. M. Noyons,et al.  A unified approach to mapping and clustering of bibliometric networks , 2010, J. Informetrics.

[16]  Frank Vanclay,et al.  Technology Assessment in Social Context: The case for a new framework for assessing and shaping technological developments , 2010 .

[17]  James C. Wetherbe,et al.  An Empirical Comparison of Four Text Mining Methods , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[18]  Wei Chen,et al.  Extracting hot spots of basic and complex topics from time stamped documents , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[19]  Mike Thelwall,et al.  Introduction to Webometrics: Quantitative Web Research for the Social Sciences , 2009, Introduction to Webometrics.

[20]  Mike Thelwall,et al.  Quantitative comparisons of search engine results , 2008, J. Assoc. Inf. Sci. Technol..

[21]  Ronald N. Kostoff,et al.  Literature-Related Discovery (LRD): Introduction and background , 2008 .

[22]  Dustin Johnson,et al.  Assessment of India's Research Literature , 2007 .

[23]  Kuo Zhang,et al.  New event detection based on indexing-tree and named entity , 2007, SIGIR.

[24]  Anna Feldman Computational Linguistics: Models, Resources, Applications , 2006, Computational Linguistics.

[25]  Loet Leydesdorff,et al.  Measuring the meaning of words in contexts: An automated analysis of controversies about 'Monarch butterflies,' 'Frankenfoods,' and 'stem cells' , 2006, Scientometrics.

[26]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[27]  Chaomei Chen,et al.  CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature , 2006, J. Assoc. Inf. Sci. Technol..

[28]  Chaomei Chen,et al.  Tech Mining: Exploiting New Technologies for Competitive Advantage , 2005, Inf. Process. Manag..

[29]  ChengXiang Zhai,et al.  Discovering evolutionary theme patterns from text: an exploration of temporal text mining , 2005, KDD '05.

[30]  George Karypis,et al.  Power source roadmaps using bibliometrics and database tomography , 2005 .

[31]  Jean Pierre Courtial,et al.  A coword analysis of scientometrics , 1994, Scientometrics.

[32]  Jean Pierre Courtial,et al.  Co-word analysis as a tool for describing the network of interactions between basic and technological research: The case of polymer chemsitry , 1991, Scientometrics.

[33]  William M. Pottenger,et al.  Methodologies for Trend Detection in Textual Data Mining , 2005 .

[34]  Henk F. Moed,et al.  Handbook of Quantitative Science and Technology Research: The Use of Publication and Patent Statistics in Studies of S&T Systems , 2004 .

[35]  Satoshi Morinaga,et al.  Tracking dynamics of topic trends using a finite mixture model , 2004, KDD.

[36]  James Allan,et al.  Text classification and named entities for new event detection , 2004, SIGIR '04.

[37]  William M. Pottenger,et al.  A Survey of Emerging Trend Detection in Textual Data Mining , 2004 .

[38]  Cherie Courseault Trumbach,et al.  A Text Mining Framework Linking Technical Intelligence from Publication Databases to Strategic Technology Decisions , 2004 .

[39]  J. Srivastava,et al.  Mining Temporally Evolving Graphs , 2004 .

[40]  Frank Keller,et al.  Using the Web to Obtain Frequencies for Unseen Bigrams , 2003, CL.

[41]  Junshui Ma,et al.  Online novelty detection on temporal sequences , 2003, KDD '03.

[42]  Reinhard Rapp,et al.  The Computation of Word Associations: Comparing Syntagmatic and Paradigmatic Approaches , 2002, COLING.

[43]  Qiang Wang,et al.  Design and Evaluation of Multimedia to Teach Java and Object-Oriented Software Engineering * , 2002 .

[44]  Ana Gabriela Maguitman,et al.  Assessing Conceptual Similarity to Support Concept Mapping , 2002, FLAIRS Conference.

[45]  Lucy T. Nowell,et al.  ThemeRiver: Visualizing Thematic Changes in Large Document Collections , 2002, IEEE Trans. Vis. Comput. Graph..

[46]  William M. Pottenger,et al.  CIMEL: constructive, collaborative inquiry-based multimedia E-learning , 2001, Annual Conference on Innovation and Technology in Computer Science Education.

[47]  Michele Banko,et al.  Scaling to Very Very Large Corpora for Natural Language Disambiguation , 2001, ACL.

[48]  Ah-Hwee Tan,et al.  Topic Detection, Tracking, and Trend Analysis Using Self-Organizing Neural Networks , 2001, PAKDD.

[49]  Fredrik Halsius,et al.  Assessing technological opportunities and threats : an introduction to technology forecasting , 2001 .

[50]  Irene Wormell Informetrics and webometrics for measuring impact, visibility, and connectivity in science, politics, and business , 2001 .

[51]  William M. Pottenger,et al.  Detecting emerging concepts in textual data mining , 2001 .

[52]  Owen Thomas,et al.  Webometric analysis of departments of librarianship and information science , 2000, J. Inf. Sci..

[53]  Marko Grobelnik,et al.  Text mining as integration of several related research areas: report on KDD's workshop on text mining 2000 , 2000, SKDD.

[54]  Pak Chung Wong,et al.  Visualizing sequential patterns for text mining , 2000, IEEE Symposium on Information Visualization 2000. INFOVIS 2000. Proceedings.

[55]  Stanley Boykin,et al.  Machine learning of event segmentation for news on demand , 2000, CACM.

[56]  David Jensen,et al.  TimeMines: Constructing Timelines with Statistical Models of Word Usage , 2000, KDD 2000.

[57]  Qin He,et al.  Knowledge Discovery Through Co-Word Analysis , 1999, Libr. Trends.

[58]  Peter Ingwersen,et al.  The calculation of web impact factors , 1998, J. Documentation.

[59]  Ronald N. Kostoff,et al.  Database Tomography for Technical Intelligence: A Roadmap of the Near-Earth Space Science and Technology Literature , 1998, Inf. Process. Manag..

[60]  Peter Ingwersen,et al.  Informetric analyses on the world wide web: methodological approaches to 'webometrics' , 1997, J. Documentation.

[61]  Ramakrishnan Srikant,et al.  Discovering Trends in Text Databases , 1997, KDD.

[62]  Ronen Feldman,et al.  Pattern Based Browsing in Document Collections , 1997, PKDD.

[63]  Deborah Hix,et al.  Exploring search results with Envision , 1997, CHI Extended Abstracts.

[64]  Melina Alexa,et al.  Computer-assisted text analysis methodology in the social sciences , 1997 .

[65]  P Eric,et al.  Concept Mapping: a Graphical System for Understanding the Relationship Between Concepts , 1997 .

[66]  Yorick Wilks,et al.  Information Extraction as a Core Language Technology , 1997, SCIE.

[67]  E. Plotnick Concept Mapping: A Graphical System for Understanding the Relationship between Concepts. ERIC Digest. , 1997 .

[68]  Mark J. Dixon An Overview of Document Mining Technology , 1997 .

[69]  Alan L. Porter,et al.  Technology opportunities analysis , 1995 .

[70]  Thomas G. Whiston Forecasting for technologists and engineers: a practical guide for better decisions : Brian C. Twiss, (Peter Peregrines (on behalf of the Institution of Electrical Engineers), London, 1992) xv, 221 pp., [UK pound] 19.00, ISBN 0 86341 285 8 , 1994 .

[71]  F. Narin,et al.  Bibliometrics/Theory, Practice and Problems , 1994 .

[72]  B. C. Twiss,et al.  Forecasting for technologists and engineers: A practical guide for better decisions , 1992 .

[73]  M. Callon,et al.  Mapping the dynamics of science and technology : sociology of science in the real world , 1988 .

[74]  Gerald Salton,et al.  Automatic text processing , 1988 .

[75]  M. Callon,et al.  Mapping the Dynamics of Science and Technology , 1986 .

[76]  Joseph D. Novak,et al.  Learning How to Learn , 1984 .

[77]  Denise M. Rousseau,et al.  Assessment of Technology In Organizations: Closed versus Open Systems Approach , 1979 .

[78]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.