Tracking and modelling information diffusion across interactive online media

Information spreads rapidly across websites and other online media. The IDIOM research project analyses this process by identifying redundant content elements, mapping them to ontology concepts, and tracking their temporal and geographic distribution. Linguists define 'idiom' as an expression whose meaning is different from the literal meanings of its component words. Similarly, investigating information diffusion promises insights that cannot be inferred from individual network elements. Previous research often focused on particular media, or neglected important aspects of the human language. IDIOM addresses these gaps to reveal fundamental mechanisms of information diffusion across media with distinct interactive characteristics.

[1]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[2]  Mark Sanderson,et al.  A Study of User Interaction with a Concept-Based Interactive Query Expansion Support Tool , 2004, ECIR.

[3]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[4]  Jacob Goldenberg,et al.  Talk of the Network: A Complex Systems Look at the Underlying Process of Word-of-Mouth , 2001 .

[5]  Elizabeth Chang,et al.  Semi-Automatic Ontology Extension Using Spreading Activation , 2005 .

[6]  Jay F. Nunamaker,et al.  Automatic concept classification of text from electronic meetings , 1994, CACM.

[7]  Arno Scharl,et al.  Quantitive evaluation of Web site content and structure , 2000, Internet Res..

[8]  Atanas Kiryakov,et al.  Semantic annotation, indexing, and retrieval , 2004, J. Web Semant..

[9]  Des Greer,et al.  Evolutionary Web Development , 2001, Softw. Focus.

[10]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[11]  Óscar Corcho,et al.  Ontology based document annotation: trends and open research problems , 2006, Int. J. Metadata Semant. Ontologies.

[12]  Seth Godin,et al.  Unleashing the Idea Virus , 2001 .

[13]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[14]  Elena García Barriocanal,et al.  Integrating descriptions of knowledge management learning activities into large ontological structures: A case study , 2006, Data Knowl. Eng..

[15]  Duncan J. Watts,et al.  Six Degrees: The Science of a Connected Age , 2003 .

[16]  Tom M. Mitchell,et al.  Improving Text Classification by Shrinkage in a Hierarchy of Classes , 1998, ICML.

[17]  Jonathan W. Palmer,et al.  Web Site Usability, Design, and Performance Metrics , 2002, Inf. Syst. Res..

[18]  James A. Hendler,et al.  Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential , 2002 .

[19]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[20]  C. Haythornthwaite Social network analysis: An approach and technique for the study of information exchange☆ , 1996 .

[21]  James A. Hendler,et al.  A Tool for Working with Web Ontologies , 2005, Int. J. Semantic Web Inf. Syst..

[22]  Ramanathan V. Guha,et al.  TAP: a Semantic Web platform , 2003, Comput. Networks.

[23]  Ludovic Lebart,et al.  Exploring Textual Data , 1997 .

[24]  Salvador Sánchez-Alonso,et al.  Making use of upper ontologies to foster interoperability between SKOS concept schemes , 2006 .

[25]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[26]  Sai-Ping Li,et al.  A guided Monte Carlo approach to optimization problems , 2003 .

[27]  P. J. Stone Thematic text analysis: new agendas for analyzing text content , 1997 .

[28]  Stefan Th. Gries,et al.  What is Corpus Linguistics? , 2009, Lang. Linguistics Compass.

[29]  Carl Bedingfield Review of "Spinning the semantic web: Bringing the world wide web to its full potential" edited by Dieter Fensel, James Hendler, Henry Lieberman, and Wolfgang Wahlster, The MIT press , 2003, UBIQ.

[30]  Pernilla Danielsson Automatic extraction of meaningful units from corpora: A corpus-driven approach using the word stroke , 2003 .

[31]  Susan Conrad,et al.  Corpus Linguistics: Investigating Language Structure and Use , 1998 .

[32]  Setsuo Ohsuga,et al.  INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES , 1977 .

[33]  Paola Velardi,et al.  Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites , 2004, CL.

[34]  José Palazzo Moreira de Oliveira,et al.  Concept-based knowledge discovery in texts extracted from the Web , 2000, SKDD.

[35]  Arno Scharl,et al.  Web coverage of the 2004 US Presidential election , 2006 .

[36]  Gustavo Rossi,et al.  Measuring Web Application Quality with WebQEM , 2002, IEEE Multim..

[37]  Arno Scharl A Roadmap Towards Distributed Web Assessment , 2004, ICWE.

[38]  Asunción Gómez-Pérez,et al.  Six challenges for the Semantic Web , 2002, KR 2002.

[39]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[40]  Thorsten Liebig,et al.  OntoTrack: A semantic approach for ontology authoring , 2005, J. Web Semant..

[41]  Gobinda G. Chowdhury,et al.  Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential , 2004 .

[42]  J. Leon Zhao,et al.  Automatic discovery of similarity relationships through Web mining , 2003, Decis. Support Syst..

[43]  Lipika Dey,et al.  A feature selection technique for classificatory analysis , 2005, Pattern Recognit. Lett..

[44]  Hae-Chang Rim,et al.  Unsupervised word sense disambiguation using WordNet relatives , 2004, Comput. Speech Lang..

[45]  Steffen Staab,et al.  Learning Taxonomic Relations from Heterogeneous Evidence , 2004 .

[46]  Moses Charikar,et al.  Similarity estimation techniques from rounding algorithms , 2002, STOC '02.

[47]  Mark Klein,et al.  How Similar Is It? Towards Personalized Similarity Measures in Ontologies , 2005, Wirtschaftsinformatik.

[48]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[49]  Tharam S. Dillon,et al.  A semantic network-based design methodology for XML documents , 2002, TOIS.

[50]  Ian T. Foster,et al.  On Death, Taxes, and the Convergence of Peer-to-Peer and Grid Computing , 2003, IPTPS.

[51]  Dieter Fensel,et al.  A Conceptual Comparison of WSMO and OWL-S , 2004, ECOWS.

[52]  Mario Cannataro,et al.  The knowledge grid , 2003, CACM.

[53]  Ravi Kumar,et al.  Self-similarity in the web , 2001, TOIT.

[54]  S. Havlin,et al.  Self-similarity of complex networks , 2005, Nature.

[55]  Hsinchun Chen,et al.  CI Spider: a tool for competitive intelligence on the Web , 2002, Decis. Support Syst..

[56]  Arno Scharl,et al.  Determining the Semantic Orientation of Web-Based Corpora , 2003, IDEAL.

[57]  Stefan Decker,et al.  Creating Semantic Web Contents with Protégé-2000 , 2001, IEEE Intell. Syst..

[58]  Gerald Salton,et al.  Automatic text processing , 1988 .

[59]  Ramanathan V. Guha,et al.  A case for automated large-scale semantic annotation , 2003, J. Web Semant..

[60]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[61]  A. Parasuraman,et al.  Service quality delivery through web sites: A critical review of extant knowledge , 2002, Journal of the Academy of Marketing Science.

[62]  Richard L. Daft,et al.  Message Equivocality, Media Selection, and Manager Performance: Implications for Information Systems , 1987, MIS Q..