Matrix-like visualization based on topic modeling for discovering connections between disjoint disciplines

Interdisciplinary research is challenging because of the knowledge overspecialization problem, which makes it diffi- cult for researchers to discover connections between disjoint disciplines. Closed Literature-based discovery shows the potentials of solving this problem by using information retrieval and natural language processing techniques. However, it still faces some drawbacks, such as large amounts of manual works with prior knowledge, difficulty in understanding the discoveries, and limi- tation of the extension of the domain. In this paper, we propose a matrix-like visualization approach based on topic modeling for discovering connections between disjoint disciplines. With our approach, we expect interdisciplinary connections to be efficiently discovered by detecting the topics we call mixed topics, which contain literature from disjoint disciplines. For achieving this purpose, we visualize the document-phrase matrix generated by topic modeling and develop an original biclustering algorithm to extract mixed topics in the matrix. Experiment results show that our approach can help users without prior knowledge to detect connections between disciplines.

[1]  D. Swanson Fish Oil, Raynaud's Syndrome, and Undiscovered Public Knowledge , 2015, Perspectives in biology and medicine.

[2]  Weiguo Fan,et al.  Literature-based discovery on the World Wide Web , 2002, TOIT.

[3]  Susan T. Dumais,et al.  Using latent semantic indexing for literature based discovery , 1998 .

[4]  Padmini Srinivasan,et al.  Text mining: Generating hypotheses from MEDLINE , 2004, J. Assoc. Inf. Sci. Technol..

[5]  George M. Church,et al.  Biclustering of Expression Data , 2000, ISMB.

[6]  Michael D. Gordon,et al.  Literature-based discovery by lexical statistics , 1999 .

[7]  Marc Weeber,et al.  Using concepts in literature-based discovery: simulating Swanson's Raynaud-fish oil and migraine-magnesium discoveries , 2001 .

[8]  Don R. Swanson,et al.  Complementary structures in disjoint science literatures , 1991, SIGIR '91.

[9]  Erik M. van Mulligen,et al.  Constructing an associative concept space for literature-based discovery , 2004, J. Assoc. Inf. Sci. Technol..

[10]  Amit P. Sheth,et al.  Context-Driven Automatic Subgraph Creation for Literature-Based Discovery , 2015, J. Biomed. Informatics.

[11]  N R Smalheiser,et al.  Using ARROWSMITH: a computer-assisted approach to formulating and assessing scientific hypotheses. , 1998, Computer methods and programs in biomedicine.

[12]  Neil R. Smalheiser,et al.  Artificial Intelligence An interactive system for finding complementary literatures : a stimulus to scientific discovery , 1995 .

[13]  Meliha Yetisgen-Yildiz,et al.  Evaluation of Literature-Based Discovery Systems , 2008 .

[14]  Xiaoyong Liu,et al.  Literature-based knowledge discovery: the state of the art , 2012, ArXiv.

[15]  Wanda Pratt,et al.  H.3.3 Information Search and Retrieval , 2022 .

[16]  Michael D. Gordon,et al.  Toward Discovery Support Systems: A Replication, Re-Examination, and Extension of Swanson's Work on Literature-Based Discovery of a Connection between Raynaud's and Fish Oil , 1996, J. Am. Soc. Inf. Sci..

[17]  Ulrich Bodenhofer,et al.  FABIA: factor analysis for bicluster acquisition , 2010, Bioinform..