Bisociative Knowledge Discovery by Literature Outlier Detection

The aim of this chapter is to present the role of outliers in literature-based knowledge discovery that can be used to explore potential bisociative links between different domains of expertise. The proposed approach upgrades the RaJoLink method which provides a novel framework for effectively guiding the knowledge discovery from literature, based on the principle of rare terms from scientific articles. This chapter shows that outlier documents can be successfully used as means of detecting bridging terms that connect documents of two different literature sources. This linking process, known also as closed discovery, is incorporated as one of the steps of the RaJoLink methodology, and is performed by using publicly available topic ontology construction tool OntoGen. We chose scientific articles about autism as the application example with which we demonstrated the proposed approach.

[1]  S. Mednick The associative basis of the creative process. , 1962, Psychological review.

[2]  A. Koestler The Act of Creation , 1964 .

[3]  D. Swanson Undiscovered Public Knowledge , 1986 .

[4]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[5]  H. J. Arnold Introduction to the Practice of Statistics , 1990 .

[6]  E. Ziegel Introduction to the Practice of Statistics (2nd ed.) , 1994 .

[7]  D. Zabel,et al.  Diagnostic and Statistical Manual of Mental Disorders, 4th ed , 1995 .

[8]  N R Smalheiser,et al.  Using ARROWSMITH: a computer-assisted approach to formulating and assessing scientific hypotheses. , 1998, Computer methods and programs in biomedicine.

[9]  Michael D. Gordon,et al.  Literature-based discovery by lexical statistics , 1999 .

[10]  Marc Weeber,et al.  Using concepts in literature-based discovery: simulating Swanson's Raynaud-fish oil and migraine-magnesium discoveries , 2001 .

[11]  Marc Weeber,et al.  Using concepts in literature-based discovery: Simulating Swanson's Raynaud-fish oil and migraine-magnesium discoveries , 2001, J. Assoc. Inf. Sci. Technol..

[12]  Carol A. Bean,et al.  Relationships in the Organization of Knowledge , 2001, Information Science and Knowledge Management.

[13]  Betsy L. Humphreys,et al.  Relationships in Medical Subject Headings (MeSH) , 2001 .

[14]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[15]  Lakhmi C. Jain,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2004, Lecture Notes in Computer Science.

[16]  S. Fatemi,et al.  Levels of Bcl-2 and P53 Are Altered in Superior Frontal and Cerebellar Cortices of Autistic Subjects , 2003, Cellular and Molecular Neurobiology.

[17]  Padmini Srinivasan,et al.  Mining MEDLINE for implicit links between dietary substances and diseases , 2004, ISMB/ECCB.

[18]  Mark P. Mattson,et al.  NF-κB in the Survival and Plasticity of Neurons , 2005, Neurochemical Research.

[19]  M. Mattson NF-kappaB in the survival and plasticity of neurons. , 2005, Neurochemical research.

[20]  Joyce A. Mitchell,et al.  Using literature-based discovery to identify disease candidate genes , 2005, Int. J. Medical Informatics.

[21]  Lorenzo Magnani,et al.  Chance Discovery and the Disembodiment of Mind , 2005, KES.

[22]  Mercedes Blázquez,et al.  EU-IST Project IST-2003-506826 SEKT , 2005 .

[23]  Yukio Ohsawa,et al.  Chance Discovery: The Current States of Art , 2006, Chance Discoveries in Real World Decision Making.

[24]  D. Mladení,et al.  SEMI-AUTOMATIC DATA-DRIVEN ONTOLOGY CONSTRUCTION SYSTEM , 2006 .

[25]  Neil R. Smalheiser,et al.  Ranking indirect connections in literature-based discovery: The role of medical subject headings: Research Articles , 2006 .

[26]  Wanda Pratt,et al.  Using statistical and knowledge-based approaches for literature-based discovery , 2006, J. Biomed. Informatics.

[27]  Neil R. Smalheiser,et al.  Ranking indirect connections in literature-based discovery: The role of medical subject headings , 2006, J. Assoc. Inf. Sci. Technol..

[28]  Tanja Urbancic,et al.  Discovering Hidden Knowledge from Biomedical Literature , 2007, Informatica.

[29]  Marc Weeber,et al.  Drug Discovery as an Example of Literature-Based Discovery , 2007, Computational Discovery of Scientific Knowledge.

[30]  Tanja Urbancic,et al.  Literature Mining: Towards Better Understanding of Autism , 2007, AIME.

[31]  Saso Dzeroski,et al.  Computational Discovery of Scientific Knowledge , 2007, Computational Discovery of Scientific Knowledge.

[32]  Tanja Urbancic,et al.  Literature mining method RaJoLink for uncovering relations between biomedical concepts , 2009, J. Biomed. Informatics.

[33]  A. Sheikh,et al.  Cathepsin D and apoptosis related proteins are elevated in the brain of autistic subjects , 2010, Neuroscience.

[34]  Michael R. Berthold Bisociative Knowledge Discovery , 2011, IDA.

[35]  Nada Lavrac,et al.  Outlier Detection in Cross-Context Link Discovery for Creative Literature Mining , 2012, Comput. J..

[36]  Janet B W Williams,et al.  Diagnostic and Statistical Manual of Mental Disorders , 2013 .