Information discovery from complementary literatures: categorizing viruses as potential weapons

Using novel informatics techniques to process the output of Medline searches, we have generated a list of viruses that may have the potential for development as weapons. Our findings are intended as a guide to the virus literature, to support further studies that might in turn lead to appropriate defense and public health measures. This article emphasizes the methods, which are relevant to information science more generally. Initial Medline searches identified two kinds of virus literature: one concerning the genetic aspects of virulence, the other concerning the transmission of viral diseases. Taken together, the two literatures are of central importance in identifying research relevant to the development of biological weapons, yet they had very few articles in common. We downloaded the Medline records for each literature and used a computer to extract all virus terms common to both. The resulting list includes most of an earlier, independently published list of viruses that military experts consider to pose the highest threat as potential biological weapons, and this overlap served as a test of the method. The test outcome was highly statistically significant, supporting the inference that the new viruses on the list share certain important characteristics with viruses of known biological warfare interest.
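
The two computational steps described above (extracting the terms common to both literatures, then testing the overlap with an independent list) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the substring matching, the placeholder terms and records, and the choice of a one-sided hypergeometric tail probability as the significance test are all assumptions made here, since the abstract does not specify them.

```python
from math import comb


def shared_terms(records_a, records_b, vocabulary):
    """Return the vocabulary terms found in at least one record of each literature.

    Matching is a plain case-insensitive substring test (an assumption;
    a real pipeline would need proper term normalization).
    """
    def terms_in(records):
        found = set()
        for text in records:
            lowered = text.lower()
            found.update(term for term in vocabulary if term in lowered)
        return found

    return terms_in(records_a) & terms_in(records_b)


def hypergeom_tail(population, marked, drawn, overlap):
    """P(X >= overlap) when `drawn` items are chosen at random from
    `population` items, `marked` of which belong to the reference list."""
    total = comb(population, drawn)
    upper = min(marked, drawn)
    hits = sum(
        comb(marked, k) * comb(population - marked, drawn - k)
        for k in range(overlap, upper + 1)
    )
    return hits / total


# Placeholder vocabulary and records, invented purely for illustration.
vocabulary = {"virus_a", "virus_b", "virus_c", "virus_d"}
virulence_records = ["Genetic basis of virulence in Virus_A", "Virus_B virulence genes"]
transmission_records = ["Transmission of virus_a", "Insect-borne transmission of virus_c"]

common = shared_terms(virulence_records, transmission_records, vocabulary)

# Hypothetical counts: 10 candidate terms, 3 on a reference list,
# 4 extracted by the method, 2 of those on the reference list.
p_value = hypergeom_tail(population=10, marked=3, drawn=4, overlap=2)
```

A small tail probability would indicate that the overlap between the extracted list and the reference list is unlikely to arise by chance, which is the kind of evidence the abstract reports. The counts used here are hypothetical and do not reproduce the study's data.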
