TransMiner: Mining Transitive Associations among Biological Objects from Text

Associations among biological objects such as genes, proteins, and drugs can be discovered automatically from the scientific literature. TransMiner is a system for finding associations among objects by mining the Medline database of the scientific literature. The direct associations among the objects are discovered based on the principle of co-occurrence in the form of an association graph. The principle of transitive closure is applied to the association graph to find potential transitive associations. The potential transitive associations that are indeed direct are discovered by iterative retrieval and mining of the Medline documents. Those associations that are not found explicitly in the entire Medline database are transitive associations and are the candidates for hypothesis generation. The transitive associations were ranked based on the sum of weight of terms that co-occur with both the objects. The direct and transitive associations are visualized using a graph visualization applet. TransMiner was tested by finding associations among 56 breast cancer genes and among 24 objects in the calpain signal transduction pathway. TransMiner was also used to rediscover associations between magnesium and migraine.

[1]  Anton Yuryev,et al.  Extracting human protein interactions from MEDLINE using a full-sentence parser , 2004, Bioinform..

[2]  Anthony W. Norman,et al.  Calcium and Calpain as Key Mediators of Apoptosis-like Death Induced by Vitamin D Compounds in Breast Cancer Cells* , 2002, The Journal of Biological Chemistry.

[3]  Neil R. Smalheiser,et al.  Artificial Intelligence An interactive system for finding complementary literatures : a stimulus to scientific discovery , 1995 .

[4]  Neil R Smalheiser Informatics and hypothesis‐driven research , 2002, EMBO reports.

[5]  D. Swanson Migraine and Magnesium: Eleven Neglected Connections , 2015, Perspectives in biology and medicine.

[6]  Wanda Pratt,et al.  H.3.3 Information Search and Retrieval , 2022 .

[7]  M. Topcuoglu,et al.  Efficacy of Intravenous Magnesium Sulfate in the Treatment of Acute Migraine Attacks , 2001, Headache.

[8]  T. Jenssen,et al.  A literature network of human genes for high-throughput analysis of gene expression , 2001, Nature Genetics.

[9]  G. Dubyak,et al.  Calcium is a key signaling molecule in beta-lapachone-mediated cell death. , 2001, The Journal of biological chemistry.

[10]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[11]  R. Cerione,et al.  Activated Cdc42 Sequesters c-Cbl and Prevents EGF Receptor Degradation , 2003, Cell.

[12]  D. Boothman,et al.  μ-Calpain Activation in β-Lapachone-Mediated Apoptosis , 2003 .

[13]  David L. Steffen,et al.  The Breast Cancer Gene Database: a collaborative information resource , 1999, Oncogene.

[14]  D. Boothman,et al.  Mu-calpain activation in beta-lapachone-mediated apoptosis. , 2003, Cancer biology & therapy.

[15]  D. Swanson Fish Oil, Raynaud's Syndrome, and Undiscovered Public Knowledge , 2015, Perspectives in biology and medicine.

[16]  Ralf Mrowka,et al.  A Java applet for visualizing protein-protein interaction , 2001, Bioinform..

[17]  R. DeBiasi,et al.  MEKK1 regulates calpain‐dependent proteolysis of focal adhesion proteins for rear‐end detachment of migrating fibroblasts , 2003, The EMBO journal.

[18]  G. Dubyak,et al.  Calcium Is a Key Signaling Molecule in β-Lapachone-mediated Cell Death* , 2001, The Journal of Biological Chemistry.

[19]  T. Honderich The Oxford Companion to Philosophy , 1995 .

[20]  Ah-Hwee Tan,et al.  Text Mining: The state of the art and the challenges , 2000 .

[21]  Michael Krauthammer,et al.  GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles , 2001, ISMB.

[22]  Saso Dzeroski,et al.  Supporting Discovery in Medicine by Association Rule Mining of Bibliographic Databases , 2000, PKDD.

[23]  B J Stapley,et al.  Biobibliometrics: information retrieval and visualization from co-occurrences of gene names in Medline abstracts. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[24]  Javed Mostafa,et al.  Detecting Gene Relations from MEDLINE Abstracts , 2000, Pacific Symposium on Biocomputing.

[25]  R. Bast,et al.  Reexpression of the tumor suppressor gene ARHI induces apoptosis in ovarian and breast cancer cells through a caspase-independent calpain-dependent pathway. , 2002, Cancer research.

[26]  Pak Chung Wong,et al.  Visualizing association rules for text mining , 1999, Proceedings 1999 IEEE Symposium on Information Visualization (InfoVis'99).

[27]  Stephen Warshall,et al.  A Theorem on Boolean Matrices , 1962, JACM.

[28]  T. Carver,et al.  Oxford Companion to Philosophy , 2005 .

[29]  D. Boothman,et al.  Activation of a cysteine protease in MCF-7 and T47D breast cancer cells during beta-lapachone-mediated apoptosis. , 2000, Experimental cell research.

[30]  Javed Mostafa,et al.  An intelligent biological information management system , 2002, SAC '02.