Science and Technology Text Mining: Electrochemical Power

Abstract : Database Tomography (DT) is a textual database analysis system consisting of two major components: (1) algorithms for extracting multi-word phrase frequencies and phrase proximities (physical closeness of the multi-word technical phrases) from any type of large textual database, to augment (2) the interpretative capabilities of the expert human analyst. DT was used to derive technical intelligence from an electrochemical power database derived from the Science Citation Index (SCI). Phrase frequency analysis by technical domain experts provided the pervasive technical themes of the electrochemical power database, and phrase proximity analysis provided the relationships among the pervasive technical themes. Bibliometric analysis of the electrochemical power literature supplemented the DT results with author, journal, and institution publication and citation data. Military requirements for energy and power were reviewed, especially for electrochemical sources and converters. Appendixes provide clustering methodologies, an author factor matrix, an author clustering dendogram, a country co-publishing matrix, a keywords nonstatistics taxonomy, an abstract manual taxonomy, an abstract phrase factor matrix, and an abstract phrase clustering dendogram. (11 tables, 11 figures, 16 refs.)