Using concept maps as a tool for cross-language relevance determination

Concept maps, introduced by Novak, aid learners’ understanding. I hypothesize that concept maps also can function as a summary of large documents, e.g., electronic theses and dissertations (ETDs). I have built a system that automatically generates concept maps from English-language ETDs in the computing field. The system also will provide Spanish translations of these concept maps for native Spanish speakers. Using machine translation techniques, my approach leads to concept maps that could allow researchers to discover pertinent dissertations in languages they cannot read, helping them to decide if they want a potentially relevant dissertation translated. I am using a state-of-the-art natural language processing system, called Relex, to extract noun phrases and noun-verb-noun relations from ETDs, and then produce concept maps automatically. I also have incorporated information from the table of contents of ETDs to create novel styles of concept maps. I have conducted five user studies, to evaluate user perceptions about these different map styles. I am using several methods to translate node and link text in concept maps from English to Spanish. Nodes labeled with single words from a given technical area can be translated using wordlists, but phrases in specific technical fields can be difficult to translate. Thus I have amassed a collection of about 580 Spanish-language ETDs from Scirus and two Mexican universities and I am using this corpus to mine phrase translations that I could not find otherwise. The usefulness of the automatically-generated and translated concept maps has been assessed in an experiment at Universidad de las Americas (UDLA) in Puebla, Mexico. This experiment demonstrated that concept maps can augment abstracts (translated using a standard machine translation package) in helping Spanish speaking users find ETDs of interest.* *This research work was funded in part by NSF through grants DUE-0121679 and IIS-0080748, a grant from IMLS, and the RDEC/ACE project.

[1]  Lisa P. Skaggs,et al.  Knowledge Maps and the Presentation of Related Information Domains , 1992 .

[2]  Makoto Nagao,et al.  A framework of a mechanical translation between Japanese and English by analogy principle , 1984 .

[3]  Donald F. Dansereau,et al.  Scripted Cooperation and Knowledge Map Supplements , 1992 .

[4]  David M. Kennedy,et al.  Use of concept mapping in the design of learning tools for interactive multimedia , 1997 .

[5]  Thomas C. Rindflesch,et al.  EDGAR: extraction of drugs, genes and relations from the biomedical literature. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[6]  William B. Langdon,et al.  BioRAT: extracting biological information from full-length papers , 2004, Bioinform..

[7]  Ruben Leon,et al.  A word stemming algorithm for the Spanish language , 2000, Proceedings Seventh International Symposium on String Processing and Information Retrieval. SPIRE 2000.

[8]  Marvin L. Thordsen,et al.  A Comparison of Two Tools for Cognitive Task Analysis: Concept Mapping and the Critical Decision Method , 1991 .

[9]  Emden R. Gansner,et al.  An open graph visualization system and its applications to software engineering , 2000 .

[10]  Brian Moon,et al.  Multiple Source Information Analysis, GIS and Starlight , 1999, ACM-GIS.

[11]  Hans-Michael Müller,et al.  Textpresso: An Ontology-Based Information Retrieval and Extraction System for Biological Literature , 2004, PLoS biology.

[12]  Hsinchun Chen,et al.  Using sentence-selection heuristics to rank text segments in TXTRACTOR , 2002, JCDL '02.

[13]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[14]  Patrick Pantel,et al.  Discovery of inference rules for question-answering , 2001, Natural Language Engineering.

[15]  Jean-Michel Renders,et al.  A Geometric View on Bilingual Lexicon Extraction from Comparable Corpora , 2004, ACL.

[16]  Brian R. Gaines,et al.  Using Knowledge Acquisition and Representation Tools to Support Scientific Communities , 1994, AAAI.

[17]  Pak Chung Wong,et al.  Visualizing association rules for text mining , 1999, Proceedings 1999 IEEE Symposium on Information Visualization (InfoVis'99).

[18]  J. Malone,et al.  The Concept Map as an Aid to Instruction in Science and Mathematics , 1984 .

[19]  Donald F. Dansereau,et al.  Enhancing the recall of text: Knowledge mapping training promotes implicit transfer. , 1998 .

[20]  Thomas W. Shiland Probing for Understanding. , 2002 .

[21]  Marvin Willerman,et al.  The concept map as an advance organizer. , 1991 .

[22]  Hiroyuki Kaji,et al.  Unsupervised word sense disambiguation using bilingual comparable corpora , 2002, COLING 2002.

[23]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[24]  Jan W. Buzydlowski,et al.  Term Co-occurrence Analysis as an Interface for Digital Libraries , 2002, Visual Interfaces to Digital Libraries.

[25]  Michael D. McNeese,et al.  An Advanced Knowledge and Design Acquisition Methodology: Application for the Pilot's Associate , 1990 .

[26]  Christopher G Chute,et al.  National Center for Biomedical Ontology: advancing biomedicine through structured organization of scientific knowledge. , 2006, Omics : a journal of integrative biology.

[27]  Hiroyuki Kaji,et al.  Unsupervised Word-Sense Disambiguation Using Bilingual Comparable Corpora , 2002, IEICE Trans. Inf. Syst..

[28]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[29]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[30]  George D. Gopen,et al.  The Science of Scientific Writing , 1990 .

[31]  R. Shavelson,et al.  Problems and Issues in the Use of Concept Maps in Science Assessment. , 1996 .

[32]  Michael Loren Mauldin,et al.  Information retrieval by text skimming , 1989 .

[33]  Gloria Gomez,et al.  CmapTools: A Knowledge Modeling and Sharing Environment , 2004 .

[34]  G. O. Esiobu,et al.  Effects of concept and vee mappings under three learning modes on students' cognitive achievement in ecology and genetics , 1995 .

[35]  A. Cañas,et al.  Concept Maps: a Theoretical Note on Concepts and the Need for Cyclic Concept Maps , 2003 .

[36]  Tomek Strzalkowski,et al.  A Robust Practical Text Summarization , 1998 .

[37]  J. Mintzes,et al.  Assessing science understanding : a human constructivist view , 2005 .

[38]  Satoshi Sato,et al.  Toward Memory-based Translation , 1990, COLING.

[39]  Gordon W. Paynter,et al.  Human evaluation of Kea, an automatic keyphrasing system , 2001, JCDL '01.

[40]  D. Wiegmann,et al.  Effects of knowledge map characteristics on information processing , 1992 .

[41]  Joseph D. Novak,et al.  Learning creating and using knowledge: Concept maps as facilitative tools , 1998 .

[42]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[43]  Fatiha Sadat,et al.  An Approach Based on Multilingual Thesauri and Model Combination for Bilingual Lexicon Extraction , 2002, COLING.

[44]  J. Mintzes,et al.  The concept map as a research tool: Exploring conceptual change in biology , 1990 .

[45]  Michael Krauthammer,et al.  GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles , 2001, ISMB.

[46]  E. Yalow On Educational psychology: A cognitive view. , 1979 .

[47]  Christian Borgelt,et al.  EFFICIENT IMPLEMENTATIONS OF APRIORI AND ECLAT , 2003 .

[48]  Kosta Tsipis,et al.  And another thing… , 2003 .

[49]  Henry G. Small,et al.  Mapping the dynamics of science and technology , 1988, Scientometrics.

[50]  Min-Yen Kan,et al.  Applying Natural Language Generation to Indicative Summarization , 2001, EWNLG@ACL.

[51]  John A. Barnden,et al.  Semantic Networks , 1998, Encyclopedia of Social Network Analysis and Mining.

[52]  Burr Settles,et al.  ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text , 2005 .

[53]  Pascale Fung,et al.  A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora , 1995, ACL.

[54]  Michael White,et al.  Learning Domain-Specific Transfer Rules: An Experiment with Korean to English Translation , 2002, COLING 2002.

[55]  E. Schlesinger,et al.  The science of scientific writing , 1979, Proceedings of the IEEE.

[56]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[57]  Dekai Wu Grammarless extraction of phrasal translation examples from parallel texts , 1995 .

[58]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[59]  Ah-Hwee Tan,et al.  Knowledge discovery from texts: a concept frame graph approach , 2002, CIKM '02.

[60]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[61]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[62]  Julio Gonzalo,et al.  Noun phrases as building blocks for cross-language Search Assistance , 2005, Inf. Process. Manag..

[63]  Xiaoru Yuan,et al.  INSPIRE: an interactive image assisted non-photorealistic rendering system , 2003, 11th Pacific Conference onComputer Graphics and Applications, 2003. Proceedings..

[64]  Stephen J. Garber,et al.  Birds of a Feather? How Politics and Culture Affected the Designs of the U.S. Space Shuttle and the Soviet Buran , 2002 .

[65]  Emden R. Gansner,et al.  An open graph visualization system and its applications to software engineering , 2000, Softw. Pract. Exp..

[66]  David D. Redell,et al.  An evaluation of the ninth SOSP submissions or how (and how not) to write a good systems paper , 1983, ACM SIGOPS Oper. Syst. Rev..

[67]  Takako Aikawa,et al.  English-Japanese Example-Based Machine Translation Using Abstract Linguistic Representations , 2002, COLING-02 on Machine translation in Asia -.

[68]  Donald F. Dansereau,et al.  Learning from lectures: Effects of knowledge maps and cooperative review strategies , 1993 .

[69]  A. Wierzbicka Semantics: Primes and Universals , 1996 .

[70]  C A Nelson,et al.  Learning to Learn , 2017, Encyclopedia of Machine Learning and Data Mining.

[71]  Fredric C. Gey,et al.  Cross language information retrieval: a research roadmap , 2002, SIGF.

[72]  George Hripcsak,et al.  Natural language processing in an operational clinical information system , 1995, Natural Language Engineering.

[73]  Alexander Nakhimovsky,et al.  LEXICAL FUNCTIONS AND LANGUAGE LEARNING , 1979 .

[74]  Wessel Kraaij,et al.  Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval , 2003, CL.

[75]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[76]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[77]  Philipp Koehn,et al.  Learning a Translation Lexicon from Monolingual Corpora , 2002, ACL 2002.

[78]  Xiaodong Liu,et al.  The chemistry of Chinese language , 2002 .

[79]  Edward A. Fox,et al.  Enhancing Concept Mapping Tools Below and Above to Facilitate the Use of Superimposed Information , 2006 .

[80]  Edward A. Fox,et al.  Convergence of knowledge management and e-learning: the GetSmart experience , 2003, 2003 Joint Conference on Digital Libraries, 2003. Proceedings..

[81]  Ryen W. White,et al.  Concept maps to support oral history search and use , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).

[82]  Alberto J. Cañas,et al.  Using WordNet for Word Sense Disambiguation to Support Concept Map Construction , 2003, SPIRE.

[83]  L. F. Rau,et al.  Extracting company names from text , 1991, [1991] Proceedings. The Seventh IEEE Conference on Artificial Intelligence Application.

[84]  E. T. Hallberg,et al.  Implications of Counselor Conceptualizations for Counselor Education. , 1990 .

[85]  J. Bradshaw,et al.  Institute for Human and Machine Cognition , 2022 .

[86]  Leonard J. Seligman,et al.  Rapper: a wrapper generator with linguistic knowledge , 1999, WIDM '99.

[87]  Melius Weideman,et al.  Concept mapping vs. web page hyperlinks as an information retrieval interface: preferences of postgraduate culturally diverse learners , 2003 .

[88]  Christian Borgelt Recursion Pruning for the Apriori Algorithm , 2004, FIMI.

[89]  Yi-fang Brook Wu,et al.  Domain-specific keyphrase extraction , 2005, CIKM '05.

[90]  José Palazzo Moreira de Oliveira,et al.  Concept-based knowledge discovery in texts extracted from the Web , 2000, SKDD.

[91]  Richard T. Gill,et al.  Conceptual Graph Analysis: Knowledge Acquisition for Instructional System Design , 1993 .

[92]  Karen Sparck Jones Automatic keyword classification for information retrieval , 1971 .

[93]  Richard May,et al.  The STARLIGHT information visualization system , 1997, Proceedings. 1997 IEEE Conference on Information Visualization (Cat. No.97TB100165).

[94]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[95]  Richard E. Mayer,et al.  The effects of graphic organizers giving cues to the structure of a hypertext document on users' navigation strategies and performance , 2002, Int. J. Hum. Comput. Stud..

[96]  D. Dansereau,et al.  Bilingual Knowledge Maps (BiK-Maps) in Second-Language Vocabulary Learning , 2001 .

[97]  Donald F. Dansereau,et al.  Effects of scripted cooperation and knowledge maps on the processing of technical material , 1989 .

[98]  Wolff‐Michael Roth,et al.  The concept map as a tool for the collaborative construction of knowledge: A microanalysis of high school physics students , 1993 .

[99]  Massih-Reza Amini,et al.  The use of unlabeled data to improve supervised learning for text summarization , 2002, SIGIR '02.

[100]  Lei Zhang,et al.  Knowledge graph theory and structural parsing , 2002 .

[101]  Daniel Dominic Sleator,et al.  Parsing English with a Link Grammar , 1995, IWPT.

[102]  James W. Cooper Visualization of relational text information for biomedical knowledge discovery , 2003 .