Text Map Explorer: a Tool to Create and Explore Document Maps

This paper presents a tool, called text map explorer, which can be used to create and explore document maps (visual representations of document collections). This tool is capable of grouping (and separating) documents by their contents, revealing to the user relationships amongst them. This paper also presents a novel multi-dimensional projection technique for text that reduces the quadratic time complexity of our previous approach to O(N3/2), keeping the same quality of maps. The technique creates a surface that reveals intrinsic patterns and supports various kinds of exploration of a text collection

[1]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[2]  James J. Thomas,et al.  Visualizing the non-visual: spatial analysis and interaction with information from text documents , 1995, Proceedings of Visualization 1995 Conference.

[3]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[4]  Ben Shneiderman,et al.  Readings in information visualization - using vision to think , 1999 .

[5]  Chaomei Chen,et al.  Visualizing knowledge domains , 2005, Annu. Rev. Inf. Sci. Technol..

[6]  Herbert Edelsbrunner,et al.  Geometry and Topology for Mesh Generation (Cambridge Monographs on Applied and Computational Mathematics) , 2006 .

[7]  Marc M. Sebrechts,et al.  Visualization of search results: a comparative evaluation of text, 2D, and 3D interfaces , 1999, SIGIR '99.

[8]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[9]  Lucy T. Nowell,et al.  ThemeRiver: Visualizing Thematic Changes in Large Document Collections , 2002, IEEE Trans. Vis. Comput. Graph..

[10]  Timo Honkela,et al.  WEBSOM - Self-organizing maps of document collections , 1998, Neurocomputing.

[11]  David S. Ebert,et al.  The shape of Shakespeare: visualizing text using implicit surfaces , 1998, Proceedings IEEE Symposium on Information Visualization (Cat. No.98TB100258).

[12]  Pak Chung Wong,et al.  TOPIC ISLANDS/sup TM/-a wavelet-based text visualization system , 1998 .

[13]  James Allan,et al.  Lighthouse: showing the way to relevant information , 2000, IEEE Symposium on Information Visualization 2000. INFOVIS 2000. Proceedings.

[14]  G Salton,et al.  Developments in Automatic Text Retrieval , 1991, Science.

[15]  Edgar R. Weippl Visualizing content based relations in texts , 2001, Proceedings Second Australasian User Interface Conference. AUIC 2001.

[16]  Mark Greaves,et al.  Visualizing text data sets , 1999, Comput. Sci. Eng..

[17]  James A. Wise,et al.  The Ecological Approach to Text Visualization , 1999, J. Am. Soc. Inf. Sci..

[18]  Rosane Minghim,et al.  Visual Mapping of Text Collections using an Approximation of Kolmogorov Complexity , 2005 .

[19]  Massimo Ruffolo,et al.  Managing the knowledge contained in electronic documents: a clustering method for text mining , 2001, 12th International Workshop on Database and Expert Systems Applications.

[20]  Vinicius Veloso de Melo,et al.  Mapping texts through dimensionality reduction and visualization techniques for interactive exploration of document collections , 2006, Electronic Imaging.

[21]  Herbert Edelsbrunner,et al.  Geometry and Topology for Mesh Generation , 2001, Cambridge monographs on applied and computational mathematics.

[22]  Herbert Edelsbrunner,et al.  Geometry and Topology for Mesh Generation , 2001, Cambridge monographs on applied and computational mathematics.

[23]  Rosane Minghim,et al.  Content-based text mapping using multi-dimensional projections for exploration of document collections , 2006, Electronic Imaging.

[24]  George Karypis,et al.  gCLUTO – An Interactive Clustering, Visualization, and Analysis System , 2004 .

[25]  Wolfgang Kienreich,et al.  Evaluating a System for Interactive Exploration of Large, Hierarchically Structured Document Repositories , 2004, IEEE Symposium on Information Visualization.

[26]  Stefan Rüger,et al.  Info Navigator: A visualization tool for document searching and browsing , 2003 .

[27]  Matthew Chalmers,et al.  A linear iteration time layout algorithm for visualising high-dimensional data , 1996, Proceedings of Seventh Annual IEEE Visualization '96.

[28]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[29]  Pat Langley,et al.  Models of Incremental Concept Formation , 1990, Artif. Intell..

[30]  Jonathan Goldstein,et al.  When Is ''Nearest Neighbor'' Meaningful? , 1999, ICDT.

[31]  Christos Faloutsos,et al.  FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets , 1995, SIGMOD '95.

[32]  Ricardo A. Baeza-Yates,et al.  Alternative implementation techniques for Web text visualization , 2003, Proceedings of the IEEE/LEOS 3rd International Conference on Numerical Simulation of Semiconductor Optoelectronic Devices (IEEE Cat. No.03EX726).

[33]  Rosane Minghim,et al.  On Improved Projection Techniques to Support Visual Exploration of Multi-Dimensional Data Sets , 2003, Inf. Vis..