Fast algorithms for online construction of web tag clouds

Abstract This paper studies the construction of tag clouds for presentation on the web. Constructing a tag cloud requires solving at least three interdisciplinary engineering problems at once: modeling and controlling graphical aesthetics, solving a discrete two-dimensional layout optimization problem, and doing both on a computationally constrained browser platform. We analyze the design choices made in earlier tag cloud studies and provide a taxonomy of algorithmic approaches to tag cloud building. We then define the design requirements for tag clouds on websites. We propose to quantify tag cloud aesthetics with a novel objective function based on the rules of typography. Tag cloud construction is formalized as a combinatorial optimization problem with an irregular objective function. A set of algorithms is proposed and evaluated on a collection of tag sets from popular web pages, and the methods that meet the constraints of the browser platform are selected.
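To make the layout problem concrete, the following minimal sketch treats each tag as a rectangle and places tags on horizontal shelves (lines) of a fixed-width cloud. It is not the algorithm or the objective function proposed in the paper: the greedy next-fit shelf rule is a generic packing heuristic, and the squared trailing-whitespace penalty is only an illustrative stand-in for a typography-based aesthetics measure. The names `Tag`, `shelf_layout`, `raggedness`, and `cloud_height`, the pixel dimensions, and the `gap` parameter are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Tag:
    text: str
    width: float   # assumed rendered width in pixels (hypothetical input)
    height: float  # assumed rendered height in pixels (hypothetical input)


def shelf_layout(tags: List[Tag], cloud_width: float, gap: float = 4.0) -> List[List[Tag]]:
    """Greedy next-fit shelf packing: keep adding tags to the current
    shelf and open a new shelf when the next tag would overflow.
    A tag wider than the cloud still gets a shelf of its own."""
    shelves: List[List[Tag]] = []
    current: List[Tag] = []
    used = 0.0
    for tag in tags:
        needed = tag.width if not current else used + gap + tag.width
        if current and needed > cloud_width:
            shelves.append(current)
            current, used = [tag], tag.width
        else:
            current.append(tag)
            used = needed
    if current:
        shelves.append(current)
    return shelves


def shelf_width(shelf: List[Tag], gap: float = 4.0) -> float:
    """Total width of a shelf, including gaps between neighbouring tags."""
    return sum(t.width for t in shelf) + gap * (len(shelf) - 1)


def raggedness(shelves: List[List[Tag]], cloud_width: float, gap: float = 4.0) -> float:
    """Illustrative penalty: sum of squared unused width per shelf."""
    return sum((cloud_width - shelf_width(s, gap)) ** 2 for s in shelves)


def cloud_height(shelves: List[List[Tag]], gap: float = 4.0) -> float:
    """Cloud height: each shelf is as tall as its tallest tag."""
    if not shelves:
        return 0.0
    return sum(max(t.height for t in s) for s in shelves) + gap * (len(shelves) - 1)


if __name__ == "__main__":
    demo = [Tag("web", 40, 10), Tag("cloud", 50, 20), Tag("tag", 30, 12), Tag("layout", 60, 12)]
    rows = shelf_layout(demo, cloud_width=100)
    print([[t.text for t in row] for row in rows])
    print(raggedness(rows, 100), cloud_height(rows))
```

A heuristic of this kind runs in time linear in the number of tags, which is the sort of cost that suits a browser. An optimization method would instead search over tag orders or line breaks to reduce the penalty.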
