Review of automatic document formatting

We review the literature on automatic document formatting with an emphasis on recent work in the field. One common way to frame document formatting is as a constrained optimization problem where decision variables encode element placement, constraints enforce required geometric relationships, and the objective function measures layout quality. We present existing research using this framework, describing the kind of optimization problem being solved and the basic optimization techniques used to solve it. Our review focuses on the formatting of primarily textual documents, including both micro- and macro-typographic concerns. We also cover techniques for automatic table layout. Related problems such as widget and diagram layout, as well as temporal layout issues that arise in multimedia documents are outside the scope of this review.

[1]  Hsin-Kuo Kan A computerized template-driven news-layout system for newspapers. , 1977 .

[2]  John DeTreville An analytical approach to computerized news layout for newspapers , 1978 .

[3]  Donald E. Knuth,et al.  Breaking paragraphs into lines , 1981, Softw. Pract. Exp..

[4]  Michael Frederick Plass,et al.  Optimal pagination techniques for automatic typesetting systems , 1981 .

[5]  Richard Furuta,et al.  Document Formatting Systems: Survey, Concepts, and Issues , 1982, CSUR.

[6]  Donald E. Knuth,et al.  The TeXbook , 1984 .

[7]  Richard John Beach Setting tables and illustrations with style , 1985 .

[8]  Lawrence L. Larmore,et al.  The least weight subsequence problem , 1987, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[9]  C. Bigelow,et al.  Digital typography , 1987 .

[10]  John S. Gourlay,et al.  Optimal line breaking in music , 1988 .

[11]  Steven K. Feiner,et al.  A grid-based approach to automating display layout , 1998 .

[12]  Håkon Wium Lie The electronic broadsheet : all the news that fits the display , 1991 .

[13]  Richard Furuta Important Papers in the History of Document Preparation Systems: Basic Sources , 1992, Electron. Publ..

[14]  David Eppstein,et al.  Sparse dynamic programming II: convex and concave cost functions , 1992, JACM.

[15]  Frank Mittelbach,et al.  The Pursuit of Quality How can automated typesetting achieve the highest standards of craft typography , 1992 .

[16]  Louis Weitzman,et al.  Relational grammars for interactive design , 1993, Proceedings 1993 IEEE Symposium on Visual Languages.

[17]  Hermann Zapf About Micro-Typography and the hz-program , 1993, Electron. Publ..

[18]  Klaus Reichenberger,et al.  Automatic layout based on formal semantics , 1994, AVI '94.

[19]  Louis Weitzman,et al.  Automatic presentation of multimedia documents using relational grammars , 1994, MULTIMEDIA '94.

[20]  Anne Brüggemann-Klein,et al.  Pagination reconsidered , 1995 .

[21]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[22]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[23]  Derick Wood,et al.  Tabular Formatting Problems , 1996, PODP.

[24]  Winfried Graf,et al.  The Constraint-Based Layout Framework LayLab and Its Applications , 1996 .

[25]  Joe Marks,et al.  Automatic Yellow-Pages pagination and layout , 1997, J. Heuristics.

[26]  Alan Borning,et al.  Constraints for the web , 1997, MULTIMEDIA '97.

[27]  Peter Karow Two Decades of Typographic Research at URW: A Rerospective , 1998, EP.

[28]  Nabil Layaïda,et al.  Madeus, and authoring environment for interactive multimedia documents , 1998, MULTIMEDIA '98.

[29]  Kenton O'Hara,et al.  Supporting memory for spatial location while reading from small displays , 1999, CHI EA '99.

[30]  Lars Erik Holmquist,et al.  WEST: a Web browser for small terminals , 1999, UIST '99.

[31]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[32]  Richard J. Anderson,et al.  The table layout problem , 1999, SCG '99.

[33]  Peter J. Stuckey,et al.  Constraint cascading style sheets for the Web , 1999, UIST '99.

[34]  Alan Borning,et al.  Constraint-based document layout for the Web , 2000, Multimedia Systems.

[35]  Frank Mittelbach Formatting documents with floats A new algorithm for L A T E X2 , 2000 .

[36]  F. Mittelbach Formatting documents with floats; A new algorithm , 2000 .

[37]  John A. Bateman,et al.  Towards Constructive Text, Diagram, and Layout Generation for Information Presentation , 2001, Computational Linguistics.

[38]  Alan Borning,et al.  A constraint extension to scalable vector graphics , 2001, WWW '01.

[39]  Eldan Goldenberg Automatic layout of variable-content print data , 2002 .

[40]  Kim Marriott,et al.  Fast and efficient client-side adaptivity for SVG , 2002, WWW '02.

[41]  Wei-Ying Ma,et al.  Detecting web page structure for adaptive viewing on small form factor devices , 2003, WWW '03.

[42]  Barry O'Sullivan,et al.  Creating personalized documents: an optimization approach , 2003, DocEng '03.

[43]  Han-Wen Nienhuys,et al.  LILYPOND, A SYSTEM FOR AUTOMATED MUSIC ENGRAVING , 2003 .

[44]  David Salesin,et al.  Adaptive grid-based document layout , 2003, ACM Trans. Graph..

[45]  David Salesin,et al.  Adaptive Document Layout via Manifold Content , 2003 .

[46]  Kim Marriott,et al.  Cobweb: A Constraint-Based WEB Browser , 2003, ACSC.

[47]  Jun Kong,et al.  Authoring multimedia documents through grammatical specifications , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[48]  P. Roetling,et al.  Aesthetic measures for automated document layout , 2004, DocEng '04.

[49]  Bongshin Lee,et al.  Fishnet, a fisheye web browser with search term popouts: a comparative evaluation with overview and linear view , 2004, AVI.

[50]  Nicholas Beaumont,et al.  Fitting a Table to a Page using nonlinear Optimization , 2004, Asia Pac. J. Oper. Res..

[51]  Xing Xie,et al.  Efficient browsing of Web search results on mobile devices based on block importance model , 2005, Third IEEE International Conference on Pervasive Computing and Communications.

[52]  Patrick Baudisch,et al.  Summary thumbnails: readable overviews for small screen web browsers , 2005, CHI.

[53]  Francisco Azevedo,et al.  Constrained XSL formatting objects for adaptive documents , 2005, DocEng '05.

[54]  Kim Marriott,et al.  Toward tighter tables , 2005, DocEng '05.

[55]  Shumeet Baluja,et al.  Browsing on small screens: recasting web-page segmentation into an efficient machine learning framework , 2006, WWW '06.

[56]  Kim Marriott,et al.  Minimum sized text containment shapes , 2006, DocEng '06.

[57]  Kim Marriott,et al.  Solving the simple continuous table layout problem , 2006, DocEng '06.

[58]  Xiaofan Lin,et al.  Predictive Text Fitting , 2006, Smart Graphics.

[59]  Virpi Roto,et al.  Minimap: a web page visualization method for mobile phones , 2006, CHI.

[60]  Owen Rees,et al.  Resolving layout interdependency with presentational variables , 2006, DocEng '06.

[61]  Owen Rees,et al.  Extensible layout in functional documents , 2006, Electronic Imaging.

[62]  Christof Lutteroth,et al.  User interface layout with ordinal and linear constraints , 2006, AUIC.

[63]  Xiaofan Lin,et al.  Active layout engine: Algorithms and applications in variable data printing , 2006, Comput. Aided Des..

[64]  Kim Marriott,et al.  Automatic float placement in multi-column documents , 2007, DocEng '07.

[65]  Ian Jacobs,et al.  Cascading Style Sheets, level 2 CSS2 Specification , 2008 .

[66]  Kim Marriott,et al.  Satisficing scrolls: a shortcut to satisfactory layout , 2008, ACM Symposium on Document Engineering.

[67]  Angelo Di Iorio,et al.  Higher-level layout through topological abstraction , 2008, ACM Symposium on Document Engineering.

[68]  João Batista S. de Oliveira,et al.  Two algorithms for automatic document page layout , 2008, ACM Symposium on Document Engineering.

[69]  David Salesin,et al.  Adaptive layout for dynamically aggregated documents , 2008, IUI '08.

[70]  Kim Marriott,et al.  MADRID ! Track : User Interfaces and Mobile Web / Session : User Interfaces 831 , 2009 .