Working with scientific and technical papers on small screen devices, such as tablets and eBook readers, is difficult since these works are often typeset in multiple columns with a relatively small font size. On tablets, pan and zoom operations allow users to visualize the text in the desired size, however, tracing the text in multiple columns can be uneasy and not appropriate for studying and working with the scientific works. Moreover, these operations are slow on most e-ink eBook readers that have limited computation resources. Document reflow is in this case one option, but it is difficult to provide a satisfactory visualization of scientific and technical papers. In this paper, we describe one off-line tool for scientific document reflow that adopts document image processing techniques to generate one modified version of the original PDF organized as a single column text that can be easily visualized on eBook readers. Moreover, the tool allows the user to make free-form annotations on the modified paper using the tools of the eBook reader. These annotations are faithfully reproduced in the original two-column document.
[1]
George Nagy,et al.
HIERARCHICAL REPRESENTATION OF OPTICALLY SCANNED DOCUMENTS
,
1984
.
[2]
Zile Wei,et al.
Recognizing Freeform Digital Ink Annotations
,
2004,
Document Analysis Systems.
[3]
Francesca Cesarini,et al.
A general system for the retrieval of document images from digital libraries
,
2004,
First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..
[4]
David Bargeron,et al.
Reflowing digital ink annotations
,
2003,
CHI '03.
[5]
Laurent Denoue,et al.
Moving markup: repositioning freeform annotations
,
2002,
UIST '02.
[6]
Francesca Cesarini,et al.
Structured document segmentation and representation by the modified X-Y tree
,
1999,
Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).
[7]
Giovanni Soda,et al.
Conversion of PDF Books in ePub Format
,
2011,
2011 International Conference on Document Analysis and Recognition.