Browsing through Closed Books: Fully Automatic Book Page Extraction from a 3-D X-Ray CT Volume

When digitizing or investigating historical documents, it is often the case that a document can not be opened, page-turned or touched anymore. Damages such as moisture or fire and aging processes disallow browsing through a book. To address these particular cases, our earlier work showed that Micro-CT X-ray scanners are able to image documents written with iron gall ink. A self-made book consisting of ten hand written pages was scanned and investigated without opening or page-turning. However, when analyzing the reconstruction results, we faced the problem of a proper automatic page segmentation and 2-D mapping within the volume in an acceptable time without losing information of the writings. The main problem is that the pages can be arbitrary deformed or squeezed together. In this paper, we present a fully automatic algorithm for the segmentation and extraction of book pages from the original 3-D volume. Our method delivers high quality results for our book model and can be easily adapted to other imaging modalities. We show that it performs well even for an extreme case with low resolution input data and wavy pages. To keep it simple for users, our algorithm works without any need of prior information or user interactions.

[1]  Giovanni Ramponi,et al.  Image enhancement via adaptive unsharp masking , 2000, IEEE Trans. Image Process..

[2]  Christian Riess,et al.  CONRAD--a software framework for cone-beam imaging in radiology. , 2013, Medical physics.

[3]  Rebecca Fahrig,et al.  GPU Denoising for Computed Tomography , 2018 .

[4]  Andreas K. Maier,et al.  Robust Vessel Segmentation in Fundus Images , 2013, Int. J. Biomed. Imaging.

[5]  A. Kak,et al.  Simultaneous Algebraic Reconstruction Technique (SART): A Superior Implementation of the Art Algorithm , 1984, Ultrasonic imaging.

[6]  Masatoshi Ishikawa,et al.  Real-time 3D page tracking and book status recognition for high-speed book digitization based on adaptive capturing , 2014, IEEE Winter Conference on Applications of Computer Vision.

[7]  Alejandro F. Frangi,et al.  Muliscale Vessel Enhancement Filtering , 1998, MICCAI.

[8]  Wolfgang Holub,et al.  3-D Reconstruction of Iron Gall Ink Writings , 2016 .

[9]  W. B. Seales,et al.  From damage to discovery via virtual unwrapping: Reading the scroll from En-Gedi , 2016, Science Advances.

[10]  Demetri Psaltis,et al.  From closed testaments to books: Virtual X-ray Reading as an alternate digitization technology for fragile documents , 2017 .

[11]  Frédéric Kaplan,et al.  Virtual reading of a large ancient handwritten science book , 2016 .

[12]  L. Feldkamp,et al.  Practical cone-beam algorithm , 1984 .

[13]  Vincent Christlein,et al.  Fast , Robust and Efficient Extraction of Book Pages from a 3-D X-ray CT Volume , 2017 .

[14]  Curtis R. Vogel,et al.  Ieee Transactions on Image Processing Fast, Robust Total Variation{based Reconstruction of Noisy, Blurred Images , 2022 .

[15]  J. Romberg,et al.  Terahertz time-gated spectral imaging for content extraction through layered structures , 2016, Nature Communications.

[16]  Burkhard Beckhoff,et al.  Characterization of iron‐gall inks in historical manuscripts and music compositions using x‐ray fluorescence spectrometry , 2004 .

[17]  Emmanuel Brun,et al.  Revealing letters in rolled Herculaneum papyri by X-ray phase-contrast imaging , 2015, Nature Communications.

[18]  Maier Andreas,et al.  3-D reconstruction of historical documents using an X-Ray C-Arm CT system , 2016 .