Vectorization of Historical Maps Using Deep Edge Filtering and Closed Shape Extraction

Maps have been a unique source of knowledge for centuries. Such historical documents provide invaluable information for analyzing the complex spatial transformation of landscapes over important time frames. This is particularly true for urban areas that encompass multiple interleaved research domains (social sciences, economy, etc.). The large amount and significant diversity of map sources call for automatic image processing techniques in order to extract the relevant objects under a vectorial shape. The complexity of maps (text, noise, digitization artifacts, etc.) has hindered the capacity of proposing a versatile and efficient raster-to-vector approaches for decades. We propose a learnable, reproducible, and reusable solution for the automatic transformation of raster maps into vector objects (building blocks, streets, rivers). It is built upon the complementary strength of mathematical morphology and convolutional neural networks through efficient edge filtering. Evenmore, we modify ConnNet and combine with deep edge filtering architecture to make use of pixel connectivity information and built an end-to-end system without requiring any post-processing techniques. In this paper, we focus on the comprehensive benchmark on various architectures on multiple datasets coupled with a novel vectorization step. Our experimental results on a new public dataset using COCO Panoptic metric exhibit very encouraging results confirmed by a qualitative analysis of the success and failure cases of our approach. Code, dataset, results and extra illustrations are freely available at https://github.com/soduco/ICDAR-2021-Vectorization.

[1]  Kai Chen,et al.  Hybrid Task Cascade for Instance Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Song-Chun Zhu,et al.  What are Textons? , 2005, Int. J. Comput. Vis..

[3]  Allan Hanbury,et al.  Morphological segmentation on learned boundaries , 2009, Image Vis. Comput..

[4]  Craig A. Knoblock,et al.  Efficient and Robust Graphics Recognition from Historical Maps , 2011, GREC.

[5]  Min Bai,et al.  Deep Watershed Transform for Instance Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Fabian Feitsch,et al.  Polygon consensus: smart crowdsourcing for extracting building footprints from historical maps , 2016, SIGSPATIAL/GIS.

[7]  Gilles Bertrand,et al.  Watershed Cuts: Thinnings, Shortest Path Forests, and Topological Watersheds , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Craig A. Knoblock,et al.  Historical Map Applications and Processing Technologies , 2020 .

[9]  Charless C. Fowlkes,et al.  Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Robert Weibel,et al.  Saliency and semantic processing: Extracting forest cover from historical topographic maps , 2006, Pattern Recognit..

[11]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[12]  R'emi Petitpierre Neural networks for semantic segmentation of historical city maps: Cross-cultural performance and the impact of figurative diversity , 2021, ArXiv.

[13]  Jos B. T. M. Roerdink,et al.  The Watershed Transform: Definitions, Algorithms and Parallelization Strategies , 2000, Fundam. Informaticae.

[14]  Song-Chun Zhu,et al.  What are Textons? , 2005, International Journal of Computer Vision.

[15]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Marc Barthelemy,et al.  Roads and cities of 18th century France , 2015, Scientific Data.

[17]  Karl Tombre,et al.  Robust and accurate vectorization of line drawings , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  D. Kaim,et al.  Historical dataset of administrative units with social-economic attributes for Austrian Silesia 1837–1910 , 2020, Scientific Data.

[19]  Craig A. Knoblock,et al.  A Survey of Digital Map Processing Techniques , 2014, ACM Comput. Surv..

[20]  Jitendra Malik,et al.  A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[21]  Humphrey Southall,et al.  GB1900: Engaging the Public in Very Large Scale Gazetteer Construction from the Ordnance Survey “County Series” 1:10,560 Mapping of Great Britain , 2017 .

[22]  Michael Kampffmeyer,et al.  ConnNet: A Long-Range Relation-Aware Pixel-Connectivity Network for Salient Segmentation , 2018, IEEE Transactions on Image Processing.

[23]  Carsten Rother,et al.  Panoptic Segmentation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Philip H. S. Torr,et al.  Recurrent Instance Segmentation , 2015, ECCV.

[25]  Gilles Bertrand,et al.  Quasi-Linear Algorithms for the Topological Watershed , 2005, Journal of Mathematical Imaging and Vision.

[26]  Bhabatosh Chanda,et al.  Extraction and recognition of geographical features from paper maps , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[27]  Stefan Leyk,et al.  Colors of the past: color image segmentation in historical topographic maps based on homogeneity , 2010, GeoInformatica.

[28]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[29]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[30]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[32]  Julien Perret,et al.  Combining Deep Learning and Mathematical Morphology for Historical Map Segmentation , 2021, ArXiv.

[33]  Jiajun Wu,et al.  Raster-to-Vector: Revisiting Floorplan Transformation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[34]  Ming Yang,et al.  BDCN: Bi-Directional Cascade Network for Perceptual Edge Detection , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  J. Serra,et al.  MATHEMATICAL MORPHOLOGY IN COLOR SPACES APPLIED TO THE ANALYSIS OF CARTOGRAPHIC IMAGES , 2003 .

[36]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.