Multiclass semantic segmentation for digitisation of movable heritage using deep learning techniques

Digitisation processes of movable heritage are becoming increasingly popular to document the artworks stored in our museums. A growing number of strategies for the three-dimensional (3D) acquisition and modelling of these invaluable assets have been developed in the last few years. Their objective is to efficiently respond to this documentation need and contribute to deepening the knowledge of the masterpieces investigated constantly by researchers operating in many fieldworks. Nowadays, one of the most effective solutions is represented by the development of image-based techniques, usually connected to a Structure-from-Motion (SfM) photogrammetric approach. However, while images acquisition is relatively rapid, the processes connected to data processing are very time-consuming and require the operator’s substantial manual involvement. Developing deep learning-based strategies can be an effective solution to enhance the automatism level. In this research, which has been carried out in the framework of the digitisation of a wooden maquettes collection stored in the ‘Museo Egizio di Torino’, using a photogrammetric approach, an automatic masking strategy using deep learning techniques is proposed, to increase the level of automatism and therefore, optimise the photogrammetric pipeline. Starting from a manually annotated dataset, a neural network was trained to automatically perform a semantic classification to isolate the maquettes from the background. The proposed methodology allowed the researchers to obtain automatically segmented masks with a high degree of accuracy. The workflow is described (as regards acquisition strategies, dataset processing, and neural network training). In addition, the accuracy of the results is evaluated and discussed. Finally, the researchers proposed the possibility of performing a multiclass segmentation on the digital images to recognise different object categories in the images, as well as to define a semantic hierarchy to perform automatic classification of different elements in the acquired images.Highlights:In the framework of movable heritage digitisation processes, many procedures are very time-consuming, and they still require the operator’s substantial manual involvement.This research proposes using deep learning techniques to enhance the automatism level in the generation of exclusion masks, improving the optimisation of the photogrammetric procedures.Following this strategy, the possibility of performing a multiclass semantic segmentation (on the 2D images and, consequently, on the 3D point cloud) is also discussed, considering the accuracy of the obtainable results.

[1]  Caterina Balletti,et al.  An Application of Integrated 3D Technologies for Replicas in Cultural Heritage , 2019, ISPRS Int. J. Geo Inf..

[2]  Gary K. L. Tam,et al.  3D Mesh Segmentation via Multi-branch 1D Convolutional Neural Networks , 2017, Graph. Model..

[3]  Fulvio Rinaudo,et al.  A Neural Networks Approach to Detecting Lost Heritage in Historical Video , 2020, ISPRS Int. J. Geo Inf..

[4]  Andrea Adami,et al.  The bust of Francesco II Gonzaga: from digital documentation to 3D printing. , 2015 .

[5]  Roberto Pierdicca,et al.  Point Cloud Semantic Segmentation Using a Deep Learning Framework for Cultural Heritage , 2020, Remote. Sens..

[6]  Ersan YAZAN,et al.  Comparison of the stochastic gradient descent based optimization techniques , 2017, 2017 International Artificial Intelligence and Data Processing Symposium (IDAP).

[7]  M. Pontil,et al.  Machine Learning for Cultural Heritage: A Survey , 2020, Pattern Recognit. Lett..

[8]  Fabio Bruno,et al.  User-centered design of a virtual reality exhibit for archaeological museums , 2017, International Journal on Interactive Design and Manufacturing (IJIDeM).

[9]  Riccardo Roncella,et al.  Photogrammetric Techniques for Promotion of Archaeological Heritage: the Archaeological Museum of Parma (italy) , 2016 .

[10]  Massimiliano Lo Turco,et al.  The digitisation of museum collections for research, management and enhancement of tangible and intangible heritage , 2018, 2018 3rd Digital Heritage International Congress (DigitalHERITAGE) held jointly with 2018 24th International Conference on Virtual Systems & Multimedia (VSMM 2018).

[11]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[12]  Caterina Balletti,et al.  3D printing: State of the art and future perspectives , 2017 .

[13]  Roberto Pierdicca,et al.  Mo.Se.: Mosaic image segmentation based on deep cascading learning , 2021 .

[14]  Thomas P. Kersten,et al.  Development of a Virtual Museum Including a 4d Presentation of Building History in Virtual Reality , 2017 .

[15]  José García Rodríguez,et al.  A survey on deep learning techniques for image and video semantic segmentation , 2018, Appl. Soft Comput..

[16]  Amir Mosavi,et al.  Deep Learning: A Review , 2018 .

[17]  Viviana Mollica Nardo,et al.  Combined 3D Surveying and Raman Spectroscopy Techniques on Artifacts Preserved at Archaeological Museum of Lipari , 2019 .

[18]  Giacomo Patrucco,et al.  Image and Range-Based 3D Acquisition and Modeling of Popular Musical Instruments , 2018 .

[19]  Rich Caruana,et al.  Overfitting in Neural Nets: Backpropagation, Conjugate Gradient, and Early Stopping , 2000, NIPS.

[20]  Fabio Remondino,et al.  GEOMETRIC FEATURES ANALYSIS FOR THE CLASSIFICATION OF CULTURAL HERITAGE POINT CLOUDS , 2019, ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[21]  Armin Gruen,et al.  3D Reconstruction of a Complex Grid Structure Combining UAS Images and Deep Learning , 2020, Remote. Sens..

[22]  Fulvio Rinaudo,et al.  A NEW HANDHELD SCANNER FOR 3D SURVEY OF SMALL ARTIFACTS: THE STONEX F6 , 2019, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[23]  Fabio Remondino,et al.  Online access and sharing of reality-based 3d models , 2014 .

[24]  Gabriele Guidi,et al.  The Indiana University-Uffizi project: Metrologica! challenges and workflow for massive 3D digitization of sculptures , 2017, 2017 23rd International Conference on Virtual System & Multimedia (VSMM).

[25]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[26]  Ting Liu,et al.  Recent advances in convolutional neural networks , 2015, Pattern Recognit..

[27]  M. Viñals,et al.  POTENTIAL OF HBIM TO IMPROVE THE EFFICIENCY OF VISITOR FLOW MANAGEMENT IN HERITAGE SITES. TOWARDS SMART HERITAGE MANAGEMENT , 2020 .

[28]  Alessandra Spreafico,et al.  Hybrid 3D Models: When Geomatics Innovations Meet Extensive Built Heritage Complexes , 2019, ISPRS Int. J. Geo Inf..

[29]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Jianping Shi,et al.  Graph-Guided Architecture Search for Real-Time Semantic Segmentation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Yoshua Bengio,et al.  The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[32]  Fabio Remondino,et al.  APPLICATION OF MACHINE AND DEEP LEARNING STRATEGIES FOR THE CLASSIFICATION OF HERITAGE POINT CLOUDS , 2019, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[33]  Gabriele Guidi,et al.  Massive 3D digitization of sculptures: Methodological approaches for improving efficiency , 2018 .

[34]  Samuel Rota Bulo,et al.  Modeling the Background for Incremental Learning in Semantic Segmentation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Roberto Medina,et al.  Classification of Architectural Heritage Images Using Deep Learning Techniques , 2017 .

[36]  F. Remondino,et al.  SEMANTIC PHOTOGRAMMETRY – BOOSTING IMAGE-BASED 3D RECONSTRUCTION WITH SEMANTIC LABELING , 2019, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[37]  Changming Sun,et al.  Knowledge Adaptation for Efficient Semantic Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).