Prosformer: Accurate Surface Reconstruction for Sparse Profilometer Measurement with Transformer

Surface micro-structure measurement is significant for precision manufacturing. However, existing stylus profilometer is inefficient, and sparse line-scan measurement can't support accurate surface description. To improve the reconstruction performance, we propose a high-accurate reconstruction method with sparse line-scan measurement based on attention mechanism. We first arrange the sparse-line measurement in the 2D matrix and crop as patch region. Then we utilize transformer to construct semantic relationships between patches and assign new weights to each patch to accurately model the structural relationships of target region and perform feature extraction, where the self-attention can enhance the description of local details while cross-attention will interact with global information. Finally, a fully connected network as a decoder is adopted to reconstruct accurate surface details with complete geometric representation. We refer to this model as Prosformer. Furthermore, we simulate a larger-scale surface micro-structure dataset to drive the training process and measure micro-structures to valid Prosformer. Experiments show proposed method can effectively restore complex surface details.

[1]  Yizhou Yu,et al.  Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  T. Zeng,et al.  Transformer for Single Image Super-Resolution , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3]  D. Tao,et al.  A Survey on Vision Transformer , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  M. Ren,et al.  Fast Surface Topography Reconstruction Method for Profilometer Measurement based on Neural Continuous Representation , 2021, 2021 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD).

[5]  Shizheng Wang,et al.  PU-EVA: An Edge-Vector based Approximation Solution for Flexible-scale Point Cloud Upsampling , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[6]  Luc Van Gool,et al.  SwinIR: Image Restoration Using Swin Transformer , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[7]  L. Gool,et al.  Video Super-Resolution Transformer , 2021, ArXiv.

[8]  Cordelia Schmid,et al.  Segmenter: Transformer for Semantic Segmentation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[9]  Wolfgang Broll,et al.  Point Cloud Upsampling and Normal Estimation using Deep Learning for Robust Surface Reconstruction , 2021, VISIGRAPP.

[10]  Ralph R. Martin,et al.  PCT: Point cloud transformer , 2020, Computational Visual Media.

[11]  Wen Gao,et al.  Pre-Trained Image Processing Transformer , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Klaus Dietmayer,et al.  Point Transformer , 2020, IEEE Access.

[13]  S. Gelly,et al.  An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[14]  Baining Guo,et al.  Learning Texture Transformer Network for Image Super-Resolution , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Nicolas Usunier,et al.  End-to-End Object Detection with Transformers , 2020, ECCV.

[16]  Ilya Sutskever,et al.  Language Models are Unsupervised Multitask Learners , 2019 .

[17]  Daniel Cohen-Or,et al.  PU-Net: Point Cloud Upsampling Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[18]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[19]  Lijian Sun,et al.  Gaussian process based intelligent sampling for measuring nano-structure surfaces , 2016, International Symposium on Advanced Optical Manufacturing and Testing Technologies (AOMATT).

[20]  Stepan Yu. Gatilov,et al.  Vectorizing NURBS surface evaluation with basis functions in power basis , 2016, Comput. Aided Des..

[21]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Xiaoou Tang,et al.  Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[24]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[25]  Hsi-Yung Feng,et al.  Reconstruction of 2D polygonal curves and 3D triangular surfaces via clustering of Delaunay circles/spheres , 2011, Comput. Aided Des..

[26]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[27]  Vincenzo Piuri,et al.  Automatic multiscale meshing through HRBF networks , 2005, IEEE Transactions on Instrumentation and Measurement.

[28]  Hwang Soo Lee,et al.  Adaptive image interpolation based on local gradient features , 2004, IEEE Signal Process. Lett..

[29]  Steven Fortune,et al.  Voronoi Diagrams and Delaunay Triangulations , 2004, Handbook of Discrete and Computational Geometry, 2nd Ed..