PointGame: Geometrically and Adaptively Masked Autoencoder on Point Clouds

Authors: Liu, Y., Yan, X., Li, Z., Chen, Z., Wei, Z. and Wei, M.

Journal: IEEE Transactions on Geoscience and Remote Sensing

Volume: 61

Pages: 1-12

eISSN: 1558-0644

ISSN: 0196-2892

DOI: 10.1109/TGRS.2023.3331748

Abstract:

Self-supervised learning is attracting considerable attention in point cloud understanding. However, exploring discriminative and transferable features remains challenging because point clouds are inherently irregular. We propose a geometrically and adaptively masked autoencoder on point clouds for self-supervised learning, termed PointGame. PointGame contains two core components: GATE and EAT. GATE stands for the geometrical and adaptive token embedding module; it not only draws on conventional geometric descriptors, which capture surface shape effectively, but also exploits adaptive saliency to focus on the salient parts of a point cloud. EAT stands for the external attention-based transformer encoder, which has linear computational complexity and thus improves the efficiency of the whole pipeline. Unlike cutting-edge unsupervised learning models, PointGame leverages geometric descriptors to perceive surface shapes and adaptively mines discriminative features from training data. PointGame shows clear advantages over its competitors on various downstream tasks under both global and local fine-tuning strategies. The code and pretrained models will be publicly available.
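The abstract credits EAT's efficiency to external attention with linear computational complexity. As a rough illustration only, the sketch below shows the general external-attention idea in NumPy: tokens attend to a small, fixed-size memory instead of to each other, so the cost grows as O(N * S * d) for N tokens and S memory slots. It is not the authors' implementation; the function name, the memory matrices `mem_k` and `mem_v`, the sizes, and the double-normalisation step are all assumptions made for this example.

```python
import numpy as np

def external_attention(tokens, mem_k, mem_v, eps=1e-9):
    """Generic external attention (illustrative, not PointGame's EAT).

    tokens: (N, d) token features
    mem_k:  (S, d) external key memory (hypothetical learned parameter)
    mem_v:  (S, d) external value memory (hypothetical learned parameter)
    """
    scores = tokens @ mem_k.T                               # (N, S)
    scores = scores - scores.max(axis=0, keepdims=True)     # numerical stability
    attn = np.exp(scores)
    attn = attn / attn.sum(axis=0, keepdims=True)           # softmax over the N tokens
    attn = attn / (attn.sum(axis=1, keepdims=True) + eps)   # L1-normalise over the S slots
    return attn @ mem_v                                     # (N, d)

rng = np.random.default_rng(0)
tokens = rng.normal(size=(128, 32))   # 128 tokens of width 32 (arbitrary sizes)
mem_k = rng.normal(size=(16, 32))     # 16 memory slots
mem_v = rng.normal(size=(16, 32))
out = external_attention(tokens, mem_k, mem_v)
print(out.shape)  # (128, 32)
```

Because the memory size S is fixed, doubling the number of tokens roughly doubles the work, whereas standard self-attention builds an N x N score matrix and scales quadratically.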

https://eprints.bournemouth.ac.uk/39219/

Source: Scopus

PointGame: Geometrically and Adaptively Masked Auto-Encoder on Point Clouds

Authors: Liu, Y., Yan, X., Li, Z., Chen, Z., Wei, Z. and Wei, M.

Journal: IEEE Transactions on Geoscience and Remote Sensing

Volume: 61

ISSN: 0196-2892

Abstract:

Self-supervised learning is attracting considerable attention in point cloud understanding. However, exploring discriminative and transferable features remains challenging because point clouds are inherently irregular. We propose a geometrically and adaptively masked auto-encoder on point clouds for self-supervised learning, termed PointGame. PointGame contains two core components: GATE and EAT. GATE stands for the geometrical and adaptive token embedding module; it not only draws on conventional geometric descriptors, which capture surface shape effectively, but also exploits adaptive saliency to focus on the salient parts of a point cloud. EAT stands for the external attention-based Transformer encoder, which has linear computational complexity and thus improves the efficiency of the whole pipeline. Unlike cutting-edge unsupervised learning models, PointGame leverages geometric descriptors to perceive surface shapes and adaptively mines discriminative features from training data. PointGame shows clear advantages over its competitors on various downstream tasks under both global and local fine-tuning strategies. The code and pre-trained models will be publicly available.

https://eprints.bournemouth.ac.uk/39219/

Source: BURO EPrints