DOAJ Open Access 2025

A novel projection map driven multimodal fusion framework for ALS point cloud semantic segmentation

Pangyin Li Zhe Chen Chen Long Huazu Zhang Yang Lv +3 lainnya

Abstrak

Semantic segmentation of urban point clouds captured by Airborne Laser Scanning (ALS) is essential for understanding complex 3D environments, serving as a robust underlying data foundation for digital twin applications. The fusion of multimodal data has been proven to significantly improve the performance of ALS semantic segmentation by fully mining rich complementary information in each modality. However, existing fusion-based ALS semantic segmentation methods face critical limitations due to the reliance on multiple sensors, which constrains their applicability. To this end, we propose a novel multimodal framework Elevation Guidance Adaptive Fused Network, termed EGAFNet, that integrates naturally formed top-view projection images from ALS to enhance the information perception of the point cloud. The framework focuses on utilizing projection images, structured around two key components: input representation and feature representation. Specifically, to generate highly discriminative input representation, we propose a novel projection method that accurately preserves the relative height relationships between objects and develop a Height Adaptive Scaling Module (HASM) to adaptively adjust object heights, enhancing the expressive capability of elevation information in the projection images. As for feature representation, we design a dual-branch network that effectively captures local and global context from the projection images within a large receptive field. Meanwhile, we propose an Elevation Guidance Adaptive Fusion Module (EGAFM) that adaptively fuses 2D and 3D features based on occlusion relationships to reduce feature confusion caused by occlusion in elevation projection, ensuring meaningful fusion between multimodal features. Extensive experiments on three public datasets demonstrate that our EGAFNet outperforms current state-of-the-art methods.

Penulis (8)

P

Pangyin Li

Z

Zhe Chen

C

Chen Long

H

Huazu Zhang

Y

Yang Lv

R

Ronggang Huang

Z

Zhen Dong

B

Bisheng Yang

Format Sitasi

Li, P., Chen, Z., Long, C., Zhang, H., Lv, Y., Huang, R. et al. (2025). A novel projection map driven multimodal fusion framework for ALS point cloud semantic segmentation. https://doi.org/10.1016/j.jag.2025.104907

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1016/j.jag.2025.104907
Informasi Jurnal
Tahun Terbit
2025
Sumber Database
DOAJ
DOI
10.1016/j.jag.2025.104907
Akses
Open Access ✓