Optics and Precision Engineering, Vol. 32, Issue 7, 1087 (2024)
Collaborative classification of hyperspectral and LiDAR data based on CNN-transformer
To address the challenges of multimodal classification with hyperspectral images (HSI) and LiDAR data, such as cross-modal information representation and feature alignment, this paper proposes a contrastive learning-based multi-branch CNN-Transformer network (CLCT-Net) for the joint classification of hyperspectral and LiDAR data. First, CLCT-Net uses a feature extraction module built on a ConvNeXt V2 block to capture features shared across modalities, which addresses the semantic alignment problem between data from heterogeneous sensors. Second, it constructs a dual-branch HSI encoder, with a spatial-channel branch and a spectral-context branch, together with a LiDAR encoder enhanced by a frequency-domain self-attention mechanism, to obtain more comprehensive feature representations. Finally, it applies ensemble contrastive learning at the classification stage to further improve the accuracy of multimodal collaborative classification. Experiments on the Houston 2013 and Trento datasets show that the proposed model effectively extracts and fuses cross-modal features, reaching ground-object classification accuracies of 92.01% and 98.90%, respectively, which exceed those of the existing hyperspectral and LiDAR classification models it was compared against.
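The abstract does not spell out the contrastive objective, so the following is only a minimal sketch of the general idea behind cross-modal contrastive learning, not the CLCT-Net loss itself. It assumes a standard symmetric InfoNCE loss in which the HSI and LiDAR embeddings of the same pixel patch form a positive pair and all other pairings in the batch are negatives. The function names `l2_normalize` and `info_nce`, the `temperature` value, and the toy embeddings are all hypothetical.

```python
import math


def l2_normalize(v):
    """Scale a non-zero vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def info_nce(hsi_emb, lidar_emb, temperature=0.1):
    """Symmetric InfoNCE loss for paired embeddings (assumed form, not the paper's).

    hsi_emb[i] and lidar_emb[i] are taken to describe the same sample;
    every other pairing in the batch acts as a negative.
    """
    a = [l2_normalize(v) for v in hsi_emb]
    b = [l2_normalize(v) for v in lidar_emb]
    n = len(a)

    # Temperature-scaled cosine similarity between every HSI/LiDAR pair.
    logits = [
        [sum(x * y for x, y in zip(a[i], b[j])) / temperature for j in range(n)]
        for i in range(n)
    ]

    def cross_entropy_diag(rows):
        # Cross-entropy where the correct "class" for row i is column i.
        total = 0.0
        for i, row in enumerate(rows):
            m = max(row)  # subtract the max for numerical stability
            log_sum = m + math.log(sum(math.exp(z - m) for z in row))
            total += log_sum - row[i]
        return total / len(rows)

    # Average the HSI-to-LiDAR and LiDAR-to-HSI directions.
    columns = [list(c) for c in zip(*logits)]
    return 0.5 * (cross_entropy_diag(logits) + cross_entropy_diag(columns))


# Toy batch: two samples with 2-D embeddings from each modality.
hsi = [[1.0, 0.0], [0.0, 1.0]]
lidar_aligned = [[0.9, 0.1], [0.1, 0.9]]
lidar_shuffled = [[0.1, 0.9], [0.9, 0.1]]

loss_aligned = info_nce(hsi, lidar_aligned)
loss_shuffled = info_nce(hsi, lidar_shuffled)
```

In this toy batch the aligned pairing gives a lower loss than the shuffled one, which is the behaviour such an objective relies on to pull the two modalities into a shared feature space.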
Haibin WU, Shiyu DAI, Aili WANG, Iwahori YUJI, Xiaoyu YU. Collaborative classification of hyperspectral and LiDAR data based on CNN-transformer[J]. Optics and Precision Engineering, 2024, 32(7): 1087
Received: Oct. 23, 2023
Published Online: May 28, 2024