Collaborative classification of hyperspectral and LiDAR data based on CNN-transformer

Haibin WU; Shiyu DAI; Aili WANG; Iwahori YUJI; Xiaoyu YU

doi:10.37188/OPE.20243207.1087

Optics and Precision Engineering, Volume. 32, Issue 7, 1087(2024)

Collaborative classification of hyperspectral and LiDAR data based on CNN-transformer

Haibin WU1... Shiyu DAI1, Aili WANG1,*, Iwahori YUJI2 and Xiaoyu YU3 |Show fewer author(s)

¹Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application， College of Measurement and Control Technology and Communication Engineering， Harbin University of Science and Technology， Harbin50080， China

²Department of Computer Science， Chubu University， Aichi487-8501， Japan

³College of Electron and Information， University of Electronic Science and Technology of China，Zhongshan Institute， Zhongshan528400， China

show less

Abstract Get PDF(in Chinese)

To tackle the challenges in multimodal classification tasks involving hyperspectral images (HSI) and LiDAR data, such as cross-modal information expression and feature alignment, this paper introduces a contrastive learning-based multi-branch CNN-Transformer network (CLCT-Net) for the joint classification of hyperspectral and LiDAR data. Initially, CLCT-Net employs a feature extraction module with a ConvNeXt V2 Block to capture shared features across different modalities, addressing the semantic alignment issue between data from heterogeneous sensors. It then develops a dual-branch HSI encoder with spatial channel and spectral context branches, alongside a LiDAR encoder enhanced by a frequency domain self-attention mechanism, to secure more comprehensive feature representations. Lastly, it leverages ensemble contrastive learning for classification to further refine the accuracy of multimodal collaborative classification. Experimental evaluations on the Houston 2013 and Trento datasets demonstrate that the proposed model excels in extracting and integrating cross-modal data features, achieving superior ground object classification accuracies of 92.01% and 98.90%, respectively, when compared to existing models for classifying hyperspectral images and LiDAR data.

Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.

Keywords

contrastive learning convolutional neural network hyperspectral image LiDAR data transformer

Tools

Get Citation

Copy Citation Text

Haibin WU, Shiyu DAI, Aili WANG, Iwahori YUJI, Xiaoyu YU. Collaborative classification of hyperspectral and LiDAR data based on CNN-transformer[J]. Optics and Precision Engineering, 2024, 32(7): 1087

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites