Optics and Precision Engineering, Volume. 32, Issue 2, 237(2024)
Audio object detection network with multimodal cross level feature knowledge transfer
Fig. 2. Multimodal knowledge distillation target detection network
Fig. 4. Cross-level feature knowledge transfer loss based on attentional fusion
Fig. 5. Attention fusion module(AFM) and the KL divergence calculation module(KLD)
Fig. 8. Comparison of object detection capability under different network architecture
Fig. 10. Qualitative comparison of vehicle detection capability with or without LDLoss
Fig. 12. Qualitatively compares the vehicle detection capabilities of the baseline network and the method presented in this paper
|
|
|
|
|
|
Get Citation
Copy Citation Text
Shibei LIU, Ying CHEN. Audio object detection network with multimodal cross level feature knowledge transfer[J]. Optics and Precision Engineering, 2024, 32(2): 237
Category:
Received: Jun. 8, 2023
Accepted: --
Published Online: Apr. 2, 2024
The Author Email: CHEN Ying (chenying@jiangnan.edu.cn)