Laser & Optoelectronics Progress, Volume. 59, Issue 8, 0811009(2022)
Micro-Video Popularity Prediction with Bidirectional Deep Encoding Network
Aiming at the micro-video popularity prediction, we propose a micro-video popularity prediction model with a bidirectional deep encoding network. The model considers both multi-modal fusion and unimodal supervision modeling, and integrates them into a bidirectional deep encoding network. The multi-modal fusion module uses modal relevance to solve problems such as data missing and dimensional differences among original features to obtain a more comprehensive feature representation. The unimodal supervision module uses modal differences to supervise multi-modal feature fusion. Via joint training of multi-modal fusion and unimodal supervision tasks, the consistency and difference of multi-modal information are fully learned to improve the generalization ability of the algorithm. The experiments on the public NUS dataset have proved the effectiveness and superiority of our proposed algorithm.
Get Citation
Copy Citation Text
Peiguang Jing, Xuqing Ye, Yu Liu, Yuting Su. Micro-Video Popularity Prediction with Bidirectional Deep Encoding Network[J]. Laser & Optoelectronics Progress, 2022, 59(8): 0811009
Category: Imaging Systems
Received: Aug. 9, 2021
Accepted: Sep. 10, 2021
Published Online: Apr. 11, 2022
The Author Email: Ye Xuqing (yxq@tju.edu.cn)