Laser & Optoelectronics Progress, Volume. 58, Issue 22, 2215008(2021)
Video Summarization Algorithm Based on Improved Fully Convolutional Network
In the face of massive video data, video summarization technique plays an increasingly important role in video retrieval, video browsing and other fields. It aims to obtain important information in input videos by generating short video clips or selecting a set of key frames. Most of the existing methods focus on the representativeness and diversity of video summarization, without considering the multi-scale contextual information such as the structure of the video. To solve the above problems, a video summarization model based on improved fully convolutional network is proposed, in which time pyramid pooling is used to extract multi-scale contextual information, and the fully connected conditional random field is used to label the video frame sequence. Experiments on SumMe and TVSum datasets show that the proposed model achieves better performance than fully convolutional sequence networks, and the F-score indexes on these two data sets are improved by 1.6% and 3.0%, respectively.
Get Citation
Copy Citation Text
Hao Wang, Li Peng. Video Summarization Algorithm Based on Improved Fully Convolutional Network[J]. Laser & Optoelectronics Progress, 2021, 58(22): 2215008
Category: Machine Vision
Received: Mar. 15, 2021
Accepted: Jul. 15, 2021
Published Online: Nov. 10, 2021
The Author Email: Li Peng (penglimail2002@163.com)