Laser & Optoelectronics Progress, Volume. 58, Issue 22, 2215008(2021)

Video Summarization Algorithm Based on Improved Fully Convolutional Network

Hao Wang and Li Peng*
Author Affiliations
  • School of Internet of Things Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China
  • show less

    In the face of massive video data, video summarization technique plays an increasingly important role in video retrieval, video browsing and other fields. It aims to obtain important information in input videos by generating short video clips or selecting a set of key frames. Most of the existing methods focus on the representativeness and diversity of video summarization, without considering the multi-scale contextual information such as the structure of the video. To solve the above problems, a video summarization model based on improved fully convolutional network is proposed, in which time pyramid pooling is used to extract multi-scale contextual information, and the fully connected conditional random field is used to label the video frame sequence. Experiments on SumMe and TVSum datasets show that the proposed model achieves better performance than fully convolutional sequence networks, and the F-score indexes on these two data sets are improved by 1.6% and 3.0%, respectively.

    Tools

    Get Citation

    Copy Citation Text

    Hao Wang, Li Peng. Video Summarization Algorithm Based on Improved Fully Convolutional Network[J]. Laser & Optoelectronics Progress, 2021, 58(22): 2215008

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Mar. 15, 2021

    Accepted: Jul. 15, 2021

    Published Online: Nov. 10, 2021

    The Author Email: Li Peng (penglimail2002@163.com)

    DOI:10.3788/LOP202158.2215008

    Topics