- 专利标题: Method and apparatus for summarization of unsupervised video with efficient key frame selection reward functions
-
申请号: US17730536申请日: 2022-04-27
-
公开(公告)号: US11756300B1公开(公告)日: 2023-09-12
- 发明人: Geun Sik Jo , Ui Nyoung Yoon , Myung Duk Hong
- 申请人: INHA University Research and Business Foundation
- 申请人地址: KR Incheon
- 专利权人: INHA UNIVERISTY RESEARCH AND BUSINESS FOUNDATION
- 当前专利权人: INHA UNIVERISTY RESEARCH AND BUSINESS FOUNDATION
- 当前专利权人地址: KR Incheon
- 代理机构: Keohane & D'Alessandro, PLLC
- 代理商 Hunter E. Webb
- 主分类号: G06V10/74
- IPC分类号: G06V10/74 ; G06V20/40
摘要:
Disclosed are a method and apparatus for summarization of unsupervised video with efficient key frame selection reward functions. Frame-level visual features are extracted from an input video. An attention weight is computed and an importance score is represented as a frame tracking probability for selecting a key frame using the attention weight. A temporal consistency reward function and a representativeness reward function are obtained so as to select the key frame, based on a visual similarity distance and temporal distance between key frames, and an attention-based video summarization network is trained to predict an importance score for selecting a key frame of a video summary by using the temporal consistency reward function and the representativeness reward function. A video summary is created by selecting a corresponding key frame based on the predicted importance score, the quality of the created video summary is evaluated, and policy gradient learning is performed for the attention-based video summarization network. Regularization and reconstruction loss is calculated for controlling the probability to select a key frame by using the importance score of the selected key frame. A video summary is created based on the calculated regularization and reconstruction loss.
公开/授权文献
信息查询