-
公开(公告)号:US20220180127A1
公开(公告)日:2022-06-09
申请号:US17569725
申请日:2022-01-06
Applicant: INTEL CORPORATION
Inventor: Yurong CHEN , Jianguo LI , Zhou SU , Zhiqiang SHEN
IPC: G06K9/62 , G06F40/169 , G06N3/08 , G06V20/40
Abstract: Techniques and apparatus for generating dense natural language descriptions for video content are described. In one embodiment, for example, an apparatus may include at least one memory and logic, at least a portion of the logic comprised in hardware coupled to the at least one memory, the logic to receive a source video comprising a plurality of frames, determine a plurality of regions for each of the plurality of frames, generate at least one region-sequence connecting the determined plurality of regions, apply a language model to the at least one region-sequence to generate description information comprising a description of at least a portion of content of the source video. Other embodiments are described and claimed.
-
公开(公告)号:US20210142115A1
公开(公告)日:2021-05-13
申请号:US16616533
申请日:2017-06-29
Applicant: INTEL CORPORATION
Inventor: Yurong CHEN , Jianguo LI , Zhou SU , Zhiqiang SHEN
IPC: G06K9/62 , G06K9/00 , G06N3/08 , G06F40/169
Abstract: Techniques and apparatus for generating dense natural language descriptions for video content are described. In one embodiment, for example, an apparatus may include at least one memory and logic, at least a portion of the logic comprised in hardware coupled to the at least one memory, the logic to receive a source video comprising a plurality of frames, determine a plurality of regions for each of the plurality of frames, generate at least one region-sequence connecting the determined plurality of regions, apply a language model to the at least one region-sequence to generate description information comprising a description of at least a portion of content of the source video. Other embodiments are described and claimed.
-