Automatic recognition of visual and audio-visual cues

Granted invention patent

US12125317B2 Automatic recognition of visual and audio-visual cues (in force)

Patent title: Automatic recognition of visual and audio-visual cues
Application number: US17539652

Filing date: 2021-12-01
Publication (announcement) number: US12125317B2

Publication (announcement) date: 2024-10-22
Inventors: Jiyoung Lee, Justin Jonathan Salamon, Dingzeyu Li
Applicant: ADOBE INC.
Applicant address: San Jose, CA, US
Patentee: ADOBE INC.
Current patentee: ADOBE INC.
Current patentee address: San Jose, CA, US
Agency: F. Chau & Associates, LLC
Main classification: G06V40/20
IPC classifications: G06V40/20; G06N3/045; G06N3/08; G06V10/82; G06V20/40

Abstract:

A method for detecting a cue (e.g., a visual cue or a visual cue combined with an audible cue) occurring together in an input video includes: presenting a user interface to record an example video of a user performing an act including the cue; determining a part of the example video where the cue occurs; applying a feature of the part to a neural network to generate a positive embedding; dividing the input video into a plurality of chunks and applying a feature of each chunk to the neural network to generate a plurality of negative embeddings; applying a feature of a given one of the chunks to the neural network to output a query embedding; and determining whether the cue occurs in the input video from the query embedding, the positive embedding, and the negative embeddings.
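The abstract does not say how the query, positive, and negative embeddings are combined into a decision. As a rough illustration only, the sketch below assumes one plausible rule: a chunk is taken to contain the cue when its query embedding is closer (by cosine similarity) to the positive embedding than to every negative embedding, by at least a margin. The function names `cosine_similarity` and `cue_occurs`, the `margin` parameter, and the toy vectors are all hypothetical and do not come from the patent; the neural network that would produce the embeddings is not modeled.

```python
import math
from typing import Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def cue_occurs(
    query: Vector,
    positive: Vector,
    negatives: Sequence[Vector],
    margin: float = 0.0,
) -> bool:
    """Assumed decision rule (not taken from the patent): the cue is present
    when the query is more similar to the positive embedding than to the
    most similar negative embedding, by at least `margin`."""
    pos_sim = cosine_similarity(query, positive)
    # With no negatives, fall back to the lowest possible cosine similarity.
    neg_sim = max(
        (cosine_similarity(query, n) for n in negatives),
        default=-1.0,
    )
    return pos_sim > neg_sim + margin


# Toy embeddings standing in for neural-network outputs (hypothetical values).
positive_embedding = [1.0, 0.0]
negative_embeddings = [[0.0, 1.0], [0.1, 0.9]]

print(cue_occurs([0.9, 0.1], positive_embedding, negative_embeddings))
print(cue_occurs([0.0, 1.0], positive_embedding, negative_embeddings))
```

In this sketch the negatives are passed in explicitly, so the caller decides which chunks of the input video serve as negatives for a given query chunk; excluding the query chunk itself from that set is an assumption of the example, since a chunk compared against its own embedding would always score a similarity of 1.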

Publication/grant documents

US20230169795A1 AUTOMATIC RECOGNITION OF VISUAL AND AUDIO-VISUAL CUES Publication/grant date: 2023-06-01

Information lookup

Espacenet

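IPC classification:

G	Physics
G06	Computing; calculating or counting
G06V	Image or video recognition or understanding
G06V40/00	Recognition of biometric, human-related or animal-related patterns in image or video data
G06V40/20	. Movements or behaviour, e.g. gesture recognition (facial expression recognition: G06V40/16)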