Multi-model techniques to generate video metadata

Invention Grant

US10685236B2 Multi-model techniques to generate video metadata (Active)

Patent Title: Multi-model techniques to generate video metadata
Application No.: US16028352

Application Date: 2018-07-05
Publication No.: US10685236B2

Publication Date: 2020-06-16
Inventor: Saayan Mitra, Viswanathan Swaminathan, Somdeb Sarkhel, Julio Alvarez Martinez, Jr.
Applicant: Adobe Inc.
Applicant Address: US CA San Jose
Assignee: Adobe Inc.
Current Assignee: Adobe Inc.
Current Assignee Address: US CA San Jose
Agency: Kilpatrick Townsend & Stockton LLP
Main IPC: G06K9/00
IPC: G06K9/00; G06N20/00; G06F16/73; G06F16/78; G06K9/62

Abstract:

A metadata generation system utilizes machine learning techniques to accurately describe content of videos based on multi-model predictions. In some embodiments, multiple feature sets are extracted from a video, including feature sets showing correlations between additional features of the video. The feature sets are provided to a learnable pooling layer with multiple modeling techniques, which generates, for each of the feature sets, a multi-model content prediction. In some cases, the multi-model predictions are consolidated into a combined prediction. Keywords describing the content of the video are determined based on the multi-model predictions (or combined prediction). An augmented video is generated with metadata that is based on the keywords.
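The flow described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the label vocabulary, the toy linear "models", the use of simple averaging for both the per-feature-set multi-model prediction and the combined prediction, and the score threshold are all assumptions chosen for illustration. The abstract does not specify how the learnable pooling layer or the consolidation step works, so every function name below is hypothetical.

```python
from statistics import mean

# Hypothetical label vocabulary; the abstract does not define one.
LABELS = ["sports", "music", "cooking"]


def make_toy_model(weights):
    """Build a stand-in model: one weight row per label, scores clamped to [0, 1].

    Assumption: real systems would use trained models; this is a fixed linear scorer.
    """
    def model(features):
        scores = {}
        for label, row in zip(LABELS, weights):
            raw = sum(w * x for w, x in zip(row, features))
            scores[label] = min(1.0, max(0.0, raw))
        return scores
    return model


def multi_model_prediction(features, models):
    """Score one feature set with several models and average per label."""
    outputs = [m(features) for m in models]
    return {label: mean(o[label] for o in outputs) for label in LABELS}


def combine_predictions(predictions):
    """Consolidate per-feature-set predictions (assumed here: plain averaging)."""
    return {label: mean(p[label] for p in predictions) for label in LABELS}


def select_keywords(prediction, threshold=0.4):
    """Keep labels scoring at or above the threshold, highest score first."""
    kept = [label for label, score in prediction.items() if score >= threshold]
    return sorted(kept, key=lambda label: -prediction[label])


def augment_video(video_id, keywords):
    """Attach keyword metadata to a video record (represented as a dict)."""
    return {"video_id": video_id, "metadata": {"keywords": list(keywords)}}


def describe_video(video_id, feature_sets, models, threshold=0.4):
    """Run the whole sketch: feature sets -> predictions -> keywords -> metadata."""
    per_set = [multi_model_prediction(f, models) for f in feature_sets.values()]
    combined = combine_predictions(per_set)
    return augment_video(video_id, select_keywords(combined, threshold))


if __name__ == "__main__":
    # Made-up two-dimensional feature sets and weights, for demonstration only.
    feature_sets = {"visual": [1.0, 0.0], "audio": [0.0, 1.0]}
    models = [
        make_toy_model([[0.9, 0.1], [0.1, 0.8], [0.2, 0.2]]),
        make_toy_model([[0.7, 0.3], [0.3, 0.6], [0.0, 0.4]]),
    ]
    print(describe_video("demo-video", feature_sets, models))
```

Averaging is used at both stages only because it is the simplest consolidation rule; a learned weighting over models or feature sets would fit the same structure.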

Public/Granted literature

US20200012862A1 Multi-model Techniques to Generate Video Metadata Publication date: 2020-01-09

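IPC classification:

G	Physics
G06	Computing; calculating or counting
G06K	Graphical data reading (image or video recognition or understanding G06V); presentation of data; record carriers; handling record carriers
G06K9/00	Methods or arrangements for recognising patterns (methods or arrangements for graph-reading or for converting the pattern of mechanical parameters, e.g. force or presence, into electrical signals G06K11/00) (image or video recognition or understanding G06V) (speech recognition G10L15/00)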