-
公开(公告)号:US10685236B2
公开(公告)日:2020-06-16
申请号:US16028352
申请日:2018-07-05
Applicant: Adobe Inc.
Inventor: Saayan Mitra , Viswanathan Swaminathan , Somdeb Sarkhel , Julio Alvarez Martinez, Jr.
Abstract: A metadata generation system utilizes machine learning techniques to accurately describe content of videos based on multi-model predictions. In some embodiments, multiple feature sets are extracted from a video, including feature sets showing correlations between additional features of the video. The feature sets are provided to a learnable pooling layer with multiple modeling techniques, which generates, for each of the feature sets, a multi-model content prediction. In some cases, the multi-model predictions are consolidated into a combined prediction. Keywords describing the content of the video are determined based on the multi-model predictions (or combined prediction). An augmented video is generated with metadata that is based on the keywords.