Publication No.: US20240086457A1
Publication Date: 2024-03-14
Application No.: US17944502
Filing Date: 2022-09-14
Applicant: ADOBE INC.
Inventors: Yaman KUMAR, Vaibhav AHLAWAT, Ruiyi ZHANG, Milan AGGARWAL, Ganesh Karbhari PALWE, Balaji KRISHNAMURTHY, Varun KHURANA
Abstract: A content analysis system provides content understanding for a content item using an attention aware multi-modal model. Given a content item, feature extractors extract features from the content components of the content item, where the content components span multiple modalities. A cross-modal attention encoder of the attention aware multi-modal model generates an embedding of the content item from the features extracted from the content components. A decoder of the attention aware multi-modal model then generates an action-reason statement from the embedding produced by the cross-modal attention encoder.
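
The encoder stage described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes that features from every modality are already projected to vectors of the same length, it uses plain scaled dot-product attention without learned weights, and it mean-pools the result into one embedding. The function names (`softmax`, `cross_modal_attention`, `embed_content_item`) and the toy feature values are hypothetical. Feature extraction and the decoder that produces the action-reason statement are not modeled here.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_modal_attention(tokens):
    """Scaled dot-product attention in which every token attends to all tokens.

    `tokens` mixes feature vectors from all modalities, so a text token can
    attend to image tokens and vice versa. No learned projections are used
    (an assumption made to keep the sketch short).
    """
    dim = len(tokens[0])
    attended = []
    for query in tokens:
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        weights = softmax(scores)
        attended.append(
            [sum(w * value[i] for w, value in zip(weights, tokens)) for i in range(dim)]
        )
    return attended


def embed_content_item(features_by_modality):
    """Flatten per-modality features, apply attention, and mean-pool."""
    tokens = [vec for vecs in features_by_modality.values() for vec in vecs]
    attended = cross_modal_attention(tokens)
    dim = len(attended[0])
    return [sum(tok[i] for tok in attended) / len(attended) for i in range(dim)]


# Toy, hand-written features standing in for real extractor output.
features = {
    "image": [[1.0, 0.0], [0.0, 1.0]],
    "text": [[1.0, 1.0]],
}
embedding = embed_content_item(features)
print(embedding)
```

In a full system the pooled embedding would be passed to a trained decoder; mean pooling is only one possible way to reduce the attended tokens to a single vector.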