VIDEO LOCALIZATION USING ARTIFICIAL INTELLIGENCE

    Publication No.: US20240371164A1

    Publication Date: 2024-11-07

    Application No.: US18652703

    Filing Date: 2024-05-01

    Applicant: Google LLC

    Abstract: Methods and systems for video localization using artificial intelligence are provided herein. A set of video embeddings representing features of one or more video frames of a media item and a set of textual embeddings corresponding to an event associated with the media item are obtained. Fused video-textual data is generated based on the set of video embeddings and the set of textual embeddings. The fused video-textual data indicates features of the video frames of the media item and textual data pertaining to the media item. The fused video-textual data is provided as an input to an artificial intelligence (AI) model trained to perform multiple video localization tasks with respect to media items of a platform. One or more outputs of the AI model are obtained. A segment of the media item that depicts the event is determined based on the one or more outputs of the AI model.
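
    The data flow described in the abstract (embeddings, fusion, model outputs, segment selection) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the application: fusion is shown as simple concatenation, the trained AI model is replaced by a hypothetical stand-in (`toy_model`) that scores each frame with a dot product, and the segment is taken to be the longest run of frames whose score meets a threshold. The function names `fuse`, `toy_model`, and `locate_segment` are likewise hypothetical.

```python
from typing import List, Optional, Tuple

Vector = List[float]


def fuse(video_embeddings: List[Vector], text_embedding: Vector) -> List[Vector]:
    """Assumed fusion step: append the text embedding to every frame embedding."""
    return [frame + text_embedding for frame in video_embeddings]


def toy_model(fused: List[Vector], frame_dim: int) -> List[float]:
    """Hypothetical stand-in for the trained AI model.

    Splits each fused row back into its frame and text parts and returns
    their dot product as a per-frame relevance score.
    """
    scores = []
    for row in fused:
        frame, text = row[:frame_dim], row[frame_dim:]
        scores.append(sum(f * t for f, t in zip(frame, text)))
    return scores


def locate_segment(scores: List[float], threshold: float) -> Optional[Tuple[int, int]]:
    """Return inclusive (start, end) frame indices of the longest run of
    scores >= threshold, or None if no frame qualifies."""
    best = None
    start = None
    # Iterate one step past the end so a run reaching the last frame is closed.
    for i in range(len(scores) + 1):
        above = i < len(scores) and scores[i] >= threshold
        if above and start is None:
            start = i
        elif not above and start is not None:
            if best is None or i - start > best[1] - best[0] + 1:
                best = (start, i - 1)
            start = None
    return best


# Four frames with 2-dimensional embeddings; the event text matches frames 1 and 2.
video_embeddings = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 0.0]]
text_embedding = [0.0, 1.0]

fused = fuse(video_embeddings, text_embedding)
scores = toy_model(fused, frame_dim=2)
segment = locate_segment(scores, threshold=0.5)
```

    A real system would presumably use learned embeddings and a trained multi-task model in place of the dot-product scorer; the sketch only shows how the stages connect.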
