-
公开(公告)号:US20250113087A1
公开(公告)日:2025-04-03
申请号:US18395356
申请日:2023-12-22
Applicant: Lemon Inc.
Inventor: Ju He , Qihang Yu , Inkyu Shin , Xueqing Deng , Xiaohui Shen , Liang-Chieh Chen
IPC: H04N21/845 , H04N21/44
Abstract: The present disclosure describes techniques for implementing video segmentation. A video is divided into a plurality of clips. Each of the plurality of clips comprises several frames. Axial-trajectory attention is applied to each of the plurality of clips by a first sub-model. Clip features corresponding to each of the plurality of clips are generated by the first sub-model. A set of object queries corresponding to each of the plurality of clips is generated based on the clip features by a transformer decoder. Trajectory attention is applied to refine sets of object queries corresponding to the plurality of clips by a second sub-model. Video-level segmentation results are generated based on the refined object queries.