-
公开(公告)号:US20240046606A1
公开(公告)日:2024-02-08
申请号:US18363175
申请日:2023-08-01
Applicant: NEC Laboratories America, Inc.
Inventor: Kai Li , Renqiang Min , Deep Patel , Erik Kruus , Xin Hu
IPC: G06V10/62 , G06V20/40 , G06V10/82 , G06V10/774 , G06V10/776 , G06V10/77
CPC classification number: G06V10/62 , G06V20/41 , G06V20/46 , G06V10/82 , G06V10/774 , G06V10/776 , G06V10/7715
Abstract: Methods and systems for temporal action localization include processing a video stream to identify an action and a start time and a stop time for the action using a neural network model that separately processes information of appearance and motion modalities from the video stream using transformer branches that include a self-attention and a cross-attention between the appearance and motion modalities. An action is performed responsive to the identified action.