MULTI-VIEW MULTI-TARGET ACTION RECOGNITION

    Publication No.: US20230050992A1

    Publication date: 2023-02-16

    Application No.: US17559751

    Filing date: 2021-12-22

    IPC classification: G06T7/73 G06T7/292 G06V40/20

    Abstract: Implementations generally perform robust multi-view multi-target action recognition using reconstructed 3-dimensional (3D) poses. In some implementations, a method includes obtaining a plurality of videos of a plurality of subjects in an environment, where at least one target subject of the plurality of subjects performs one or more actions in the environment. The method further includes tracking the at least one target subject across at least two cameras. The method further includes reconstructing a 3-dimensional (3D) model of the at least one target subject based on the plurality of videos and the tracking of the at least one target subject. The method further includes recognizing the one or more actions of the at least one target subject based on the reconstructing of the 3D model.

    Surgical scene assessment based on computer vision

    Publication No.: US11625834B2

    Publication date: 2023-04-11

    Application No.: US16808265

    Filing date: 2020-03-03

    Applicant: Sony Corporation

    Abstract: Implementations generally relate to surgical scene assessment based on computer vision. In some implementations, a method includes receiving a first image frame of a plurality of image frames associated with a surgical scene. The method further includes detecting one or more objects in the first image frame. The method further includes determining one or more positions corresponding to the one or more objects. The method further includes tracking each position of the one or more objects in other image frames of the plurality of image frames.
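
The tracking step in the abstract above (following each detected object's position through later frames) can be illustrated with a minimal sketch. The abstract does not say how tracking is done, so this is an assumption: a greedy nearest-neighbour association over 2D positions. The function name `track_positions`, the `max_dist` threshold, and the input layout are all hypothetical.

```python
import math

def track_positions(frames, max_dist=50.0):
    """Greedy nearest-neighbour tracker (illustrative assumption, not the patented method).

    frames: list of frames; each frame is a list of (x, y) detections.
    Returns a dict mapping track id -> list of (frame_index, (x, y)).
    """
    tracks = {}   # track id -> history of (frame_index, position)
    last = {}     # track id -> most recent position
    next_id = 0
    for t, detections in enumerate(frames):
        # Each existing track may be claimed by at most one detection per frame.
        available = dict(last)
        for pos in detections:
            best_id, best_d = None, max_dist
            for tid, prev in available.items():
                d = math.dist(pos, prev)
                if d <= best_d:
                    best_id, best_d = tid, d
            if best_id is None:
                # No track close enough: start a new one.
                best_id = next_id
                next_id += 1
                tracks[best_id] = []
            else:
                del available[best_id]
            tracks[best_id].append((t, pos))
            last[best_id] = pos
    return tracks

# Two objects moving slowly over three frames.
frames = [
    [(10, 10), (100, 100)],
    [(12, 11), (98, 103)],
    [(15, 12), (97, 105)],
]
tracks = track_positions(frames)
```

This sketch never retires a track and resolves conflicts greedily in detection order; a production tracker would typically use a global assignment and appearance cues.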

    CLINICAL ACTIVITY RECOGNITION WITH MULTIPLE CAMERAS

    Publication No.: US20220398396A1

    Publication date: 2022-12-15

    Application No.: US17344730

    Filing date: 2021-06-10

    IPC classification: G06K9/00 G06T7/73

    Abstract: Implementations generally recognize clinical activity using multiple cameras. In some implementations, a method includes obtaining a plurality of videos of a plurality of objects in an environment. The method further includes determining one or more key points for each object of the plurality of objects. The method further includes recognizing activity information based on the one or more key points. The method further includes computing workflow information based on the activity information.
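
As a rough illustration of the last step in the abstract above (deriving workflow information from recognized activity), the sketch below collapses per-frame activity labels into timed segments. The abstract does not define "workflow information", so treating it as a list of phases with start and end times is an assumption, and the function name `workflow_segments`, the `fps` parameter, and the example labels are hypothetical.

```python
from itertools import groupby

def workflow_segments(labels, fps=30.0):
    """Collapse per-frame activity labels into (label, start_sec, end_sec) segments.

    labels: sequence of activity labels, one per video frame (assumed input).
    fps: frame rate used to convert frame counts into seconds.
    """
    segments = []
    start = 0
    for label, run in groupby(labels):
        n = len(list(run))  # number of consecutive frames with this label
        segments.append((label, start / fps, (start + n) / fps))
        start += n
    return segments

# Hypothetical per-frame labels at 1 frame per second.
labels = ["prep"] * 3 + ["procedure"] * 5 + ["cleanup"] * 2
segments = workflow_segments(labels, fps=1.0)
```

Segment durations, phase order, and transition counts can all be read off the returned list.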

    SURGICAL SCENE ASSESSMENT BASED ON COMPUTER VISION

    Publication No.: US20210142487A1

    Publication date: 2021-05-13

    Application No.: US16808265

    Filing date: 2020-03-03

    Applicant: Sony Corporation

    IPC classification: G06T7/20 G06T7/70 A61B34/20

    Abstract: Implementations generally relate to surgical scene assessment based on computer vision. In some implementations, a method includes receiving a first image frame of a plurality of image frames associated with a surgical scene. The method further includes detecting one or more objects in the first image frame. The method further includes determining one or more positions corresponding to the one or more objects. The method further includes tracking each position of the one or more objects in other image frames of the plurality of image frames.

    POSE RECONSTRUCTION BY TRACKING FOR VIDEO ANALYSIS

    Publication No.: US20220398768A1

    Publication date: 2022-12-15

    Application No.: US17344734

    Filing date: 2021-06-10

    IPC classification: G06T7/73 G06T7/292 G06K9/00

    Abstract: Implementations generally perform pose reconstruction by tracking for video analysis. In some implementations, a method includes obtaining a plurality of videos of at least one subject performing at least one action in an environment. The method further includes tracking the at least one subject across at least two cameras. The method further includes reconstructing a 3-dimensional (3D) model of the at least one subject based on the plurality of videos and the tracking of the at least one subject.
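
To make the idea of 3D reconstruction from two camera views concrete, here is a minimal sketch of midpoint triangulation for a single key point. The abstract does not state which reconstruction method is used, so this technique is an assumption. The sketch also assumes each camera has already been calibrated and that the matched 2D key point has been back-projected into a 3D ray (camera centre plus direction); the function name `triangulate_midpoint` is hypothetical.

```python
def triangulate_midpoint(c1, d1, c2, d2, eps=1e-12):
    """Estimate a 3D point as the midpoint of the shortest segment between two rays.

    c1, c2: camera centres as (x, y, z) tuples.
    d1, d2: ray directions through the matched key point (need not be unit length).
    Raises ValueError if the rays are (nearly) parallel.
    """
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))

    w = tuple(a - b for a, b in zip(c1, c2))
    a, b, c = dot(d1, d1), dot(d1, d2), dot(d2, d2)
    d, e = dot(d1, w), dot(d2, w)
    denom = a * c - b * b
    if abs(denom) < eps:
        raise ValueError("rays are parallel; cannot triangulate")
    # Parameters of the closest points on each ray.
    s = (b * e - c * d) / denom
    t = (a * e - b * d) / denom
    p1 = tuple(ci + s * di for ci, di in zip(c1, d1))
    p2 = tuple(ci + t * di for ci, di in zip(c2, d2))
    return tuple((u + v) / 2.0 for u, v in zip(p1, p2))

# Two cameras 2 units apart, both observing a point at (1, 1, 5).
point = triangulate_midpoint((0, 0, 0), (1, 1, 5), (2, 0, 0), (-1, 1, 5))
```

Repeating this for every tracked key point of a subject gives a 3D skeleton per frame. With more than two cameras, a least-squares solution over all rays is the usual generalization.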