Behavior-based standard operating procedure detection

    公开(公告)号:US12073027B2

    公开(公告)日:2024-08-27

    申请号:US18084885

    申请日:2022-12-20

    摘要: Implementations are directed to receiving a first set of images included in a first video captured by a camera that monitors a human performing a task; processing the first set of images using a first machine learning (ML) model to determine whether the first set of images depicts a gesture that is included in a predefined set of gestures; in response to determining that the first set of images depicts a gesture included in a predefined set of gestures, processing a second set of images included in the first video using a second ML model to determine a first gesture type of the gesture; comparing the first gesture type with a first expected gesture type to determine whether performance of the task conforms to a standard operating procedure (SOP) for the task; and providing feedback representative of a comparison result in a user interface.

    PROVISIONING RESOURCE-EFFICIENT ARTIFICIAL INTELLIGENCE MODELS

    公开(公告)号:US20240273399A1

    公开(公告)日:2024-08-15

    申请号:US18168723

    申请日:2023-02-14

    发明人: Yao Yang David Nguyen

    IPC分类号: G06N20/00

    CPC分类号: G06N20/00

    摘要: Implementations for receiving first user input representative of a first accuracy-to-resource value for an AI model, determining a first training recipe for training of the AI model, the first training recipe including a first set of reduction strategies to be performed during training of the AI model, the first training recipe being determined through genetic search of an initial population to provide an updated population, the first set of reduction strategies being selected from the updated population, providing the first training recipe for training of the AI model to provide a first trained version of the AI model at least partially by executing one or more of pruning and quantization during training of the AI model, and outputting the first trained version of the AI model for inference.

    BEHAVIOR-BASED STANDARD OPERATING PROCEDURE DETECTION

    公开(公告)号:US20240201789A1

    公开(公告)日:2024-06-20

    申请号:US18084885

    申请日:2022-12-20

    摘要: Implementations are directed to receiving a first set of images included in a first video captured by a camera that monitors a human performing a task; processing the first set of images using a first machine learning (ML) model to determine whether the first set of images depicts a gesture that is included in a predefined set of gestures; in response to determining that the first set of images depicts a gesture included in a predefined set of gestures, processing a second set of images included in the first video using a second ML model to determine a first gesture type of the gesture; comparing the first gesture type with a first expected gesture type to determine whether performance of the task conforms to a standard operating procedure (SOP) for the task; and providing feedback representative of a comparison result in a user interface.

    HUMAN OBJECT INTERACTION DETECTION USING COMPOSITIONAL MODEL

    公开(公告)号:US20240233439A1

    公开(公告)日:2024-07-11

    申请号:US18152627

    申请日:2023-01-10

    摘要: Implementations include actions of receiving an image; extracting a visual HOI and a set of visual embeddings, the visual HOI indicating a subject and an object; obtaining, using a vector library, a set of semantic HOIs and sets of semantic embeddings based on the subject, the object and a set of verbs included in the vector library, each set of semantic embeddings corresponding to a semantic HOI; processing, by a compositional model, the set of visual embeddings to provide a set of transition visual embeddings; processing the sets of semantic embeddings to provide respective sets of transition semantic embeddings; determining a set of scores based on the set of transition visual embeddings and the sets of transition semantic embeddings, each score representing a degree of similarity between the visual HOI and a semantic HOI; and determining at least one predicted HOI represented within the image based on the scores.

    WEIGHTED FACTORIZATION FOR HUMAN-OBJECT-INTERACTION DETECTION

    公开(公告)号:US20240233440A1

    公开(公告)日:2024-07-11

    申请号:US18153166

    申请日:2023-01-11

    IPC分类号: G06V40/20 G06V10/40 G06V10/77

    CPC分类号: G06V40/20 G06V10/40 G06V10/77

    摘要: Implementations include actions of receiving an image, providing a set of features for the image, determining a set of HOIs including one or more HOIs that are potentially represented in the image, providing sets of feature scores by, for each HOI in the set of HOIs, determining, by a first ML model, a set of feature scores for respective features in the set of features, generating, by a second ML model, sets of weights based on the set of HOIs, providing a set of final scores by, for each HOI in the set of HOIs, determining a final score based on a respective set of weights and the set of feature scores, each final score corresponding to a respective HOI in the set of HOIs, and selecting an output HOI for the image from the set of HOIs based on the set of final scores.