-
Publication No.: US12073027B2
Publication Date: 2024-08-27
Application No.: US18084885
Filing Date: 2022-12-20
Inventors: Keyu Qi, Hailing Zhou, Nan Ke, David Nguyen, Binghao Tang
CPC Classification: G06F3/017, G06V10/761, G06V10/82, G06V20/52
Abstract: Implementations are directed to receiving a first set of images included in a first video captured by a camera that monitors a human performing a task; processing the first set of images using a first machine learning (ML) model to determine whether the first set of images depicts a gesture that is included in a predefined set of gestures; in response to determining that the first set of images depicts a gesture included in the predefined set of gestures, processing a second set of images included in the first video using a second ML model to determine a first gesture type of the gesture; comparing the first gesture type with a first expected gesture type to determine whether performance of the task conforms to a standard operating procedure (SOP) for the task; and providing feedback representative of a comparison result in a user interface.
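
The two-stage flow in this abstract (detect first, classify only on a hit, then compare against the SOP) could be sketched roughly as follows. The function name `check_sop_step`, the `Frame` type, and the two callables standing in for the first and second ML models are illustrative assumptions, not anything taken from the patent.

```python
from typing import Callable, Optional, Sequence

Frame = Sequence[float]  # hypothetical stand-in for one video frame


def check_sop_step(
    first_frames: Sequence[Frame],
    second_frames: Sequence[Frame],
    detect_gesture: Callable[[Sequence[Frame]], bool],   # assumed first model
    classify_gesture: Callable[[Sequence[Frame]], str],  # assumed second model
    expected_type: str,
) -> Optional[dict]:
    """Classify only when the detector fires, then compare with the SOP."""
    if not detect_gesture(first_frames):
        return None  # no predefined gesture seen, so nothing to compare
    observed = classify_gesture(second_frames)
    return {
        "observed": observed,
        "expected": expected_type,
        "conforms": observed == expected_type,
    }
```

Gating the classifier behind the detector means the second model runs only on frames that are likely to contain a gesture; the returned dictionary is the kind of comparison result a user interface could display.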
-
Publication No.: US20240273399A1
Publication Date: 2024-08-15
Application No.: US18168723
Filing Date: 2023-02-14
Inventors: Yao Yang, David Nguyen
IPC Classification: G06N20/00
CPC Classification: G06N20/00
Abstract: Implementations include receiving first user input representative of a first accuracy-to-resource value for an AI model; determining a first training recipe for training of the AI model, the first training recipe including a first set of reduction strategies to be performed during training of the AI model, the first training recipe being determined through genetic search of an initial population to provide an updated population, the first set of reduction strategies being selected from the updated population; providing the first training recipe for training of the AI model to provide a first trained version of the AI model at least partially by executing one or more of pruning and quantization during training of the AI model; and outputting the first trained version of the AI model for inference.
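
As a rough illustration of a genetic search over reduction strategies, the sketch below evolves a population of recipes, each a pair of pruning ratio and quantization bit width. The `Recipe` encoding, the toy `fitness` formula, and the mutation rules are all assumptions made for this example; the abstract does not specify any of them, and a real system would score recipes by training or estimating the model.

```python
import random
from typing import List, Tuple

Recipe = Tuple[float, int]  # (pruning ratio, quantization bits), hypothetical


def fitness(recipe: Recipe, accuracy_weight: float) -> float:
    """Toy proxy: pruning and low bit widths save resources but cost accuracy."""
    prune, bits = recipe
    compression = 1.0 - bits / 32.0
    accuracy = 1.0 - 0.3 * prune - 0.2 * compression
    saving = 0.5 * prune + 0.5 * compression
    return accuracy_weight * accuracy + (1.0 - accuracy_weight) * saving


def genetic_search(population: List[Recipe], accuracy_weight: float,
                   generations: int = 10, seed: int = 0) -> List[Recipe]:
    """Return the updated population, best recipe first."""
    rng = random.Random(seed)
    pop = list(population)
    for _ in range(generations):
        pop.sort(key=lambda r: fitness(r, accuracy_weight), reverse=True)
        survivors = pop[: max(2, len(pop) // 2)]  # elitism: keep the best half
        children: List[Recipe] = []
        while len(survivors) + len(children) < len(pop):
            a, b = rng.sample(survivors, 2)
            # Crossover: pruning ratio from one parent, bit width from the other.
            prune = min(0.9, max(0.0, a[0] + rng.uniform(-0.05, 0.05)))
            bits = rng.choice([b[1], 4, 8, 16])  # mutation may swap the bit width
            children.append((prune, bits))
        pop = survivors + children
    pop.sort(key=lambda r: fitness(r, accuracy_weight), reverse=True)
    return pop
```

Here `accuracy_weight` plays the role of the user-supplied accuracy-to-resource value: values near 1 favor accuracy, values near 0 favor resource savings.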
-
Publication No.: US20240201789A1
Publication Date: 2024-06-20
Application No.: US18084885
Filing Date: 2022-12-20
Inventors: Keyu Qi, Hailing Zhou, Nan Ke, David Nguyen, Binghao Tang
CPC Classification: G06F3/017, G06V10/761, G06V10/82, G06V20/52
Abstract: Implementations are directed to receiving a first set of images included in a first video captured by a camera that monitors a human performing a task; processing the first set of images using a first machine learning (ML) model to determine whether the first set of images depicts a gesture that is included in a predefined set of gestures; in response to determining that the first set of images depicts a gesture included in the predefined set of gestures, processing a second set of images included in the first video using a second ML model to determine a first gesture type of the gesture; comparing the first gesture type with a first expected gesture type to determine whether performance of the task conforms to a standard operating procedure (SOP) for the task; and providing feedback representative of a comparison result in a user interface.
-
Publication No.: US20240233439A1
Publication Date: 2024-07-11
Application No.: US18152627
Filing Date: 2023-01-10
Inventors: David Nguyen, Hailing Zhou, Nan Ke
IPC Classification: G06V40/20, G06F40/30, G06V10/74, G06V10/774, G06V20/70
CPC Classification: G06V40/20, G06F40/30, G06V10/761, G06V10/774, G06V20/70
Abstract: Implementations include actions of receiving an image; extracting a visual human-object interaction (HOI) and a set of visual embeddings, the visual HOI indicating a subject and an object; obtaining, using a vector library, a set of semantic HOIs and sets of semantic embeddings based on the subject, the object and a set of verbs included in the vector library, each set of semantic embeddings corresponding to a semantic HOI; processing, by a compositional model, the set of visual embeddings to provide a set of transition visual embeddings; processing the sets of semantic embeddings to provide respective sets of transition semantic embeddings; determining a set of scores based on the set of transition visual embeddings and the sets of transition semantic embeddings, each score representing a degree of similarity between the visual HOI and a semantic HOI; and determining at least one predicted HOI represented within the image based on the scores.
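
The scoring step at the end of this abstract can be illustrated with a small ranking sketch. It assumes cosine similarity as the similarity measure and treats its inputs as embeddings that have already passed through the compositional model; neither assumption comes from the abstract, and the names `cosine` and `rank_hois` are made up for the example.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 when either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_hois(visual: Sequence[float],
              semantic: Dict[str, Sequence[float]]) -> List[Tuple[str, float]]:
    """Score every candidate semantic HOI against the visual embedding."""
    scores = [(label, cosine(visual, emb)) for label, emb in semantic.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)
```

The first entry of the ranked list would be the predicted HOI; a threshold on the score could be used instead when more than one interaction may be present.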
-
Publication No.: US20240233440A1
Publication Date: 2024-07-11
Application No.: US18153166
Filing Date: 2023-01-11
Inventors: David Nguyen, Hailing Zhou, Nan Ke
Abstract: Implementations include actions of receiving an image; providing a set of features for the image; determining a set of human-object interactions (HOIs) including one or more HOIs that are potentially represented in the image; providing sets of feature scores by, for each HOI in the set of HOIs, determining, by a first ML model, a set of feature scores for respective features in the set of features; generating, by a second ML model, sets of weights based on the set of HOIs; providing a set of final scores by, for each HOI in the set of HOIs, determining a final score based on a respective set of weights and the set of feature scores, each final score corresponding to a respective HOI in the set of HOIs; and selecting an output HOI for the image from the set of HOIs based on the set of final scores.
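
One plausible reading of the final-score step is a per-HOI weighted sum of feature scores. The sketch below assumes exactly that, and takes the outputs of the two ML models as plain dictionaries; the combination rule and the function names are assumptions, since the abstract only says the final score is "based on" the weights and feature scores.

```python
from typing import Dict, Sequence


def final_scores(feature_scores: Dict[str, Sequence[float]],
                 weights: Dict[str, Sequence[float]]) -> Dict[str, float]:
    """Combine each HOI's feature scores with that HOI's own weights."""
    return {
        hoi: sum(w * s for w, s in zip(weights[hoi], scores))
        for hoi, scores in feature_scores.items()
    }


def select_hoi(feature_scores: Dict[str, Sequence[float]],
               weights: Dict[str, Sequence[float]]) -> str:
    """Pick the HOI with the highest final score."""
    totals = final_scores(feature_scores, weights)
    return max(totals, key=totals.get)
```

Because the weights depend on the HOI, a feature that matters for one interaction (for example, relative position) can be down-weighted for another.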
-
Publication No.: US20240273702A1
Publication Date: 2024-08-15
Application No.: US18168309
Filing Date: 2023-02-13
Inventors: Binghao Tang, Keyu Qi, David Nguyen
CPC Classification: G06T7/0008, G06T5/50, G06T5/77, G06V10/25, G06V10/761, G06T2207/10024, G06T2207/20084, G06T2207/20224
Abstract: Implementations include receiving an image of an object; obtaining a reconstructed image by processing the image through an ML model; obtaining a gradient difference image by comparing the image to the reconstructed image; generating an output image at least partially by suppressing non-significant regions representing non-significant anomalies from the gradient difference image using a non-significant suppression (NSS) map; determining whether an anomaly is depicted in the output image; and in response to determining that an anomaly is depicted in the output image, sending an alert indicating that the object is defective.
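
A minimal sketch of the gradient-difference and suppression steps, using nested lists as grayscale images. Several details are assumptions made for illustration: the forward-difference gradient, the absolute difference of gradient magnitudes, a binary NSS map (1 keeps a pixel, 0 suppresses it), and the threshold test for deciding that an anomaly is present.

```python
from typing import List

Image = List[List[float]]  # grayscale image as rows of pixel values


def gradient_magnitude(img: Image) -> Image:
    """Forward-difference gradient magnitude (|gx| + |gy|), clamped at edges."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            gx = img[y][min(x + 1, w - 1)] - img[y][x]
            gy = img[min(y + 1, h - 1)][x] - img[y][x]
            out[y][x] = abs(gx) + abs(gy)
    return out


def is_defective(image: Image, reconstructed: Image,
                 nss_map: Image, threshold: float) -> bool:
    """Mask the gradient difference with the NSS map, then threshold it."""
    g_img = gradient_magnitude(image)
    g_rec = gradient_magnitude(reconstructed)
    output = [
        [abs(a - b) * m for a, b, m in zip(row_a, row_b, row_m)]
        for row_a, row_b, row_m in zip(g_img, g_rec, nss_map)
    ]
    return any(v > threshold for row in output for v in row)
```

In this reading, the reconstruction model reproduces only defect-free appearance, so large gradient differences mark candidate defects, and the NSS map removes regions where such differences are known to be harmless.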