DATA PROCESSING METHOD AND APPARATUS

    公开(公告)号:US20250157071A1

    公开(公告)日:2025-05-15

    申请号:US19024815

    申请日:2025-01-16

    Abstract: This disclosure provides data processing methods and devices relating to artificial intelligence. In an implementation, a method includes: processing a target image by using a first pose recognition model to obtain first pose information of a target object in the target image, processing the target image by using a second pose recognition model to obtain second pose information of the target object in the target image, and constructing a loss based on the first pose information, the second pose information, the two-dimensional projection information, and a corresponding annotation.

    GESTURE RECOGNITION METHOD, ELECTRONIC DEVICE, COMPUTER-READABLE STORAGE MEDIUM, AND CHIP

    公开(公告)号:US20220198836A1

    公开(公告)日:2022-06-23

    申请号:US17689630

    申请日:2022-03-08

    Abstract: A gesture recognition method, an electronic device, a computer-readable storage medium, and a chip, are provided, and relate to the field of artificial intelligence. The gesture recognition method includes: obtaining an image stream, and determining, based on a plurality of consecutive frames of hand images in the image stream, whether a user makes a preparatory action; when the user makes the preparatory action, continuing to obtain an image stream, and determining a gesture action of the user based on a plurality of consecutive frames of hand images in the continuously obtained image stream; and next, further responding to the gesture action to implement gesture interaction with the user. In this application, the preparatory action is determined before gesture recognition is performed, so that erroneous recognition occurring in a gesture recognition process can be reduced, thereby improving a gesture recognition effect.

    IMAGE SEGMENTATION METHOD AND IMAGE PROCESSING APPARATUS

    公开(公告)号:US20210350168A1

    公开(公告)日:2021-11-11

    申请号:US17383181

    申请日:2021-07-22

    Abstract: This application discloses an image segmentation method in the field of artificial intelligence. The method includes: obtaining an input image and a processing requirement; performing multi-layer feature extraction on the input image to obtain a plurality of feature maps; downsampling the plurality of feature maps to obtain a plurality of feature maps with a reference resolution, where the reference resolution is less than a resolution of the input image; fusing the plurality of feature maps with the reference resolution to obtain at least one feature map group; upsampling the feature map group by using a transformation matrix W, to obtain a target feature map group; and performing target processing on the target feature map group based on the processing requirement to obtain a target image.

    Gesture Recognition Method, Apparatus, And Device

    公开(公告)号:US20200167554A1

    公开(公告)日:2020-05-28

    申请号:US16776282

    申请日:2020-01-29

    Abstract: This application provides a gesture recognition method, and relates to the field of man-machine interaction technologies. The method includes: extracting M images from a first video segment in a video stream; performing gesture recognition on the M images by using a deep learning algorithm, to obtain a gesture recognition result corresponding to the first video segment; and performing result combination on gesture recognition results of N consecutive video segments including the first video segment, to obtain a combined gesture recognition result. In the foregoing recognition process, a gesture in the video stream does not need to be segmented or tracked, but phase actions are recognized by using a deep learning algorithm with a relatively fast calculation speed, and then the phase actions are combined, so as to improve a gesture recognition speed, and reduce a gesture recognition delay.

    SEARCH RESULT FEEDBACK METHOD AND APPARATUS, AND STORAGE MEDIUM

    公开(公告)号:US20240126808A1

    公开(公告)日:2024-04-18

    申请号:US18548039

    申请日:2021-12-20

    CPC classification number: G06F16/538 G06F16/5854

    Abstract: In one example method, a first image including M objects is obtained. For N objects in the M objects, when N is greater than or equal to 2, arrangement orders of the N objects is determined, where an arrangement order of any one of the N objects is determined based on at least one of a scene intent weight, a confidence score, or an object relationship score. The scene intent weight is used to indicate a probability that the any object is searched in a scene corresponding to the first image, the confidence score is a similarity between the any object and an image in an image library, and the object relationship score is used to indicate importance of the any object in the first image. Search results of some or all of the N objects are fed back according to the arrangement orders of the N objects.

    METHODS AND SYSTEMS FOR HAND GESTURE-BASED CONTROL OF A DEVICE

    公开(公告)号:US20230082789A1

    公开(公告)日:2023-03-16

    申请号:US17950246

    申请日:2022-09-22

    Abstract: Methods and systems for gesture-based control of a device are described. An input frame is processed to determine a location of a distinguishing anatomical feature in the input frame. A virtual gesture-space is defined based on the location of the distinguishing anatomical feature, the virtual gesture-space being a defined space for detecting a gesture input. The input frame is processed in only the virtual gesture-space, to detect and track a hand. Using information generated from detecting and tracking the at least one hand, a gesture class is determined for the at least one hand. The device may be a smart television, a smart phone, a tablet, etc.

    METHOD AND APPARATUS FOR UPDATING OBJECT RECOGNITION MODEL

    公开(公告)号:US20230020965A1

    公开(公告)日:2023-01-19

    申请号:US17951271

    申请日:2022-09-23

    Abstract: This application provides a method and apparatus for updating an object recognition model in the field of artificial intelligence. In the technical solution provided in this application, a target image and first voice information of a user are obtained. The first voice information indicates a first category of a target object in the target image. A feature library of a first object recognition model is updated based on the target image and the first voice information. The updated first object recognition model includes a feature of the target object and a first label indicating the first category, and the feature of the target object corresponds to the first label. A recognition rate of an object recognition model can be improved more easily according to the technical solution provided in this application.

Patent Agency Ranking