-
Publication No.: US20250157071A1
Publication date: 2025-05-15
Application No.: US19024815
Filing date: 2025-01-16
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zhihao LI, Jianzhuang LIU, Songcen XU
Abstract: This disclosure provides data processing methods and devices relating to artificial intelligence. In an implementation, a method includes: processing a target image by using a first pose recognition model to obtain first pose information of a target object in the target image, processing the target image by using a second pose recognition model to obtain second pose information of the target object in the target image, and constructing a loss based on the first pose information, the second pose information, the two-dimensional projection information, and a corresponding annotation.
-
Publication No.: US20230368505A1
Publication date: 2023-11-16
Application No.: US18361011
Filing date: 2023-07-28
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Weimian LI, Kaiqiang ZHU, Fei HUANG, Songcen XU
IPC: G06V10/774, G06T7/12, G06T3/00, G06V10/40, G06V10/776, G06V10/74, G06V10/26, G06V10/762, G06F40/289
CPC classification number: G06V10/774, G06T7/12, G06T3/0006, G06V10/40, G06V10/776, G06V10/761, G06V10/26, G06V10/762, G06F40/289, G06T2207/20081
Abstract: This application discloses a model training method, and relates to the artificial intelligence field. The method includes: obtaining a plurality of training samples, where each training sample includes an image and a text, and the text describes a target object in the image; and inputting the plurality of training samples into a target model, so that the target model performs the following procedure until a preset stop condition is met: extracting an image feature of a first image and a text feature of a first text; obtaining a first loss value based on a difference between a first vector and a second vector, where a dimension of the first vector is the same as a dimension of the second vector; and updating the target model based on the first loss value.
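The first loss value described above can be illustrated with a minimal sketch. The abstract does not say how the first and second vectors are derived or which distance is used, so the mean squared difference and the function name `first_loss` below are assumptions made purely for illustration.

```python
def first_loss(first_vector, second_vector):
    """Mean squared difference between two vectors of equal dimension.

    The choice of mean squared error is an assumption; the abstract only
    states that the loss is based on a difference between the two vectors.
    """
    if not first_vector:
        raise ValueError("vectors must not be empty")
    if len(first_vector) != len(second_vector):
        raise ValueError("vectors must have the same dimension")
    squared = [(a - b) ** 2 for a, b in zip(first_vector, second_vector)]
    return sum(squared) / len(squared)


loss = first_loss([1.0, 2.0], [1.0, 4.0])
```

The dimension check mirrors the abstract's requirement that both vectors have the same dimension.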
-
Publication No.: US20220198836A1
Publication date: 2022-06-23
Application No.: US17689630
Filing date: 2022-03-08
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Xiaofei WU, Fei HUANG, Songcen XU, Youliang YAN
Abstract: A gesture recognition method, an electronic device, a computer-readable storage medium, and a chip, are provided, and relate to the field of artificial intelligence. The gesture recognition method includes: obtaining an image stream, and determining, based on a plurality of consecutive frames of hand images in the image stream, whether a user makes a preparatory action; when the user makes the preparatory action, continuing to obtain an image stream, and determining a gesture action of the user based on a plurality of consecutive frames of hand images in the continuously obtained image stream; and next, further responding to the gesture action to implement gesture interaction with the user. In this application, the preparatory action is determined before gesture recognition is performed, so that erroneous recognition occurring in a gesture recognition process can be reduced, thereby improving a gesture recognition effect.
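The two-stage flow in this abstract (wait for a preparatory action, then recognize the gesture) can be sketched as a small gate over a sliding window of frames. This is only a sketch under assumptions: `is_preparatory` and `classify_gesture` are hypothetical callables standing in for whatever models the application uses, and the window size of 3 frames is arbitrary.

```python
from collections import deque


def recognize_with_preparation(frames, is_preparatory, classify_gesture, window=3):
    """Return the first gesture recognized after a preparatory action, else None.

    is_preparatory and classify_gesture are hypothetical callables that each
    take a list of `window` consecutive frames.
    """
    buffer = deque(maxlen=window)
    armed = False  # True once the preparatory action has been observed
    for frame in frames:
        buffer.append(frame)
        if len(buffer) < window:
            continue
        if not armed:
            if is_preparatory(list(buffer)):
                armed = True
                buffer.clear()  # start a fresh window for the gesture itself
        else:
            gesture = classify_gesture(list(buffer))
            if gesture is not None:
                return gesture
    return None


# Toy stand-ins for the two recognizers, using strings as frames.
def toy_preparatory(window_frames):
    return all(f == "hover" for f in window_frames)


def toy_classifier(window_frames):
    return "swipe" if all(f == "swipe" for f in window_frames) else None


stream = ["idle", "idle", "hover", "hover", "hover", "swipe", "swipe", "swipe"]
result = recognize_with_preparation(stream, toy_preparatory, toy_classifier)
```

Because the classifier is never consulted before the gate is armed, a stream that contains a gesture without the preparatory action yields no result, which is the error-reduction effect the abstract describes.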
-
Publication No.: US20210350168A1
Publication date: 2021-11-11
Application No.: US17383181
Filing date: 2021-07-22
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhi TIAN, Tong HE, Chunhua SHEN, Youliang YAN, Songcen XU, Yiren ZHOU, Xiaofei WU, Jianzhuang LIU
Abstract: This application discloses an image segmentation method in the field of artificial intelligence. The method includes: obtaining an input image and a processing requirement; performing multi-layer feature extraction on the input image to obtain a plurality of feature maps; downsampling the plurality of feature maps to obtain a plurality of feature maps with a reference resolution, where the reference resolution is less than a resolution of the input image; fusing the plurality of feature maps with the reference resolution to obtain at least one feature map group; upsampling the feature map group by using a transformation matrix W, to obtain a target feature map group; and performing target processing on the target feature map group based on the processing requirement to obtain a target image.
-
Publication No.: US20200167554A1
Publication date: 2020-05-28
Application No.: US16776282
Filing date: 2020-01-29
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Liang WANG, Songcen XU, Chuanjian LIU, Jun HE
Abstract: This application provides a gesture recognition method, and relates to the field of man-machine interaction technologies. The method includes: extracting M images from a first video segment in a video stream; performing gesture recognition on the M images by using a deep learning algorithm, to obtain a gesture recognition result corresponding to the first video segment; and performing result combination on gesture recognition results of N consecutive video segments including the first video segment, to obtain a combined gesture recognition result. In the foregoing recognition process, a gesture in the video stream does not need to be segmented or tracked, but phase actions are recognized by using a deep learning algorithm with a relatively fast calculation speed, and then the phase actions are combined, so as to improve a gesture recognition speed, and reduce a gesture recognition delay.
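The sampling and combination steps in this abstract can be sketched as follows. The abstract does not specify how the M images are chosen or how the N per-segment results are combined, so even spacing and a majority vote are assumptions, and the function names are hypothetical.

```python
from collections import Counter


def sample_frames(segment, m):
    """Pick m evenly spaced frames from a segment (even spacing is an assumption)."""
    if m <= 0 or not segment:
        return []
    step = len(segment) / m
    return [segment[min(int(i * step), len(segment) - 1)] for i in range(m)]


def combine_results(segment_results, n):
    """Majority vote over the last n per-segment results (voting is an assumption)."""
    if n <= 0:
        return None
    recent = segment_results[-n:]
    if not recent:
        return None
    return Counter(recent).most_common(1)[0][0]


frames = sample_frames(list(range(10)), 5)
combined = combine_results(["fist", "wave", "wave", "fist", "fist"], 3)
```

Here each per-segment result would come from the deep learning recognizer applied to the sampled frames; that recognizer is outside the scope of this sketch.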
-
Publication No.: US20240292073A1
Publication date: 2024-08-29
Application No.: US18656705
Filing date: 2024-05-07
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Md Ibrahim KHALIL, Peng DAI, Hanwen LIANG, Lizhe CHEN, Varshanth Ravindra RAO, Juwei LU, Songcen XU
IPC: H04N21/8549, G06F3/0484, G06V10/70, G06V20/40
CPC classification number: H04N21/8549, G06V10/70, G06V20/46, G06V20/49, G06F3/0484
Abstract: Methods and devices for generating a customized video segment from a video are disclosed. The video is partitioned into video segments. For each respective video segment, a respective set of scores is computed, where each score represents a respective content feature in the respective video segment. A respective weighted aggregate score is computed for each respective video segment by applying, to each respective set of scores, a common set of weight values. A selected video segment is outputted as the customized video segment, where the selected video segment is selected from one or more high-ranked video segments having high-ranked weighted aggregate scores.
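The scoring and ranking described in this abstract can be sketched in a few lines. The sketch assumes each segment's scores are given as a list in a fixed content-feature order and that the weighted aggregate is a plain weighted sum; the function names and `top_k` parameter are hypothetical.

```python
def weighted_scores(segment_scores, weights):
    """Apply one common weight vector to every segment's set of scores."""
    return [sum(w * s for w, s in zip(weights, scores)) for scores in segment_scores]


def high_ranked_segments(segment_scores, weights, top_k=1):
    """Return indices of the top_k segments by weighted aggregate score."""
    totals = weighted_scores(segment_scores, weights)
    ranked = sorted(range(len(totals)), key=lambda i: totals[i], reverse=True)
    return ranked[:top_k]


# Three segments, two content features each (values are made up).
scores = [[0.2, 0.9], [0.8, 0.1], [0.5, 0.5]]
prefer_first_feature = high_ranked_segments(scores, [1.0, 0.0], top_k=2)
prefer_second_feature = high_ranked_segments(scores, [0.0, 1.0], top_k=2)
```

Changing only the common weight vector changes which segments rank highest, which is how a single set of per-segment scores can serve different customizations.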
-
Publication No.: US20240126808A1
Publication date: 2024-04-18
Application No.: US18548039
Filing date: 2021-12-20
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Lei HAO, Yu WANG, Min WANG, Songcen XU, Weicai ZHONG, Zhenhua ZHAO
IPC: G06F16/538, G06F16/583
CPC classification number: G06F16/538, G06F16/5854
Abstract: In one example method, a first image including M objects is obtained. For N objects in the M objects, when N is greater than or equal to 2, arrangement orders of the N objects is determined, where an arrangement order of any one of the N objects is determined based on at least one of a scene intent weight, a confidence score, or an object relationship score. The scene intent weight is used to indicate a probability that the any object is searched in a scene corresponding to the first image, the confidence score is a similarity between the any object and an image in an image library, and the object relationship score is used to indicate importance of the any object in the first image. Search results of some or all of the N objects are fed back according to the arrangement orders of the N objects.
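The ordering step in this abstract can be sketched as a sort over detected objects. The abstract says the order is based on at least one of the three scores but not how they are combined, so the unweighted sum below is an assumption, and the `DetectedObject` type is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class DetectedObject:
    name: str
    scene_intent_weight: float  # probability the object is searched in this scene
    confidence_score: float     # similarity to an image in the image library
    relationship_score: float   # importance of the object within the image


def arrangement_order(objects):
    """Sort objects from highest to lowest combined score (sum is an assumption)."""
    def combined(obj):
        return obj.scene_intent_weight + obj.confidence_score + obj.relationship_score
    return sorted(objects, key=combined, reverse=True)


detected = [
    DetectedObject("lamp", 0.25, 0.5, 0.25),
    DetectedObject("sofa", 0.5, 0.75, 0.5),
    DetectedObject("vase", 0.25, 0.25, 0.0),
]
ordered_names = [obj.name for obj in arrangement_order(detected)]
```

Search results would then be fed back in this order for some or all of the objects.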
-
Publication No.: US20230350499A1
Publication date: 2023-11-02
Application No.: US18345794
Filing date: 2023-06-30
Applicant: Huawei Technologies Co., Ltd.
Inventor: Juwei LU, Sayem Mohammad SIAM, Deepak SRIDHAR, Sidharth SINGLA, Yannick VERDIE, Xiaofei WU, Srikanth MURALIDHARAN, Roy YANG, Peng DAI, Songcen XU
CPC classification number: G06F3/017, G06V40/28, G06V40/10, G06V20/46, G06T7/70, G06T2207/10016
Abstract: Methods and devices for machine vision-based selection of content are described. One or more hands are detected in a current frame of video data. A respective fingertip location is determined for each of up to two of the detected hands. A content selection gesture is determined corresponding to the up to two detected hands. Selected content is extracted, as indicated by the content selection gesture and based on the up to two fingertip locations. The device may be a smartphone, a tablet, a laptop, a smart light device, a reader device, etc.
-
Publication No.: US20230082789A1
Publication date: 2023-03-16
Application No.: US17950246
Filing date: 2022-09-22
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Juwei LU, Sayem Mohammad SIAM, Wei ZHOU, Peng DAI, Xiaofei WU, Songcen XU
Abstract: Methods and systems for gesture-based control of a device are described. An input frame is processed to determine a location of a distinguishing anatomical feature in the input frame. A virtual gesture-space is defined based on the location of the distinguishing anatomical feature, the virtual gesture-space being a defined space for detecting a gesture input. The input frame is processed in only the virtual gesture-space, to detect and track a hand. Using information generated from detecting and tracking the at least one hand, a gesture class is determined for the at least one hand. The device may be a smart television, a smart phone, a tablet, etc.
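The idea of restricting hand detection to a virtual gesture-space can be sketched as computing a region around the detected anatomical feature and cropping the frame to it. This sketch assumes the feature is a face given as an `(x, y, width, height)` box and that the gesture-space is a rectangle centered on it; the scale factors and function names are hypothetical.

```python
def gesture_space(face_box, frame_size, width_scale=5.0, height_scale=3.0):
    """Return (left, top, right, bottom) of a region centered on the face box.

    The scale factors are illustrative assumptions, not values from the source.
    """
    x, y, w, h = face_box
    frame_w, frame_h = frame_size
    cx, cy = x + w / 2, y + h / 2
    half_w, half_h = w * width_scale / 2, h * height_scale / 2
    left = max(0, int(cx - half_w))
    top = max(0, int(cy - half_h))
    right = min(frame_w, int(cx + half_w))
    bottom = min(frame_h, int(cy + half_h))
    return left, top, right, bottom


def crop(frame, box):
    """Crop a frame stored as a list of pixel rows to the given box."""
    left, top, right, bottom = box
    return [row[left:right] for row in frame[top:bottom]]


frame = [[col for col in range(100)] for _ in range(80)]  # 100x80 dummy frame
box = gesture_space((45, 35, 10, 10), (100, 80), width_scale=2.0, height_scale=2.0)
region = crop(frame, box)
```

A hand detector would then run on `region` only, so hands outside the gesture-space are never considered.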
-
Publication No.: US20230020965A1
Publication date: 2023-01-19
Application No.: US17951271
Filing date: 2022-09-23
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Jun YUE, Li QIAN, Songcen XU, Bin SHAO
IPC: G06V10/774, G06V10/74, G06V10/80, G06V10/26, G06V10/46, G06V10/764
Abstract: This application provides a method and apparatus for updating an object recognition model in the field of artificial intelligence. In the technical solution provided in this application, a target image and first voice information of a user are obtained. The first voice information indicates a first category of a target object in the target image. A feature library of a first object recognition model is updated based on the target image and the first voice information. The updated first object recognition model includes a feature of the target object and a first label indicating the first category, and the feature of the target object corresponds to the first label. A recognition rate of an object recognition model can be improved more easily according to the technical solution provided in this application.
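The feature library update in this abstract can be sketched as adding a labeled feature to a lookup structure. The sketch assumes the feature vector has already been extracted from the target image and the label has already been obtained from the voice information; the dictionary layout, the nearest-neighbor lookup, and the function names are all hypothetical.

```python
def update_feature_library(library, feature, label):
    """Store the target object's feature under the label given by the user."""
    library.setdefault(label, []).append(list(feature))
    return library


def recognize(library, feature):
    """Return the label of the closest stored feature (nearest neighbor is an assumption)."""
    best_label, best_distance = None, float("inf")
    for label, stored_features in library.items():
        for stored in stored_features:
            distance = sum((a - b) ** 2 for a, b in zip(stored, feature))
            if distance < best_distance:
                best_label, best_distance = label, distance
    return best_label


library = {"cat": [[0.0, 0.0]]}
update_feature_library(library, [1.0, 1.0], "mug")  # label spoken by the user
label_for_new_image = recognize(library, [0.9, 1.1])
```

Because the update only appends to the library, a new category becomes recognizable without retraining the underlying feature extractor, which is one plausible reading of "improved more easily".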