-
公开(公告)号:US12014275B2
公开(公告)日:2024-06-18
申请号:US17081758
申请日:2020-10-27
发明人: Xuebo Liu
摘要: A method for text recognition, an electronic device and a storage medium are provided. The method includes: performing feature extraction processing on an image to be detected to obtain a plurality of semantic vectors, each of the plurality of semantic vectors corresponds to one of a plurality of characters of a text sequence in the image to be detected; and sequentially performing recognition processing on the plurality of semantic vectors through a convolutional neutral network to obtain a recognition result of the text sequence.
-
公开(公告)号:US11769499B2
公开(公告)日:2023-09-26
申请号:US17204568
申请日:2021-03-17
发明人: Zilong Zhang , Qing Luan , Lin Sun
摘要: Methods, electronic devices, and storage media for driving an interaction object are provided. The methods include: obtaining an audio signal at a periphery of a display device; obtaining, based on the audio signal, first driving data for driving the interaction object to respond; monitoring, in response to outputting the first driving data, the audio signal for detecting a sound of a target object; and driving, based on a presence state of the sound of the target object in the audio signal, the interaction object to respond.
-
公开(公告)号:US11741629B2
公开(公告)日:2023-08-29
申请号:US17102305
申请日:2020-11-23
发明人: Fubao Xie , Wentao Liu , Chen Qian
IPC分类号: G06T7/73 , G06T11/60 , G06T7/246 , G06T13/40 , G06T7/00 , G06V40/10 , G06V40/16 , G06V10/75 , G06V10/24
CPC分类号: G06T7/73 , G06T7/0014 , G06T7/246 , G06T7/74 , G06T11/60 , G06T13/40 , G06V10/24 , G06V10/754 , G06V40/10 , G06V40/176 , G06T2200/04 , G06T2207/30196 , G06T2207/30201
摘要: Embodiments of the present disclosure disclose image processing methods and apparatuses, image devices, and storage media. The image processing method includes: obtaining an image; obtaining the feature of the limb of the body based on the image, where the limb includes the upper limb and/or the lower limb; determining first-type movement information of the limb based on the feature; and controlling the movement of the limb of a controlled model according to the first-type movement information.
-
公开(公告)号:US11521095B2
公开(公告)日:2022-12-06
申请号:US16052500
申请日:2018-08-01
发明人: Xiaogang Wang , Lijun Wang , Wanli Ouyang , Huchuan Lu
IPC分类号: G06N3/08 , G06K9/62 , G06K9/46 , G06T7/11 , G06T7/73 , G06T7/20 , G06N5/04 , G06N3/04 , G06V10/44 , G06V20/10
摘要: Disclosed are methods, apparatuses and systems for CNN network adaption and object online tracking. The CNN network adaption method comprises: transforming a first feature map into a plurality of sub-feature maps, wherein the first feature map is generated by the pre-trained CNN according to a frame of the target video; convolving each of the sub-feature maps with one of a plurality of adaptive convolution kernels, respectively, to output a plurality of second feature maps with improved adaptability; training, frame by frame, the adaptive convolution kernels.
-
5.
公开(公告)号:US11481975B2
公开(公告)日:2022-10-25
申请号:US17073778
申请日:2020-10-19
发明人: Yuanzhen Hao , Mingyang Huang , Jianping Shi
摘要: An image processing method and apparatus, and a computer-readable storage medium are provided. The method includes: determining a first region matching a target object in a first image; determining a deformation parameter based on a preset deformation effect, the deformation parameter being used for determining a position deviation, generated based on the preset deformation effect, of each pixel point of the target object; and performing deformation processing on the target object in the first image based on the deformation parameter to obtain a second image.
-
公开(公告)号:US20220327385A1
公开(公告)日:2022-10-13
申请号:US17853816
申请日:2022-06-29
发明人: Xingang PAN , Xiaohang ZHAN , Bo DAI , Dahua LIN , Ping LUO
摘要: The present disclosure relates to a network training method, an electronic device and a storage medium. The network training method includes the following steps. At least one implicit vector may be input into at least one pre-trained generative network to obtain a first generated image; the generative network may be obtained with a discriminative network through adversarial trainings with a plurality of natural images. A degradation process may be performed on the first generated image to obtain a first degraded image of the first generated image. The implicit vector and the generative network may be trained according to the first degraded image and a second degraded image of at least one target image; the trained generative network and the trained implicit vector may be used to generate at least one reconstructed image of the target image.
-
公开(公告)号:US11455788B2
公开(公告)日:2022-09-27
申请号:US16828226
申请日:2020-03-24
发明人: Xihui Liu , Jing Shao , Zihao Wang , Hongsheng Li , Xiaogang Wang
IPC分类号: G06V10/00 , G06V10/44 , G06F16/583 , G06N3/08 , G06V30/148
摘要: A method and apparatus for positioning a description statement in an image includes: analyzing a to-be-analyzed description statement and a to-be-analyzed image to obtain a plurality of statement attention weights of the to-be-analyzed description statement and a plurality of image attention weights of the to-be-analyzed image; obtaining a plurality of first matching scores based on the plurality of statement attention weights and a subject feature, a location feature and a relationship feature of the to-be-analyzed image; obtaining a second matching score between the to-be-analyzed description statement and the to-be-analyzed image based on the plurality of first matching scores and the plurality of image attention weights; and determining a positioning result of the to-be-analyzed description statement in the to-be-analyzed image based on the second matching score.
-
公开(公告)号:US11429809B2
公开(公告)日:2022-08-30
申请号:US17077251
申请日:2020-10-22
发明人: Yixiao Ge , Dapeng Chen , Hongsheng Li
摘要: The present disclosure discloses an image processing method and related device thereof. The method includes: acquiring an image to be processed; and performing a feature extraction process on the image to be processed using a target neural network so as to obtain target feature data of the image to be processed, wherein parameters of the target neural network are time average values of parameters of a first neural network which is obtained from training under supervision by a training image set and an average network, and parameters of the average network are time average values of parameters of a second neural network which is obtained from training under supervision by the training image set and the target neural network. A corresponding device is also disclosed. Feature data of image to be processed are obtained via the feature extraction process performed on the image to be processed.
-
公开(公告)号:US20220270313A1
公开(公告)日:2022-08-25
申请号:US17678753
申请日:2022-02-23
发明人: Liu SU
摘要: The present disclosure relates to an image processing method and apparatus, an electronic device and a storage medium. The method includes: in response to a makeup operation on a facial image to be processed, generating, based on a selected first target material, a second target material matching a target part in the facial image to be processed; determining, based on the second target material, an image area in the facial image to be processed where the target part is located; and performing, based on a color of the second target material, a color fusion treatment on the image area where the target part is located, to obtain a fused facial image.
-
公开(公告)号:US11423666B2
公开(公告)日:2022-08-23
申请号:US16950606
申请日:2020-11-17
发明人: Bo Li , Wei Wu , Fangyi Zhang
摘要: A method of detecting target object includes: extracting, through a neural network, a feature of a reference frame and a feature of a frame under detection; inputting each of at least two feature groups from at least two network layers of the neural network into a detector so as to obtain a corresponding detection result group output from the detector; wherein each feature group includes features of the reference frame and of the frame under detection, each detection result group includes a classification result and a regression result with respect to each of a plurality of candidate boxes for a feature group; and acquiring a bounding box for the target object in the frame under detection according to the at least two detection result groups.
-
-
-
-
-
-
-
-
-