-
341.
公开(公告)号:US12125131B2
公开(公告)日:2024-10-22
申请号:US18075346
申请日:2022-12-05
Inventor: Zhe Peng , Yuqiang Liu , Fanyu Geng
CPC classification number: G06T13/40 , G06T7/20 , G10L15/02 , G10L15/063 , G10L15/16 , G10L25/57 , G06T2207/20081 , G06T2207/20084 , G06T2207/30201
Abstract: A method of generating a 3D video, a method of training a neural network model, an electronic device, and a storage medium, which relate to a field of image processing, and in particular to technical fields of computer vision, augmented/virtual reality and deep learning. The method includes: determining, based on an input speech feature, a principal component analysis (PCA) coefficient by using a first network, wherein the PCA coefficient is used to generate the 3D video; correcting the PCA coefficient by using a second network; generating a lip movement information based on the corrected PCA coefficient and a PCA parameter for a neural network model, wherein the neural network model includes the first network and the second network; and applying the lip movement information to a pre-constructed 3D basic avatar model to obtain a 3D video with a lip movement effect.
-
342.
公开(公告)号:US12118319B2
公开(公告)日:2024-10-15
申请号:US17655772
申请日:2022-03-21
Inventor: Jun Xu , Zeming Liu , Zeyang Lei , Zhengyu Niu , Hua Wu , Haifeng Wang
Abstract: The present disclosure provides a dialog method and system, an electronic device and a storage medium, and relates to the field of artificial intelligence (AI) technologies such as deep learning and natural language processing. A specific implementation scheme involves: rewriting a corresponding dialog state based on received dialog information of a user; determining to-be-used dialog action information based on the dialog information of the user and the dialog state; and generating a reply statement based on the dialog information of the user and the dialog action information. According to the present disclosure, the to-be-used dialog action information can be determined based on the dialog information of the user and the dialog state; and then the reply statement is generated based on the dialog action information, thereby providing an efficient dialog scheme.
-
343.
公开(公告)号:US20240338564A1
公开(公告)日:2024-10-10
申请号:US18744501
申请日:2024-06-14
Inventor: Zhifan FENG , Hua WU , Qiaoqiao SHE , Tian WU
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: A large model optimization training method in the artificial intelligence fields, such as large models, deep learning, natural language processing, may include: taking, as candidate queries, queries collected from a predetermined data source and capable of serving as input to a large model in response to determining that an optimization triggering condition is met; screening out target queries from the candidate queries, the target queries being queries which cannot be correctly processed by the large model; and constructing respectively corresponding training samples according to the target queries, the training samples being used for carrying out optimization training on the large model.
-
344.
公开(公告)号:US12112746B2
公开(公告)日:2024-10-08
申请号:US17476333
申请日:2021-09-15
Inventor: Jinfeng Bai , Zhijian Wang , Cong Gao
IPC: G10L15/22
CPC classification number: G10L15/22 , G10L2015/223
Abstract: The present disclosure provides a method and a device for processing voice interaction, an electronic device and a storage medium. The method includes: determining a first integrity of a voice instruction from a user by using a pre-trained integrity detection model in response to detecting that the voice instruction from the user is not a high-frequency instruction; determining a waiting duration for the voice instruction based on the first integrity and a preset integrity threshold, wherein the waiting duration for the voice instruction indicates a length of period between a time when a voice interaction device determines that receiving the voice instruction is completed and a time when the voice interaction device performs an operation in response to the voice instruction of the user; and controlling the voice interaction device to respond to the voice instruction of the user based on the waiting duration.
-
345.
公开(公告)号:US12100388B2
公开(公告)日:2024-09-24
申请号:US17747732
申请日:2022-05-18
Inventor: Qingen Zhao
CPC classification number: G10L15/063 , G10L15/02 , G10L15/16 , G10L15/22
Abstract: A method and apparatus for training a speech recognition model, an electronic device and a storage medium are provided. An implementation of the method may include: determining a plurality of feature vectors based on audio feature data corresponding to a first target frame in a sample speech, wherein the sample speech comprises a conversation among a plurality of objects and the sample speech has a corresponding sample text; generating a predicted text element corresponding to the first target frame based on an adjacent text element preceding to a text element corresponding to the first target frame in the sample text, wherein the text element and the adjacent text element are targeting at a target object in the plurality of objects; obtaining a first target text element based on the predicted text element and a first feature vector in the plurality of feature vectors; and adjusting the speech recognition model based on the first target text element and the sample text, to obtain a trained speech recognition model.
-
公开(公告)号:US12086555B2
公开(公告)日:2024-09-10
申请号:US17643053
申请日:2021-12-07
Inventor: Jianglu Hu , Hehan Li , Huifeng Sun , Shuqi Sun , Yue Chang , Tingting Li , Hua Wu , Haifeng Wang
IPC: G06F40/35 , G06F16/332
CPC classification number: G06F40/35 , G06F16/3329
Abstract: The disclosure provides a method for generating a dialogue. The method includes: obtaining an input sentence; determining a type of a task-based response sentence that is to be generated, by updating a current dialogue state based on the input sentence; generating the task-based response sentence by inputting the input sentence into a task-based dialogue response generator; and determining the task-based response sentence as a target response sentence in response to the type of the task-based response sentence being a designated type.
-
公开(公告)号:US20240281609A1
公开(公告)日:2024-08-22
申请号:US18041207
申请日:2022-05-16
Inventor: Pengyuan LV , Jingquan LI , Chengquan ZHANG , Kun YAO , Jingtuo LIU , Junyu HAN
Abstract: The present application provides a method of training a text recognition model. The method includes: inputting a first sample image into the visual feature extraction sub-model to obtain a first visual feature and a first predicted text, the first sample image contains a text and a tag indicating a first actual text; obtaining, by using the semantic feature extraction sub-model, a first semantic feature based on the first predicted text; obtaining, by using the sequence sub-model, a second predicted text based on the first visual feature and the first semantic feature; and training the text recognition model based on the first predicted text, the second predicted text and the first actual text. The present disclosure further provides a method of recognizing a text, an electronic device, and a storage medium.
-
公开(公告)号:US12069172B2
公开(公告)日:2024-08-20
申请号:US17452496
申请日:2021-10-27
Inventor: Bo Jing , Peiqian Zhang , Hongyan Wang
CPC classification number: H04L9/088 , H04L9/3247
Abstract: Provided are a group service implementation method and device, an equipment and a storage medium. The specific solution is described below. A service transaction request is acquired. In response to the service transaction request including to-be-authenticated data and a threshold signature, a signature group corresponding to the threshold signature is determined. Group information of the signature group is acquired by querying a blockchain, where the signature group includes at least two members, the at least two members of the signature group are used for authenticating the to-be-authenticated data by adopting secure multi-party computation and generating the threshold signature for the to-be-authenticated data by adopting a signature private key, and the group information includes at least a verification public key of the threshold signature. The threshold signature is verified by adopting the verification public key in the group information.
-
公开(公告)号:US12067067B2
公开(公告)日:2024-08-20
申请号:US17850930
申请日:2022-06-27
Inventor: JuanJuan Shui , Yingjie Niu , Qiushen Qu , Weipeng Niu , Jiankang Xin
IPC: G06F16/9537 , G06F16/9535 , G06F16/9538
CPC classification number: G06F16/9537 , G06F16/9535 , G06F16/9538
Abstract: A site recommendation method, an electronic device, and a readable storage medium are provided, which relate to the field of automatic driving. The method includes: determining, in response to a query request of a user terminal for a target position, a target site recommended to a target user within a specified range of the target position, wherein the target site includes a site that the target user is interested in under a specified travel condition; and sending the target site to the user terminal.
-
公开(公告)号:US20240275848A1
公开(公告)日:2024-08-15
申请号:US18020618
申请日:2022-08-01
Inventor: Guoxia WANG , Long LI , Zhihua WU
IPC: H04L67/1097 , G06F7/58
CPC classification number: H04L67/1097 , G06F7/582 , G06F7/588
Abstract: The present disclosure provides a content initialization method and apparatus, an electronic device and a storage medium, which relates to a field of computer technology, in particular to fields of deep learning and distributed computing. The content initialization method is applied to any one of a plurality of devices included in a distributed system. A specific implementation scheme of the content initialization method is: determining, according to a size information of a resource space for the distributed system and an identification information of the any one of the plurality of devices, a space information of a first sub-space for the any one of the plurality of devices in the resource space, wherein the space information includes a position information of the first sub-space for the resource space; and determining an initialization content for the first sub-space according to a random seed and the position information.
-
-
-
-
-
-
-
-
-