Patent search ap:"BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD."

341.

发明授权
Method of generating 3D video, method of training model, electronic device, and storage medium 有权

公开(公告)号：US12125131B2

公开(公告)日：2024-10-22

申请号：US18075346

申请日：2022-12-05

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Zhe Peng , Yuqiang Liu , Fanyu Geng

IPC: G06T13/40 , G06T7/20 , G10L15/02 , G10L15/06 , G10L15/16 , G10L25/57

CPC classification number: G06T13/40 , G06T7/20 , G10L15/02 , G10L15/063 , G10L15/16 , G10L25/57 , G06T2207/20081 , G06T2207/20084 , G06T2207/30201

Abstract: A method of generating a 3D video, a method of training a neural network model, an electronic device, and a storage medium, which relate to a field of image processing, and in particular to technical fields of computer vision, augmented/virtual reality and deep learning. The method includes: determining, based on an input speech feature, a principal component analysis (PCA) coefficient by using a first network, wherein the PCA coefficient is used to generate the 3D video; correcting the PCA coefficient by using a second network; generating a lip movement information based on the corrected PCA coefficient and a PCA parameter for a neural network model, wherein the neural network model includes the first network and the second network; and applying the lip movement information to a pre-constructed 3D basic avatar model to obtain a 3D video with a lip movement effect.

342.

发明授权
Dialogue state rewriting and reply generating method and system, electronic device and storage medium 有权

公开(公告)号：US12118319B2

公开(公告)日：2024-10-15

申请号：US17655772

申请日：2022-03-21

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Jun Xu , Zeming Liu , Zeyang Lei , Zhengyu Niu , Hua Wu , Haifeng Wang

IPC: G06F40/35 , G06F40/56 , G06N5/02

CPC classification number: G06F40/35 , G06F40/56 , G06N5/02

Abstract: The present disclosure provides a dialog method and system, an electronic device and a storage medium, and relates to the field of artificial intelligence (AI) technologies such as deep learning and natural language processing. A specific implementation scheme involves: rewriting a corresponding dialog state based on received dialog information of a user; determining to-be-used dialog action information based on the dialog information of the user and the dialog state; and generating a reply statement based on the dialog information of the user and the dialog action information. According to the present disclosure, the to-be-used dialog action information can be determined based on the dialog information of the user and the dialog state; and then the reply statement is generated based on the dialog action information, thereby providing an efficient dialog scheme.

343.

发明公开
TRAINING SAMPLE ACQUIRING METHOD AND APPARATUS AS WELL AS LARGE MODEL OPTIMIZATION TRAINING METHOD AND APPARATUS 审中-公开

公开(公告)号：US20240338564A1

公开(公告)日：2024-10-10

申请号：US18744501

申请日：2024-06-14

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Zhifan FENG , Hua WU , Qiaoqiao SHE , Tian WU

IPC: G06N3/08

CPC classification number: G06N3/08

Abstract: A large model optimization training method in the artificial intelligence fields, such as large models, deep learning, natural language processing, may include: taking, as candidate queries, queries collected from a predetermined data source and capable of serving as input to a large model in response to determining that an optimization triggering condition is met; screening out target queries from the candidate queries, the target queries being queries which cannot be correctly processed by the large model; and constructing respectively corresponding training samples according to the target queries, the training samples being used for carrying out optimization training on the large model.

344.

发明授权
Method and device for processing voice interaction, electronic device and storage medium 有权

公开(公告)号：US12112746B2

公开(公告)日：2024-10-08

申请号：US17476333

申请日：2021-09-15

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Jinfeng Bai , Zhijian Wang , Cong Gao

IPC: G10L15/22

CPC classification number: G10L15/22 , G10L2015/223

Abstract: The present disclosure provides a method and a device for processing voice interaction, an electronic device and a storage medium. The method includes: determining a first integrity of a voice instruction from a user by using a pre-trained integrity detection model in response to detecting that the voice instruction from the user is not a high-frequency instruction; determining a waiting duration for the voice instruction based on the first integrity and a preset integrity threshold, wherein the waiting duration for the voice instruction indicates a length of period between a time when a voice interaction device determines that receiving the voice instruction is completed and a time when the voice interaction device performs an operation in response to the voice instruction of the user; and controlling the voice interaction device to respond to the voice instruction of the user based on the waiting duration.

345.

发明授权
Method and apparatus for training speech recognition model, electronic device and storage medium 有权

公开(公告)号：US12100388B2

公开(公告)日：2024-09-24

申请号：US17747732

申请日：2022-05-18

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Qingen Zhao

IPC: G10L15/06 , G10L15/02 , G10L15/16 , G10L15/22

CPC classification number: G10L15/063 , G10L15/02 , G10L15/16 , G10L15/22

Abstract: A method and apparatus for training a speech recognition model, an electronic device and a storage medium are provided. An implementation of the method may include: determining a plurality of feature vectors based on audio feature data corresponding to a first target frame in a sample speech, wherein the sample speech comprises a conversation among a plurality of objects and the sample speech has a corresponding sample text; generating a predicted text element corresponding to the first target frame based on an adjacent text element preceding to a text element corresponding to the first target frame in the sample text, wherein the text element and the adjacent text element are targeting at a target object in the plurality of objects; obtaining a first target text element based on the predicted text element and a first feature vector in the plurality of feature vectors; and adjusting the speech recognition model based on the first target text element and the sample text, to obtain a trained speech recognition model.

346.

发明授权
Method for generating dialogue, electronic device, and storage medium 有权

公开(公告)号：US12086555B2

公开(公告)日：2024-09-10

申请号：US17643053

申请日：2021-12-07

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Jianglu Hu , Hehan Li , Huifeng Sun , Shuqi Sun , Yue Chang , Tingting Li , Hua Wu , Haifeng Wang

IPC: G06F40/35 , G06F16/332

CPC classification number: G06F40/35 , G06F16/3329

Abstract: The disclosure provides a method for generating a dialogue. The method includes: obtaining an input sentence; determining a type of a task-based response sentence that is to be generated, by updating a current dialogue state based on the input sentence; generating the task-based response sentence by inputting the input sentence into a task-based dialogue response generator; and determining the task-based response sentence as a target response sentence in response to the type of the task-based response sentence being a designated type.

347.

发明公开
METHOD OF TRAINING TEXT RECOGNITION MODEL, AND METHOD OF RECOGNIZING TEXT 审中-公开

公开(公告)号：US20240281609A1

公开(公告)日：2024-08-22

申请号：US18041207

申请日：2022-05-16

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Pengyuan LV , Jingquan LI , Chengquan ZHANG , Kun YAO , Jingtuo LIU , Junyu HAN

IPC: G06F40/30 , G06V30/12

CPC classification number: G06F40/30 , G06V30/12

Abstract: The present application provides a method of training a text recognition model. The method includes: inputting a first sample image into the visual feature extraction sub-model to obtain a first visual feature and a first predicted text, the first sample image contains a text and a tag indicating a first actual text; obtaining, by using the semantic feature extraction sub-model, a first semantic feature based on the first predicted text; obtaining, by using the sequence sub-model, a second predicted text based on the first visual feature and the first semantic feature; and training the text recognition model based on the first predicted text, the second predicted text and the first actual text. The present disclosure further provides a method of recognizing a text, an electronic device, and a storage medium.

348.

发明授权
Group service implementation method and device, equipment and storage medium 有权

公开(公告)号：US12069172B2

公开(公告)日：2024-08-20

申请号：US17452496

申请日：2021-10-27

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Bo Jing , Peiqian Zhang , Hongyan Wang

IPC: H04L9/36 , H04L9/08 , H04L9/32 , H04L9/38

CPC classification number: H04L9/088 , H04L9/3247

Abstract: Provided are a group service implementation method and device, an equipment and a storage medium. The specific solution is described below. A service transaction request is acquired. In response to the service transaction request including to-be-authenticated data and a threshold signature, a signature group corresponding to the threshold signature is determined. Group information of the signature group is acquired by querying a blockchain, where the signature group includes at least two members, the at least two members of the signature group are used for authenticating the to-be-authenticated data by adopting secure multi-party computation and generating the threshold signature for the to-be-authenticated data by adopting a signature private key, and the group information includes at least a verification public key of the threshold signature. The threshold signature is verified by adopting the verification public key in the group information.

349.

发明授权
Site recommendation method, electronic device, and storage medium 有权

公开(公告)号：US12067067B2

公开(公告)日：2024-08-20

申请号：US17850930

申请日：2022-06-27

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： JuanJuan Shui , Yingjie Niu , Qiushen Qu , Weipeng Niu , Jiankang Xin

IPC: G06F16/9537 , G06F16/9535 , G06F16/9538

CPC classification number: G06F16/9537 , G06F16/9535 , G06F16/9538

Abstract: A site recommendation method, an electronic device, and a readable storage medium are provided, which relate to the field of automatic driving. The method includes: determining, in response to a query request of a user terminal for a target position, a target site recommended to a target user within a specified range of the target position, wherein the target site includes a site that the target user is interested in under a specified travel condition; and sending the target site to the user terminal.

350.

发明公开
CONTENT INITIALIZATION METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM 审中-公开

公开(公告)号：US20240275848A1

公开(公告)日：2024-08-15

申请号：US18020618

申请日：2022-08-01

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Guoxia WANG , Long LI , Zhihua WU

IPC: H04L67/1097 , G06F7/58

CPC classification number: H04L67/1097 , G06F7/582 , G06F7/588

Abstract: The present disclosure provides a content initialization method and apparatus, an electronic device and a storage medium, which relates to a field of computer technology, in particular to fields of deep learning and distributed computing. The content initialization method is applied to any one of a plurality of devices included in a distributed system. A specific implementation scheme of the content initialization method is: determining, according to a size information of a resource space for the distributed system and an identification information of the any one of the plurality of devices, a space information of a first sub-space for the any one of the plurality of devices in the resource space, wherein the space information includes a position information of the first sub-space for the resource space; and determining an initialization content for the first sub-space according to a random seed and the position information.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification