Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD.") AND inv:"Haifeng WANG"

11.

发明申请
METHOD AND APPARATUS OF CONVERTING SCHEMA IN DEEP LEARNING FRAMEWORK, AND COMPUTER STORAGE MEDIUM 有权

公开(公告)号：US20220004526A1

公开(公告)日：2022-01-06

申请号：US17480294

申请日：2021-09-21

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Liujie ZHANG , Yamei LI , Huihuang ZHENG , Hongyu LIU , Xiang LAN , Dianhai YU , Yanjun MA , Tian WU , Haifeng WANG

IPC: G06F16/21 , G06F16/22

Abstract: According to exemplary embodiments of the present disclosure, there is provided a method and apparatus of converting a schema in a deep learning framework, and a computer storage medium. The method of converting the schema in the deep learning framework includes: updating a first schema, based on first syntax elements in the first schema and a context relationship between the first syntax elements in the first schema, so as to obtain an updated first schema; generating second syntax elements corresponding to updated first syntax elements in the updated first schema, based on a mapping relationship between the updated first syntax elements in the updated first schema and second syntax elements in a second schema system; and combining the second syntax elements according to a context relationship between the updated first syntax elements, so as to generate a second schema

12.

发明申请
SPEECH RECOGNITION 有权

公开(公告)号：US20250078839A1

公开(公告)日：2025-03-06

申请号：US18819018

申请日：2024-08-29

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Xiaoyin FU , Qiguang ZANG , Fenfen SHENG , Haifeng WANG , Lei JIA

IPC: G10L15/32 , G10L15/02 , G10L15/04 , G10L15/06 , G10L15/183

Abstract: A speech recognition method and a method for training a deep learning model are provided. The speech recognition method includes: obtaining a first speech feature of a speech to-be-recognized, which includes a plurality of speech segment features corresponding to a plurality of speech segments; decoding the first speech feature using a first decoder to obtain a plurality of first decoding results corresponding to a plurality of the words, indicating a first recognition result of words; extracting a second speech feature from the first speech feature based on first a priori information, which includes the plurality of first decoding results, and the second speech feature includes first word-level audio features corresponding to the plurality of words; and decoding the second speech feature using a second decoder to obtain a plurality of second decoding results corresponding to the plurality of words, indicating a second recognition result of the word.

13.

发明申请
DATA GENERATION 有权

公开(公告)号：US20250061311A1

公开(公告)日：2025-02-20

申请号：US18746532

申请日：2024-06-18

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Zeyang LEI , Siqi BAO , Hua WU , Haifeng WANG

IPC: G06N3/0475 , G06N3/08

Abstract: A data generation method is provided. The data generation method includes: generating first answer data based on first question data from a user; determining, in response to receiving negative feedback from the user for the first answer data, a first reflection result for the first answer data based on the first answer data and the negative feedback, wherein the first reflection result indicates a diagnosis reason why feedback from the user for the first answer data is negative; and generating second answer data for the first question data based on the first question data and the first reflection result.

14.

发明申请
SPEECH WAKE-UP METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM 有权

公开(公告)号：US20240420684A1

公开(公告)日：2024-12-19

申请号：US18706313

申请日：2023-01-17

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Saisai ZOU , Lei JIA , Haifeng WANG

IPC: G10L15/16 , G10L13/02 , G10L15/02 , G10L15/08

Abstract: A speech wake-up method, an electronic device, and a storage medium are provided. The method includes: performing a word recognition on a speech to be recognized to obtain a wake-up word recognition result (S210); performing a syllable recognition on the speech to be recognized to obtain a wake-up syllable recognition result, in response to determining that the wake-up word recognition result represents that the speech to be recognized contains a predetermined wake-up word (S220); and determining that the speech to be recognized is a correct wake-up speech, in response to determining that the wake-up syllable recognition result represents that the speech to be recognized contains a predetermined syllable (S230).

15.

发明公开
GENERATIVE DIALOG MODEL TRAINING METHOD AND APPARATUS AS WELL AS GENERATIVE DIALOG IMPLEMENTING METHOD AND APPARATUS 审中-公开

公开(公告)号：US20240338530A1

公开(公告)日：2024-10-10

申请号：US18745550

申请日：2024-06-17

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Zhen GUO , Wenquan WU , Hua WU , Haifeng WANG

IPC: G06F40/35 , G06N20/00

CPC classification number: G06F40/35 , G06N20/00

Abstract: A generative dialog model training method in the fields of artificial intelligence, such as deep learning, natural language processing, intelligent dialogs, is disclosed. The generative dialog model training method may include: in response to determination of an update of a safety specification, taking an updated safety specification as a target safety specification, and determining a dialog input corresponding to a current optimization according to the target safety specification, the update being performed on a previous safety specification when a generative dialog model after last optimization is determined not to meet a launch requirement; and optimizing the generative dialog model according to the dialog input and a principle that a reply generated by the generative dialog model conforms to the target safety specification, the generative dialog model being configured to generate the reply corresponding to the dialog input.

16.

发明公开
DEEP LEARNING MODEL BASED DATA GENERATION 审中-公开

公开(公告)号：US20240028909A1

公开(公告)日：2024-01-25

申请号：US18478833

申请日：2023-09-29

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Haifeng WANG , Hua WU , Hao TIAN , Yu SUN , Tian WU , Dou HONG

IPC: G06N3/096

CPC classification number: G06N3/096

Abstract: A data generation method based on a deep learning model and a training method is provided. The data generation method includes: determining an initial input of the deep learning model based on input data; obtaining a first output of the model, where in response to the model determining that generating a reply based on the initial input requires calling a first functional component different from the deep learning model, the first output includes a first token for calling the first functional component and a first intermediate inquiry determined based on the initial input and recognizable by the first functional component; obtaining a first intermediate result determined by the first functional component based on the first intermediate inquiry; determining a second input for the model based on the initial input and the first intermediate result; and obtaining a second output of the model for generating a reply to the initial input.

17.

发明公开
METHOD OF PROCESSING SPEECH INFORMATION, METHOD OF TRAINING MODEL, AND WAKE-UP METHOD 审中-公开

公开(公告)号：US20230360638A1

公开(公告)日：2023-11-09

申请号：US18221593

申请日：2023-07-13

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Saisai ZOU , Lei JIA , Haifeng WANG

IPC: G10L15/02 , G10L15/14

CPC classification number: G10L15/02 , G10L15/14 , G10L2015/027

Abstract: A method of processing a speech information, a method of training a speech model, a speech wake-up method, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence technology, in particular to fields of human-computer interaction, deep learning and intelligent speech technologies. A specific implementation solution includes: performing a syllable recognition on a speech information to obtain a posterior probability sequence for the speech information, where the speech information includes a speech frame sequence, the posterior probability sequence corresponds to the speech frame sequence, and each posterior probability in the posterior probability sequence represents a similarity between a syllable in a speech frame matched with the posterior probability and a predetermined syllable; and determining a target peak speech frame from the speech frame sequence based on the posterior probability sequence.

18.

发明公开
METHOD AND APPARATUS FOR TRAINING VOICE WAKE-UP MODEL, METHOD AND APPARATUS FOR VOICE WAKE-UP, DEVICE, AND STORAGE MEDIUM 审中-公开

公开(公告)号：US20230317060A1

公开(公告)日：2023-10-05

申请号：US18328135

申请日：2023-06-02

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Saisai ZOU , Li CHEN , Ruoxi ZHANG , Lei JIA , Haifeng WANG

IPC: G10L15/06 , G10L15/02

CPC classification number: G10L15/063 , G10L15/02

Abstract: The present disclosure provides a method and an apparatus for training a voice wake-up model, a method and an apparatus for voice wake-up, a device and a storage medium, which relates to the field of artificial intelligence and particularly to the field of deep learning and voice technology. A specific implementation lies in: acquiring voice recognition training data and voice wake-up training data that are created, and firstly performing training on a base model according to the voice recognition training data to obtain a model parameter of the base model when a model loss function converges; then updating, based on a model configuration instruction, a configuration parameter of a decoding module in the base model to obtain a first model; and finally performing training on the first model according to the voice wake-up training data to obtain a trained voice wake-up model when the model loss function converges.

19.

发明申请
CONVERSATIONAL RECOMMENDATION METHOD, METHOD OF TRAINING MODEL, DEVICE AND MEDIUM 有权

公开(公告)号：US20230088445A1

公开(公告)日：2023-03-23

申请号：US18059386

申请日：2022-11-28

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Zeming LIU , Hao LIU , Zhengyu Niu , Hua WU , Haifeng WANG , Hui XIONG

IPC: G10L15/16 , G10L15/18 , G10L15/06 , G10L15/22

Abstract: A conversational recommendation method, a method of training a conversational recommendation model, an electronic device, and a storage medium are provided, which are related to a technical field of data processing, in particular to technical fields of voice interaction, deep learning, artificial intelligence and the like. The conversational recommendation method includes: acquiring a historical conversation information; determining a target conversation object to be generated, from a conversation target graph based on the historical conversation information, the conversation target graph includes an object node, the object node is configured to represent a conversation object, and the target conversation object is determined based on the object node; and generating a target conversation information for recommendation based on the target conversation object.

20.

发明申请
MIXTURE-OF-EXPERTS MODEL IMPLEMENTATION METHOD AND SYSTEM, ELECTRONIC DEVICE, AND STORAGE MEDIUM 有权

公开(公告)号：US20250036920A1

公开(公告)日：2025-01-30

申请号：US18026140

申请日：2022-09-20

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Liang SHEN , Haifeng WANG , Huachao WU , Weibao GONG , Zhihua WU , Dianhai YU

IPC: G06N3/045 , G06N3/0495

Abstract: The present disclosure provides a mixture-of-experts (MoE) model implementation method and system, an electronic device, and a storage medium, and relates to the field of artificial intelligence (AI) such as deep learning and distributed storage. The method includes: constructing a communication group, the communication group including a tensor-parallelism communication group, the tensor-parallelism communication group including at least two computing devices, tensor-parallelism segmentation being adopted for sparse parameters of each of the computing devices in a same tensor-parallelism communication group; and training an MoE model based on the communication group. By use of the solutions of the present disclosure, normal operation of model training can be guaranteed.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification