Patent search ap:("Beijing Baidu Netcom Science Technology Co. Page Ltd.") AND inv:"Shiwei HUANG"

1.

发明申请
TASK EXECUTION METHOD FOR LARGE MODEL, DEVICE, AND MEDIUM 有权

公开(公告)号：US20250094792A1

公开(公告)日：2025-03-20

申请号：US18968790

申请日：2024-12-04

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Bo KE , Xuyi CHEN , Zhengjie HUANG , Shikun FENG , Weibin LI , Shiwei HUANG

IPC: G06N3/0495 , G06N3/0475 , G06N3/0499 , G06N3/09

Abstract: A task execution method for a large model, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence technology, particularly to fields of deep learning technology and large model technology. The method includes: executing a modality routing task by using a target computing unit based on a target feature to be processed to obtain a modality recognition result; executing a field routing task by using the target computing unit based on the target feature to be processed and a target field gating model parameter to obtain a field recognition result; and executing a feedforward task by using the target computing unit based on the target feature to be processed and a target feedforward task model parameter to obtain a task execution result

2.

发明申请
PRE-TRAINING METHOD OF NEURAL NETWORK MODEL, ELECTRONIC DEVICE AND MEDIUM 有权

公开(公告)号：US20220129753A1

公开(公告)日：2022-04-28

申请号：US17572921

申请日：2022-01-11

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Yuxiang LU , Jiaxiang LIU , Xuyi CHEN , Shikun FENG , Shuohuan WANG , Yu SUN , Shiwei HUANG , Jingzhou HE

IPC: G06N3/08 , G06N3/04

Abstract: A pre-training method of a neural network model, an electronic device, and a medium. The pre-training data is inputted to the initial neural network model, and the initial neural network model is pre-trained in the first training mode, in the first training mode, the plurality of hidden layers share one hidden layer parameter, and the loss value of the initial neural network model is obtained, if the loss value of the initial neural network model is less than a preset threshold, the initial neural network model continues to be pre-trained in the second training mode, in the second training mode, each of the plurality of hidden layers has its own hidden layer parameter.

3.

发明申请
DIALOG DATA GENERATING 有权

公开(公告)号：US20230085458A1

公开(公告)日：2023-03-16

申请号：US18057651

申请日：2022-11-21

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Xin TIAN , Dongfeng HE , Liankai HUANG , Yingzhan LIN , Shiwei HUANG

IPC: G06F40/35 , G06F40/279 , G06F40/186 , G06N3/08

Abstract: A method for generating dialog data is provided. An implementation is: obtaining a target dialog data template, where the target dialog data template includes one or more target single-round dialog data templates, each target single-round dialog data template includes one or more keyword slots and related information about each keyword slot, and the related information about each keyword slot includes location information and attribute information; for each keyword slot, determining, from a keyword data set at least based on the attribute information of the keyword slot, one or more target keywords that match the keyword slot; and for each target single-round dialog data template, filling the target single-round dialog data template with the one or more target keywords based on the location information of the one or more keyword slots, to obtain target dialog data.

4.

发明申请
METHOD OF TRAINING DEEP LEARNING MODEL AND METHOD OF PROCESSING NATURAL LANGUAGE 有权

公开(公告)号：US20230047980A1

公开(公告)日：2023-02-16

申请号：US17976049

申请日：2022-10-28

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Xuyi CHEN , Weixin LIU , Yuxiang LU , Jiaxiang LU , Shiwei HUANG

IPC: G06F40/40

Abstract: A method of training a deep learning model, a method of processing a natural language, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence, in particular to deep learning technology and natural language processing technology. The method includes: inputting first sample data into a first deep learning model to obtain a first output result; training the first deep learning model according to the first output result and a first target output result, the first target output result is obtained by processing the first sample data using a reference deep learning model; inputting second sample data into a second deep learning model to obtain a second output result; and training the second deep learning model according to the second output result and a second target output result, to obtain a trained second deep learning model.

5.

发明申请
METHOD AND APPARATUS FOR GENERATING NODE REPRESENTATION, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM 有权

公开(公告)号：US20230004774A1

公开(公告)日：2023-01-05

申请号：US17578683

申请日：2022-01-19

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Weibin LI , Zhifan ZHU , Shikun FENG , Shiwei HUANG , Jingzhou HE

IPC: G06N3/04 , G06N3/08

Abstract: The present disclosure provides a method and apparatus for generating a node representation, an electronic device and a readable storage medium, and relates to the field of deep learning technologies. The method for generating a node representation includes: acquiring a heterogeneous graph to be processed; performing a sampling operation in the heterogeneous graph to be processed according to a first meta path, so as to obtain at least one first walk path; obtaining an initial node representation of each node in the heterogeneous graph to be processed according to the at least one first walk path; and generating the final node representation of each node according to the initial node representation of each node and initial node representations of neighbor nodes of each node. With the present disclosure, accuracy of the generated node representation may be improved.

6.

发明申请
DATA PROCESSING 有权

公开(公告)号：US20250028958A1

公开(公告)日：2025-01-23

申请号：US18908380

申请日：2024-10-07

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Xuyi CHEN , Bo KE , Chenhui LI , Zhengjie HUANG , Shiwei HUANG , Weibin LI , Shikun FENG

IPC: G06N3/08

Abstract: A data processing method, and a data processing model and a training method therefor are provided, and relate to the field of artificial intelligence, and specifically, to natural language processing, deep learning technologies, and large model technologies. An implementation solution includes: determining input data, where the input data includes a plurality of tokens; determining a correlation between each of the plurality of tokens and each of a plurality of expert networks based on a gating matrix, where the plurality of expert networks are used to reinforce the plurality of tokens; allocating the plurality of tokens to the plurality of expert networks in a uniform manner based on the correlation and a preset capacity of each expert network, to reinforce the plurality of tokens; and determining a data processing result based on the plurality of reinforced tokens.

7.

发明公开
METHOD AND APPARATUS FOR PROCESSING DIALOGUE, ELECTRONIC DEVICE, AND STORAGE MEDIUM 审中-公开

公开(公告)号：US20230214689A1

公开(公告)日：2023-07-06

申请号：US18121053

申请日：2023-03-14

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Xin TIAN , Yingzhan LIN , Mengfei SONG , Siqi BAO , Shiwei HUANG

IPC: G06N5/04

CPC classification number: G06N5/04

Abstract: A method for processing a dialogue includes: obtaining a dialogue text of the dialogue, in which the dialogue text includes a current question text, or the dialogue text includes the current question text and a historical dialogue text; extracting a current query text from the dialogue text; obtaining a knowledge query result for the current query text by querying a knowledge database based on the current query text; and determining a response text for the current question text based on the knowledge query result and the dialogue text.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification