Patent search ap:("Beijing Baidu Netcom Science Technology Co. Page Ltd.") AND inv:"Jinle ZENG"

1.

发明申请
METHOD OF EXECUTING TASK FOR LARGE LANGUAGE MODEL, DEVICE, AND STORAGE MEDIUM 有权

公开(公告)号：US20240378077A1

公开(公告)日：2024-11-14

申请号：US18782617

申请日：2024-07-24

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Guoxia WANG , Jinle ZENG , Xiyuan XIAO , Jiabin YANG , Dianhai YU , Haifeng WANG

IPC: G06F9/48 , G06F40/40

Abstract: A method of executing a task for a large language model, a device, and a storage medium are provided, which relate to a field of artificial intelligence technology, and in particular to fields of deep learning, large language model, natural language processing and computer vision technologies. The method includes: determining, by using a determination unit, a target attention task from a plurality of attention tasks to be processed, based on a sparse representation corresponding to a feature to be processed, where the target attention task is a task corresponding to a non-fully masked region of the feature, the sparse representation represents a mask position of the feature, and the mask position represents mask endpoint positions in at least two non-intersecting intervals in a mask matrix corresponding to the feature; and executing the target attention task by using a computing unit, so as to obtain an attention feature.

Patent Agency Ranking