-
公开(公告)号:US20240378077A1
公开(公告)日:2024-11-14
申请号:US18782617
申请日:2024-07-24
Inventor: Guoxia WANG , Jinle ZENG , Xiyuan XIAO , Jiabin YANG , Dianhai YU , Haifeng WANG
Abstract: A method of executing a task for a large language model, a device, and a storage medium are provided, which relate to a field of artificial intelligence technology, and in particular to fields of deep learning, large language model, natural language processing and computer vision technologies. The method includes: determining, by using a determination unit, a target attention task from a plurality of attention tasks to be processed, based on a sparse representation corresponding to a feature to be processed, where the target attention task is a task corresponding to a non-fully masked region of the feature, the sparse representation represents a mask position of the feature, and the mask position represents mask endpoint positions in at least two non-intersecting intervals in a mask matrix corresponding to the feature; and executing the target attention task by using a computing unit, so as to obtain an attention feature.