Invention Publication
- Patent Title: DATA PROCESSING METHOD AND RELATED DEVICE
-
Application No.: US18186942Application Date: 2023-03-20
-
Publication No.: US20230229898A1Publication Date: 2023-07-20
- Inventor: Zichao Li , Lu Hou , Xin Jiang
- Applicant: HUAWEI TECHNOLOGIES CO., LTD.
- Applicant Address: CN Guangdong
- Assignee: HUAWEI TECHNOLOGIES CO., LTD.
- Current Assignee: HUAWEI TECHNOLOGIES CO., LTD.
- Current Assignee Address: CN Guangdong
- Priority: CN 2011052624.X 2020.09.29
- Main IPC: G06N3/0499
- IPC: G06N3/0499 ; G06N3/08

Abstract:
A data processing method includes: obtaining to-be-processed data and a target neural network model, where the target neural network model includes a first transformer layer, the first transformer layer includes a first residual branch and a second residual branch, the first residual branch includes a first attention head, and the second residual branch includes a target feed-forward network (FFN) layer; and performing target task related processing on the to-be-processed data based on the target neural network model, to obtain a data processing result, where the target neural network model is for performing a target operation on an output of the first attention head and a first weight value to obtain an output of the first residual branch, and/or the target neural network model is for performing a target operation on an output of the target FFN and a second weight value to obtain an output of the second residual branch.
Information query