Invention Publication
- Patent Title: ATTENTION NEURAL NETWORKS WITH TREE ATTENTION MECHANISMS
- Application No.: US18343723
- Application Date: 2023-06-28
- Publication No.: US20240005131A1
- Publication Date: 2024-01-04
- Inventor: Himanshu Jain, Lovish Madaan, Prateek Jain, Venkata Sesha Pavana Srinadh Bhojanapalli
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Priority: IN 2221037019, 2022-06-28
- Main IPC: G06N3/0455
- IPC: G06N3/0455

Abstract:
Systems and methods for processing inputs using attention neural networks with tree attention layers. Each tree attention layer includes one or more tree attention sub-layers that are each configured to: process query vectors using a decision tree model for the tree attention sub-layer to determine a respective tree path for each query vector; process key vectors using the decision tree model to determine a respective tree path for each key vector; and generate an attended input sequence comprising a respective attended input at each of the plurality of input positions, comprising: generating, for each particular input position, the respective attended input at the particular input position based on (i) the tree path for the query vector at the particular input position, (ii) the respective tree paths for the key vectors at each of the plurality of input positions, and (iii) the value vectors at a subset of the input positions.
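The three steps recited in the abstract can be illustrated with a minimal sketch of a single tree attention sub-layer. The abstract does not specify the form of the decision tree model, how the subset of input positions is chosen, or how value vectors are combined, so everything concrete below is an assumption made for illustration: the tree is a complete binary tree with one hyperplane per internal node (`node_weights`, hypothetical), the subset is the set of positions whose key vector follows the same tree path as the query vector, values in the subset are combined with softmax-weighted dot-product scores, and a query with an empty subset receives a zero vector. The function names `tree_path` and `tree_attention` are likewise hypothetical.

```python
import math


def tree_path(vec, node_weights, depth):
    """Route `vec` through a complete binary tree and return its branch decisions.

    Assumption: node_weights[i] is a hyperplane for internal node i in heap
    order, so the children of node i are 2*i + 1 (left) and 2*i + 2 (right).
    """
    path, node = [], 0
    for _ in range(depth):
        score = sum(w * x for w, x in zip(node_weights[node], vec))
        go_right = score > 0  # assumed split rule: sign of the projection
        path.append(1 if go_right else 0)
        node = 2 * node + (2 if go_right else 1)
    return tuple(path)


def tree_attention(queries, keys, values, node_weights, depth):
    """Return one attended input per input position."""
    # Steps 1 and 2: the same decision tree assigns a path to every query and key.
    q_paths = [tree_path(q, node_weights, depth) for q in queries]
    k_paths = [tree_path(k, node_weights, depth) for k in keys]

    dim = len(values[0])
    attended = []
    for q, q_path in zip(queries, q_paths):
        # Step 3: assumed subset rule, keys whose path equals the query's path.
        subset = [j for j, k_path in enumerate(k_paths) if k_path == q_path]
        if not subset:
            attended.append([0.0] * dim)  # assumed fallback for an empty subset
            continue
        # Assumed combination rule: softmax over dot-product scores in the subset.
        scores = [sum(a * b for a, b in zip(q, keys[j])) for j in subset]
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        attended.append([
            sum(e / total * values[j][d] for e, j in zip(exps, subset))
            for d in range(dim)
        ])
    return attended


# Depth-1 tree with a single split on the first coordinate.
node_weights = [[1.0, 0.0]]
queries = [[1.0, 0.0], [-1.0, 0.0]]
keys = [[2.0, 0.0], [-2.0, 0.0]]
values = [[1.0, 0.0], [0.0, 1.0]]
attended = tree_attention(queries, keys, values, node_weights, depth=1)
print(attended)  # [[1.0, 0.0], [0.0, 1.0]]
```

In this toy input each query shares a tree path with exactly one key, so each attended input equals that key's value vector. Restricting each query to the keys on its own path is what lets such a layer avoid scoring every query against every key, although the grouping rule actually claimed may differ from the exact-match rule assumed here.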