ATTENTION NEURAL NETWORKS WITH TREE ATTENTION MECHANISMS

    Publication number: US20240005131A1

    Publication date: 2024-01-04

    Application number: US18343723

    Filing date: 2023-06-28

    Applicant: Google LLC

    CPC classification number: G06N3/0455

Abstract: Systems and methods for processing inputs using attention neural networks with tree attention layers. Each tree attention layer includes one or more tree attention sub-layers that are each configured to: process query vectors using a decision tree model for the tree attention sub-layer to determine a respective tree path for each query vector; process key vectors using the decision tree model to determine a respective tree path for each key vector; and generate an attended input sequence comprising a respective attended input at each of the plurality of input positions, comprising: generating, for each particular input position, the respective attended input at the particular input position based on (i) the tree path for the query vector at the particular input position, (ii) the respective tree paths for the key vectors at each of the plurality of input positions, and (iii) the value vectors at a subset of the input positions.
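
The three steps named in the abstract can be illustrated with a minimal NumPy sketch. It is not the claimed implementation, and the abstract does not specify the tree form, the subset rule, or the aggregation. The sketch assumes a complete binary tree with one hyperplane split per internal node, takes the subset to be the positions whose key reaches the same leaf as the query, and averages the value vectors over that subset. The names tree_paths, tree_attention, weights, and biases are hypothetical.

```python
import numpy as np

def tree_paths(x, weights, biases, depth):
    """Route each row of x through a complete binary tree.

    Assumption: internal node i (heap order, root = 0) sends a vector
    right when weights[i] . x + biases[i] > 0. Returns the 0/1 decision
    taken at each level, i.e. the tree path of every vector.
    """
    n = x.shape[0]
    node = np.zeros(n, dtype=int)
    paths = np.zeros((n, depth), dtype=int)
    for level in range(depth):
        score = np.einsum("nd,nd->n", x, weights[node]) + biases[node]
        go_right = (score > 0).astype(int)
        paths[:, level] = go_right
        node = 2 * node + 1 + go_right  # child index in heap order
    return paths

def tree_attention(q, k, v, weights, biases, depth):
    """Average the values at positions whose key shares the query's path."""
    q_paths = tree_paths(q, weights, biases, depth)
    k_paths = tree_paths(k, weights, biases, depth)
    # match[i, j] is True when key j follows the same full path as query i.
    match = (q_paths[:, None, :] == k_paths[None, :, :]).all(axis=-1)
    counts = match.sum(axis=1, keepdims=True)
    # Queries with no matching key receive a zero vector (assumption).
    out = match.astype(v.dtype) @ v / np.maximum(counts, 1)
    return out, match

# Toy usage: queries, keys, and values are all the raw inputs
# (no learned projections), with random, untrained tree parameters.
rng = np.random.default_rng(0)
depth, dim, n_positions = 3, 8, 16
x = rng.normal(size=(n_positions, dim))
weights = rng.normal(size=(2**depth - 1, dim))
biases = np.zeros(2**depth - 1)
out, match = tree_attention(x, x, x, weights, biases, depth)
```

Comparing paths replaces the dense query-key score matrix of standard attention with a routing decision, so each position only aggregates values from the positions that land in its own leaf. Variants that match on a shared path prefix or weight values by path overlap would fit the same abstract; this sketch picks exact leaf matching only for simplicity.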
