System and method for cached convolution calculation

    Publication No.: US11074317B2

    Publication date: 2021-07-27

    Application No.: US16352057

    Filing date: 2019-03-13

    Inventor: Duanduan Yang

    Abstract: A method includes identifying, using at least one processor, input words associated with a user query. The method also includes, for each of one or more of the input words that are contained in a high-frequency word set, retrieving pre-computed element-wise products associated with the input word from a cache. The method further includes performing, using the at least one processor, a convolution operation using the pre-computed element-wise products. In addition, the method includes generating, using the at least one processor, a response to the user query based on results of the convolution operation. The method may also include, for each of one or more of the input words that are not contained in the high-frequency word set, calculating additional element-wise products associated with the input word, and the convolution operation may be performed using the pre-computed element-wise products and the additional element-wise products.
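
The caching idea in this abstract can be illustrated with a minimal sketch. It assumes a one-dimensional text convolution in which each filter position holds one weight vector, so the element-wise product of a word embedding with each filter row can be computed once and reused. The function names, the toy embeddings, and the filter values are hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Set

Vector = List[float]


def elementwise_products(embedding: Vector, filter_rows: List[Vector]) -> List[Vector]:
    """Multiply one word embedding element-wise with every filter row."""
    return [[e * w for e, w in zip(embedding, row)] for row in filter_rows]


def build_cache(embeddings: Dict[str, Vector],
                high_freq_words: Set[str],
                filter_rows: List[Vector]) -> Dict[str, List[Vector]]:
    """Pre-compute the products for every word in the high-frequency set."""
    return {w: elementwise_products(embeddings[w], filter_rows)
            for w in high_freq_words}


def cached_convolution(words: List[str],
                       embeddings: Dict[str, Vector],
                       filter_rows: List[Vector],
                       cache: Dict[str, List[Vector]]) -> List[float]:
    """Slide the filter over the words, reusing cached products where possible."""
    k = len(filter_rows)
    # Cache hit for frequent words; compute on the fly for all other words.
    products = [cache[w] if w in cache
                else elementwise_products(embeddings[w], filter_rows)
                for w in words]
    outputs = []
    for start in range(len(words) - k + 1):
        total = 0.0
        for offset in range(k):
            # The word at start+offset is paired with filter row `offset`.
            total += sum(products[start + offset][offset])
        outputs.append(total)
    return outputs


# Hypothetical toy data: 2-dimensional embeddings and a filter of width 2.
embeddings = {"what": [1.0, 2.0], "is": [0.0, 1.0],
              "the": [1.0, 1.0], "weather": [2.0, -1.0]}
filter_rows = [[1.0, 0.0], [0.5, 2.0]]
cache = build_cache(embeddings, {"what", "is", "the"}, filter_rows)

print(cached_convolution(["what", "is", "the", "weather"],
                         embeddings, filter_rows, cache))  # [3.0, 2.5, 0.0]
```

In this sketch "weather" is outside the high-frequency set, so its products are calculated at query time, which corresponds to the optional path described at the end of the abstract.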

    System and method for hashed compressed weighting matrix in neural networks

    Publication No.: US11531859B2

    Publication date: 2022-12-20

    Application No.: US15853431

    Filing date: 2017-12-22

    Abstract: A method for a neural network includes receiving an input from a vector of inputs, determining a table index based on the input, and retrieving a hash table from a plurality of hash tables, wherein the hash table corresponds to the table index. The method also includes determining an entry index of the hash table based on an index matrix, wherein the index matrix includes one or more index values and each of the one or more index values corresponds to a vector in the hash table, and determining an entry value in the hash table corresponding to the entry index. The method further includes determining a value index, wherein the vector in the hash table includes one or more entry values and the value index corresponds to one of the one or more entry values in the vector, and determining a layer response.
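
One possible reading of the lookup chain in this abstract is sketched below. The abstract does not say how the hash tables are built, so the sketch assumes a codebook-style compression: inputs are quantized to a few levels, each level has its own table of pre-multiplied codebook vectors, and the index matrix records which codebook vector stands in for each block of weights. The names `build_hash_tables`, `layer_response`, `codebook`, `input_levels`, and `sub_len` are all hypothetical.

```python
from typing import List

Vector = List[float]


def build_hash_tables(codebook: List[Vector],
                      input_levels: List[float]) -> List[List[Vector]]:
    """Assumed layout: tables[t][e][j] == input_levels[t] * codebook[e][j]."""
    return [[[level * c for c in vec] for vec in codebook]
            for level in input_levels]


def layer_response(inputs: List[int],
                   hash_tables: List[List[Vector]],
                   index_matrix: List[List[int]],
                   sub_len: int) -> List[float]:
    """Compute one layer's outputs with table lookups instead of multiplications.

    `inputs` holds quantized input levels, used directly as table indices.
    `index_matrix[r][b]` names the codebook vector that replaces block `b`
    of output row `r` in the (never materialized) weight matrix.
    """
    responses = []
    for row in index_matrix:
        total = 0.0
        for pos, table_index in enumerate(inputs):
            table = hash_tables[table_index]   # table chosen by the input
            entry_index = row[pos // sub_len]  # which vector in that table
            value_index = pos % sub_len        # which value in that vector
            total += table[entry_index][value_index]
        responses.append(total)
    return responses


# Hypothetical toy data: 2 codebook vectors, 3 input levels, 4 inputs, 2 outputs.
codebook = [[1.0, -1.0], [0.5, 2.0]]
input_levels = [0.0, 1.0, 2.0]
tables = build_hash_tables(codebook, input_levels)
index_matrix = [[0, 1], [1, 1]]

print(layer_response([1, 2, 0, 1], tables, index_matrix, sub_len=2))  # [1.0, 6.5]
```

Under these assumptions the result equals an ordinary matrix-vector product with the decompressed weights, while only the small codebook, the index matrix, and the per-level tables are stored.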
