Fast nearest neighbor search for output generation of convolutional neural networks

    公开(公告)号:US11720789B2

    公开(公告)日:2023-08-08

    申请号:US16672352

    申请日:2019-11-01

    Applicant: Apple Inc.

    CPC classification number: G06N3/08 G06F16/90335 G06F17/16 G06F40/30 G06N3/04

    Abstract: In one embodiment, a method includes receiving an input vector corresponding to a query at a neural network model comprising a plurality of layers, wherein the plurality of layers comprise a last layer associated with a mapping matrix, generating a binary matrix based on the mapping matrix, an identity matrix, and one or more Gaussian vectors, generating an integer vector based on the binary matrix and a binary vector associated with the input vector, identifying a plurality of indices corresponding to a plurality of top values of the integer vector for the integer vector, generating an output vector based on the input vector and a plurality of rows of the mapping matrix, wherein the plurality of rows is associated with the plurality of identified indices, respectively, and determining the query is associated with one or more classes based on the output vector.

    Lookup-based convolutional neural network

    公开(公告)号:US11354538B2

    公开(公告)日:2022-06-07

    申请号:US16908570

    申请日:2020-06-22

    Applicant: Apple Inc.

    Abstract: Systems and methods are disclosed for lookup-based convolutional neural networks. For example, methods may include applying a convolutional neural network to image data based on an image to obtain an output, in which a layer of the convolutional network includes filters with weights that are stored as a dictionary (D) of channel weight vectors, a respective lookup index tensor (I) that indexes the dictionary, and a respective lookup coefficient tensor (C), and in which applying the convolutional neural network includes: convolving the channel weight vectors of the dictionary (D) with an input tensor based on the image to obtain an input dictionary (S), and combining entries of the input dictionary (S) that are indexed with indices from the respective lookup index tensor (I) and multiplied with corresponding coefficients from the respective lookup coefficient tensor (C); and storing, displaying, or transmitting data based on the output of the convolutional neural network.

Patent Agency Ranking