ARTIFICIAL INTELLIGENCE SEMICONDUCTOR CHIP HAVING WEIGHTS OF VARIABLE COMPRESSION RATIO

    公开(公告)号:US20200302276A1

    公开(公告)日:2020-09-24

    申请号:US16586500

    申请日:2019-09-27

    摘要: An artificial intelligence (AI) semiconductor having an embedded convolution neural network (CNN) may include a first convolution layer and a second convolution layer, in which the weights of the first layer and the weights of the second layer are quantized in different bit-widths, thus at different compression ratios. In a VGG neural network, the weights of a first group of convolution layers may have a different compression ratio than the weights of a second group of convolution layers. The weights of the CNN may be obtained in a training system including convolution quantization and/or activation quantization. Depending on the compression ratio, the weights of a convolution layer may be trained with or without re-training. An AI task, such as image retrieval, may be implemented in the AI semiconductor having the CNN described above.

    Apparatus for recognition of handwritten Chinese characters

    公开(公告)号:US10296817B1

    公开(公告)日:2019-05-21

    申请号:US15941514

    申请日:2018-03-30

    摘要: Apparatus for recognition of handwritten Chinese characters contains a bus, an input means connecting to the bus for receiving input imagery data created from a handwritten Chinese character, a Cellular Neural Networks or Cellular Nonlinear Networks (CNN) based integrated circuit operatively connecting to the bus for extracting features out of the input imagery data using pre-trained filter coefficients of a plurality of order convolutional layers stored therein, a memory connecting the bus, the memory being configured for storing weight coefficients of fully-connected (FC) layers, a processing unit connecting to the bus for performing computations of FC layers to classify the extracted features from the CNN based integrated circuit to a particular Chinese character in a predefined Chinese character set, and a display unit connecting to the bus for displaying the particular Chinese character. Greater than 95% recognition accuracy is achieved using multiple bi-valued 3×3 filter kernels as pre-trained filter coefficients.

    USING AND TRAINING CELLULAR NEURAL NETWORK INTEGRATED CIRCUIT HAVING MULTIPLE CONVOLUTION LAYERS OF DUPLICATE WEIGHTS IN PERFORMING ARTIFICIAL INTELLIGENCE TASKS

    公开(公告)号:US20210019602A1

    公开(公告)日:2021-01-21

    申请号:US16516229

    申请日:2019-07-18

    IPC分类号: G06N3/04 G06N3/063

    摘要: An integrated circuit may include multiple cellular neural networks (CNN) processing engines coupled in a loop circuit and configured to perform an AI task. Each CNN processing engine includes multiple convolution layers, a first memory buffer to store imagery data and a second memory buffer to store filter coefficients. The CNN processing engines are configured to perform convolution operations over an input image simultaneously in one or more iterations. In each iteration, various sub-images of the input image are loaded to the first memory buffer circularly. A portion of the filter coefficients corresponding to the sub-image are loaded to the second memory buffer in a cyclic order. Data may be arranged in the second memory buffer to facilitate loading of duplicate filter coefficients among at least two convolution layers without requiring duplicate memory space. Methods of training a CNN model having duplicate weights are also provided.

    Object detection and recognition apparatus based on CNN based integrated circuits

    公开(公告)号:US10387740B2

    公开(公告)日:2019-08-20

    申请号:US15984334

    申请日:2018-05-19

    摘要: A deep learning object detection and recognition system contains a number of cellular neural networks (CNN) based integrated circuits (ICs) operatively coupling together via the network bus. The system is configured for detecting and then recognizing one or more objects out of a two-dimensional (2-D) imagery data. The 2-D imagery data is divided into N set of distinct sub-regions in accordance with respective N partition schemes. CNN based ICs are dynamically allocated for extracting features out of each sub-region for detecting and then recognizing an object potentially contained therein. Any two of the N sets of sub-regions overlap each other. N is a positive integer. Object detection is achieved with a two-category classification using a deep learning model based on approximated fully-connected layers, while object recognition is performed using a local database storing feature vectors of known objects.

    SYSTEM AND METHOD FOR ENCODING DATA IN AN IMAGE/VIDEO RECOGNITION INTEGRATED CIRCUIT SOLUTION

    公开(公告)号:US20190220699A1

    公开(公告)日:2019-07-18

    申请号:US15871941

    申请日:2018-01-15

    摘要: Methods of encoding image data for loading into an artificial intelligence (AI) integrated circuit are provided. The AI integrated circuit may have an embedded cellular neural network for implementing AI tasks based on the loaded image data. An encoding method may include: using in input image to generate a plurality of output images, wherein each pixel in the input image is approximated by a combination of values of corresponding pixels in the output images; loading the plurality of output images into the AI chip; executing programming instructions contained in the AI chip to generate an image recognition result based on the at least one of the plurality of output images; and outputting the image recognition result. The encoding method also trains a convolution neural network (CNN) and loads the weights of the CNN into the AI integrated circuit for implementing the AI tasks.

    System and method for encoding data in an image/video recognition integrated circuit solution

    公开(公告)号:US10452955B2

    公开(公告)日:2019-10-22

    申请号:US15871945

    申请日:2018-01-15

    摘要: Methods of encoding image data for loading into an artificial intelligence (AI) integrated circuit are provided. The AI integrated circuit may have an embedded cellular neural network for implementing AI tasks based on the loaded image data. An encoding method may apply image splitting, principal component analysis (PCA) or a combination thereof to an input image to generate a plurality of output images. Each output image has a size smaller than the size of the input image. The method may load the output images into the AI chip, execute programming instructions contained in the AI chip to generate an image recognition result based on the at least one of the plurality of output images, and output the image recognition result. The encoding method also trains a convolution neural network (CNN) and loads the weights of the CNN into the AI integrated circuit for implementing the AI tasks.

    Image classification systems based on CNN based IC and light-weight classifier

    公开(公告)号:US10402628B2

    公开(公告)日:2019-09-03

    申请号:US15963990

    申请日:2018-04-26

    摘要: Image classification system contains a CNN based IC configured for extracting features out of input data by performing convolution operations using filter coefficients of ordered convolutional layers and a classifier IC configured for classifying the input data using reduced set of the extracted features based on a light-weight classifier. Light-weight classifier is derived by: training filter coefficients of the ordered convolutional layers using a dataset containing N labeled data, the trained filter coefficients are for the CNN based IC; outputting respective extracted features of the N labeled data after performing convolution operations of ordered convolutional layers using the trained filter coefficients, each labeled data contains X features; creating the reduced set of the extracted features by eliminating those of the X features that contain zeros in at least M of the N labeled data; and adjusting M until the light-weight classifier achieves satisfactory results using the reduced set.

    SYSTEM AND METHOD FOR ENCODING DATA IN AN IMAGE/VIDEO RECOGNITION INTEGRATED CIRCUIT SOLUTION

    公开(公告)号:US20190220700A1

    公开(公告)日:2019-07-18

    申请号:US15871945

    申请日:2018-01-15

    摘要: Methods of encoding image data for loading into an artificial intelligence (AI) integrated circuit are provided. The AI integrated circuit may have an embedded cellular neural network for implementing AI tasks based on the loaded image data. An encoding method may apply image splitting, principal component analysis (PCA) or a combination thereof to an input image to generate a plurality of output images. Each output image has a size smaller than the size of the input image. The method may load the output images into the AI chip, execute programming instructions contained in the AI chip to generate an image recognition result based on the at least one of the plurality of output images, and output the image recognition result. The encoding method also trains a convolution neural network (CNN) and loads the weights of the CNN into the AI integrated circuit for implementing the AI tasks.