EFFICIENT AND CONCURRENT MODEL EXECUTION
Abstract:
An accelerator is disclosed. A circuit may process a data to produce a processed data. A first tier storage may include a first capacity and a first latency. A second tier storage may include a second capacity and a second latency. The second capacity may be larger than the first capacity, and the second latency may be slower than the first latency. A bus may be used to transfer at least one of the data or the processed data between the first tier storage and the second tier storage.
Public/Granted literature
Information query
Patent Agency Ranking
0/0