发明授权
US4888679A Method and apparatus using a cache and main memory for both vector
processing and scalar processing by prefetching cache blocks including
vector data elements
失效
使用高速缓存和主存储器的方法和装置,用于通过预取包括向量数据元素的高速缓存块来进行矢量处理和标量处理
- 专利标题: Method and apparatus using a cache and main memory for both vector processing and scalar processing by prefetching cache blocks including vector data elements
- 专利标题(中): 使用高速缓存和主存储器的方法和装置,用于通过预取包括向量数据元素的高速缓存块来进行矢量处理和标量处理
-
申请号: US142794申请日: 1988-01-11
-
公开(公告)号: US4888679A公开(公告)日: 1989-12-19
- 发明人: Tryggve Fossum , Ricky C. Hetherington , David B. Fite, Jr. , Dwight P. Manley , Francis X. McKeen , John E. Murray
- 申请人: Tryggve Fossum , Ricky C. Hetherington , David B. Fite, Jr. , Dwight P. Manley , Francis X. McKeen , John E. Murray
- 申请人地址: MA Maynard
- 专利权人: Digital Equipment Corporation
- 当前专利权人: Digital Equipment Corporation
- 当前专利权人地址: MA Maynard
- 主分类号: G06F12/06
- IPC分类号: G06F12/06 ; G06F9/38 ; G06F12/02 ; G06F12/08 ; G06F15/78 ; G06F17/16
摘要:
A main memory and cache suitable for scalar processing are used in connection with a vector processor by issuing prefetch requests in response to the recognition of a vector load instruction. A respective prefetch request is issued for each block containing an element of the vector to be loaded from memory. In response to a prefetch request, the cache is checked for a "miss" and if the cache does not include the required block, a refill request is sent to the main memory. The main memory is configured into a plurality of banks and has a capability of processing multiple references. Therefore the different banks can be referenced simultaneously to prefetch multiple blocks of vector data. Preferably a cache bypass is provided to transmit data directly to the vector processor as the data from the main memory are being stored in the cache. In a preferred embodiment, a vector processor is added to a digital computing system including a scalar processor, a virtual address translation buffer, a main memory and a cache. The scalar processor includes a microcode interpreter which sends a vector load command to the vector processing unit and which also generates vector prefetch requests. The addresses for the data blocks to be prefetched are computed based upon the vector address, the length of the vector and the "stride" or spacing between the addresses of the elements of the vector.
公开/授权文献
信息查询