-
公开(公告)号:US20170221550A1
公开(公告)日:2017-08-03
申请号:US15009200
申请日:2016-01-28
Applicant: Apple Inc.
Inventor: Abdulkadir U. Diril , Adam T. Moerschell , Anthony P. DeLaurier
IPC: G11C11/419
CPC classification number: G11C11/419 , G11C7/22 , G11C19/00
Abstract: Techniques relating to providing clock signals to a storage element. Generally, different portions of a given storage element may be clocked according to different schemes. This technique may be pertinent to a storage element that has a portion for which the associated bit values do not change frequently relative to another portion of the storage element. For such a storage element, a high-frequency portion may be clocked upon an access to the storage element, while a low-frequency portion may be clocked only if there is a change in the associated bit values. This technique can be applied to various storage elements, including registers and FIFO buffer entries. An apparatus may be designed such that the low-frequency and high-frequency portions of a storage element do not change during operation. Alternatively, the low-frequency and high-frequency portions of the storage element may be changeable based on a current operating mode of the apparatus.
-
公开(公告)号:US11843788B2
公开(公告)日:2023-12-12
申请号:US17816136
申请日:2022-07-29
Applicant: Apple Inc.
Inventor: Tyson J. Bergland , Anthony P. DeLaurier , Karthik Ramani , Stephan Lachowsky
IPC: H04N11/02 , H04N19/182 , H03M7/30
CPC classification number: H04N19/182 , H03M7/3059
Abstract: Techniques are disclosed relating to data compression. In some embodiments, compression circuitry determines, at least partially in parallel for multiple different lossless compression techniques, amounts of data needed to represent, using a given lossless compression technique of the multiple lossless compression techniques, individual pixels in a set of pixels being compressed. The compression techniques may include neighbor, origin, and gradient techniques, for example. The compression circuitry may select one of the compression techniques based on comparison, among the compression techniques, of sums of: the determined amount of data for an individual pixel for a given lossless compression technique and compression metadata size for a given lossless compression technique. The compression circuitry may generate and store information that encodes values for the set of pixels using the selected compression technique.
-
公开(公告)号:US11488350B2
公开(公告)日:2022-11-01
申请号:US17338846
申请日:2021-06-04
Applicant: Apple Inc.
Inventor: Anthony P. DeLaurier , Karl D. Mann , Tyson J. Bergland , Winnie W. Yeung
Abstract: Techniques are disclosed relating to compression of data stored at different cache levels. In some embodiments, a memory system implements a storage hierarchy that includes first cache circuitry and second cache circuitry at different levels of the hierarchy. Processor circuitry generates write data to be written to the memory system. In some embodiments, first compression circuitry is configured to compress a first block of write data in response to full accumulation of the first block in the first cache circuitry and second compression circuitry is configured to compress a second block of write data in response to full accumulation of the second block in the second cache circuitry. Write circuitry may write the first and second compressed blocks of data in a single combined write to a higher level in the storage hierarchy.
-
公开(公告)号:US11257278B2
公开(公告)日:2022-02-22
申请号:US16953021
申请日:2020-11-19
Applicant: Apple Inc.
Inventor: Anthony P. DeLaurier , Michael J. Swift , Michal Valient , Robert S. Hartog , Tyson J. Bergland , Gokhan Avkarogullari
IPC: G06F12/1009 , G06T1/60 , G06T15/04 , G06F9/38 , G06T15/00 , G06F12/0811 , G06F9/50 , G06T11/00
Abstract: Techniques are disclosed relating to memory allocation for graphics surfaces. In some embodiments, graphics processing circuitry is configured to access a graphics surface based on an address in a surface space assigned to the graphics surface. In some embodiments, first translation circuitry is configured to translate address information for the surface space to address information in the virtual space based on one or more of the translation entries. In some embodiments, the graphics processing circuitry is configured to provide an address for the access to the graphics surface based on translation by the first translation circuitry and second translation circuitry configured to translate the address in the virtual space to an address in a physical space of a memory configured to store the graphics surface. The disclosed techniques may allow sparse allocation of large graphics surfaces, in various embodiments.
-
公开(公告)号:US20180181489A1
公开(公告)日:2018-06-28
申请号:US15389047
申请日:2016-12-22
Applicant: Apple Inc.
Inventor: Anthony P. DeLaurier , Owen C. Anderson , Michael J. Swift , Aaftab A. Munshi , Terence M. Potter
IPC: G06F12/0815 , G06F12/0811 , G06F12/084
CPC classification number: G06F12/0815 , G06F12/0811 , G06F12/084 , G06F2212/455 , G06F2212/621 , G06T1/60
Abstract: Techniques are disclosed relating to memory consistency in a memory hierarchy with relaxed ordering. In some embodiments, an apparatus includes a first level cache that is shared by a plurality of shader processing elements and a second level cache that is shared by the shader processing elements and at least a texture processing unit. In some embodiments, the apparatus is configured to execute operations specified by graphics instructions that include (1) an attribute of the operation that specifies a type of memory consistency to be imposed for the operation and (2) scope information for the attribute that specifies whether the memory consistency specified by the attribute should be enforced at the first level cache or the second level cache. In some embodiments, the apparatus is configured to determine whether to sequence memory accesses at the first level cache and the second level cache based on the attribute and the scope.
-
16.
公开(公告)号:US20160292907A1
公开(公告)日:2016-10-06
申请号:US14677280
申请日:2015-04-02
Applicant: Apple Inc.
Inventor: Christopher A. Burns , Andrew Pomianowski , Anthony P. DeLaurier
CPC classification number: G06T15/04 , G06T11/001 , G06T11/40 , G06T15/005 , G06T15/80 , G06T2200/04 , G06T2200/12
Abstract: Techniques are disclosed relating to interpolation for texture mapping. In some embodiments, a graphics unit includes circuitry configured to map a texture to a screen space such that a set of multiple in the screen space falls between first and second adjacent texels of the texture in a first dimension. In some embodiments, the graphics unit also includes texture processing circuitry configured to perform different types of interpolation for pixels in the group of pixels. In these embodiments, this includes determining pixel attributes for first and second end groups of pixels in the set of pixels using a nearest-neighbor interpolation technique and attributes of the first and second texels respectively. In these embodiments, this also includes determining pixel attributes for an intermediate group of pixels in the set of pixels using a second, different interpolation technique and attributes of both the first and second texels.
Abstract translation: 公开了关于纹理映射的插值的技术。 在一些实施例中,图形单元包括被配置为将纹理映射到屏幕空间的电路,使得屏幕空间中的多个集合在第一维度中落在纹理的第一和第二相邻纹素之间。 在一些实施例中,图形单元还包括纹理处理电路,其被配置为对像素组中的像素执行不同类型的内插。 在这些实施例中,这包括使用最近邻内插技术和第一和第二纹素的属性来确定像素集合中的像素的第一和第二端组的像素属性。 在这些实施例中,这还包括使用第二不同的插值技术和第一和第二纹素两者的属性来确定像素集合中的中间像素组的像素属性。
-
公开(公告)号:US20210134052A1
公开(公告)日:2021-05-06
申请号:US16673883
申请日:2019-11-04
Applicant: Apple Inc.
Inventor: Anthony P. DeLaurier , Karl D. Mann , Tyson J. Bergland , Winnie W. Yeung
Abstract: Techniques are disclosed relating to compression of data stored at different cache levels. In some embodiments, programmable shader circuitry is configured to execute program instructions of compute kernels that write pixel data. In some embodiments, a first cache is configured to store pixel write data from the programmable shader circuitry and first compression circuitry is configured to compress a first block of pixel write data in response to full accumulation of the first block in the first cache circuitry. In some embodiments, second cache circuitry is configured to store pixel write data from the programmable shader circuitry at a higher level in a storage hierarchy than the first cache circuitry and second compression circuitry is configured to compress a second block of pixel write data in response to full accumulation of the second block in the second cache circuitry. In some embodiments, write circuitry is configured to write the first and second compressed blocks of pixel data in a combined write to a higher level in the storage hierarchy.
-
公开(公告)号:US10289565B2
公开(公告)日:2019-05-14
申请号:US15610008
申请日:2017-05-31
Applicant: Apple Inc.
Inventor: Wolfgang H. Klingauf , Kenneth C. Dyke , Karthik Ramani , Winnie W. Yeung , Anthony P. DeLaurier , Luc R. Semeria , David A. Gotwalt , Srinivasa Rangan Sridharan , Muditha Kanchana
IPC: G06F12/08 , G06F12/12 , G06F12/123 , G06F12/0808 , G06F12/0815 , G06F12/0804
Abstract: Systems, apparatuses, and methods for efficiently allocating data in a cache are described. In various embodiments, a processor decodes an indication in a software application identifying a temporal data set. The data set is flagged with a data set identifier (DSID) indicating temporal data to drop after consumption. When the data set is allocated in a cache, the data set is stored with a non-replaceable attribute to prevent a cache replacement policy from evicting the data set before it is dropped. A drop command with an indication of the DSID of the data set is later issued after the data set is read (consumed). A copy of the data set is not written back to the lower-level memory although the data set is removed from the cache. An interrupt is generated to notify firmware or other software of the completion of the drop command.
-
公开(公告)号:US20180181491A1
公开(公告)日:2018-06-28
申请号:US15389153
申请日:2016-12-22
Applicant: Apple Inc.
Inventor: Anthony P. DeLaurier , Luc R. Semeria , Gokhan Avkarogullari , David A. Gotwalt , Robert S. Hartog , Michael J. Swift
IPC: G06F12/0891 , G06F12/0895
CPC classification number: G06F12/0891 , G06F12/0811 , G06F12/084 , G06F12/0864 , G06F12/0895 , G06F12/1063 , G06F12/109 , G06F2212/1024 , G06F2212/302 , G06F2212/455 , G06F2212/60 , G06F2212/657
Abstract: Techniques are disclosed relating to flushing cache lines. In some embodiments, a graphics processing unit includes a cache and one or more storage elements configured to store a plurality of command buffers that include instructions executable to manipulate data stored in the cache. In some embodiments, ones of the cache lines in the cache are configured to store data to be operated on by instructions in the command buffers and a first tag portion that identifies a command buffer that has stored data in the cache line. In some embodiments, the graphics processing unit is configured to receive a request to flush cache lines that store data of a particular command buffer, and to flush ones of the cache lines having first tag portions indicating the particular command buffer as having data stored in the cache lines while maintaining data stored in other ones of the cache lines as valid.
-
公开(公告)号:US20160364899A1
公开(公告)日:2016-12-15
申请号:US14735707
申请日:2015-06-10
Applicant: Apple Inc.
Inventor: Abdulkadir U. Diril , Adam T. Moerschell , Anthony P. DeLaurier
CPC classification number: G06T15/04 , G06F3/14 , G06T1/20 , G06T2200/28 , G06T2210/36 , G09G5/14 , G09G5/363 , G09G5/39 , G09G5/393 , G09G5/395 , G09G2330/021 , G09G2340/04
Abstract: Techniques are disclosed relating to determining the location of a specified level of detail for a graphics texture. In some embodiments, an apparatus includes texture processing circuitry configured to receive information specifying a particular mipmap in a chain of stored mipmaps for a graphics texture and determine an offset address for the particular mipmap. In these embodiments, the texture processing circuitry is configured to determine the offset address by operating on a value that indicates a greatest potential chain size for chains of mipmaps in a graphics processing element. In these embodiments, the operating includes masking upper bits of the value based on a size of the texture and masking lower bits of the value based on a position of the specified mipmap in the chain of stored mipmaps. Disclosed techniques may reduce power consumption and/or area of circuitry configured to determine the offset.
Abstract translation: 公开了关于确定图形纹理的指定级别的细节的位置的技术。 在一些实施例中,设备包括纹理处理电路,其被配置为接收指定用于图形纹理的存储mipmap的链中的特定mipmap的信息,并确定特定mipmap的偏移地址。 在这些实施例中,纹理处理电路被配置为通过对指示图形处理元件中的mipmap的链的最大潜在链大小的值进行操作来确定偏移地址。 在这些实施例中,操作包括基于纹理的大小掩蔽该值的高位,并且基于所存储的mipmap中的指定的mipmap的位置来屏蔽该值的较低位。 公开的技术可以减少配置成确定偏移的电路的功率消耗和/或电路的面积。
-
-
-
-
-
-
-
-
-