-
公开(公告)号:US20240248753A1
公开(公告)日:2024-07-25
申请号:US18099588
申请日:2023-01-20
Applicant: Arm Limited
Inventor: Elliot Maurice Simon ROSEMARINE , Alexander Eugene CHALFIN , Rune HOLM
IPC: G06F9/48
CPC classification number: G06F9/4881
Abstract: A processor to: receive a task to be executed, the task comprising a task-based parameter associated with the task, for use in determining a position, within an array of data descriptors, of a particular data descriptor of a particular portion of data to be processed in executing the task. Each of the data descriptors in the array of data descriptors has a predetermined size and is indicative of a location in a storage system of a respective portion of data. The processor derives, based on the task, array location data indicative of a location in the storage system of a predetermined data descriptor, and obtains the particular data descriptor, based on the array location data and the task-based parameter. The processor obtains the particular portion of data based on the particular data descriptor and processes the particular portion of data in executing the task.
-
公开(公告)号:US20220198243A1
公开(公告)日:2022-06-23
申请号:US17132750
申请日:2020-12-23
Applicant: Arm Limited
Inventor: John Wakefield BROTHERS, III , Kartikeya BHARDWAJ , Alexander Eugene CHALFIN , Danny Daysang LOH
IPC: G06N3/04
Abstract: A method of processing input data for a given layer of a neural network using a data processing system comprising compute resources for performing convolutional computations is described. The input data comprises a given set of input feature maps, IFMs, and a given set of filters. The method comprises generating a set of part-IFMs including pluralities of part-IFMs which correspond to respective IFMs, of the given set of IFMs. The method further includes grouping part-IFMs in the set of part-IFMs into a set of selections of part-IFMs. The method further includes convolving, by respective compute resources of the data processing system, the set of selections with the given set of filters to compute a set of part-output feature maps. A data processing system for processing input data for a given layer of a neural network is also described.
-
公开(公告)号:US20240248721A1
公开(公告)日:2024-07-25
申请号:US18414230
申请日:2024-01-16
Applicant: Arm Limited
Inventor: Rune HOLM , Alexander Eugene CHALFIN , Elliot Maurice Simon ROSEMARINE
IPC: G06F9/38
CPC classification number: G06F9/3838
Abstract: A method and apparatus for distributing operations for execution. Input data is received and is subdivided into portions, each comprising a first and second sub-portion. A first operation and a second operation are received. Dependencies between the first and second operations are identified. For each portion the first operation is issued for execution on the first sub-portion to produce a first output sub-portion, and completion is tracked. The first operation is issued for execution on the second sub-portion to produce a second output sub-portion. Depending upon satisfaction of the dependencies in respect of the first sub-portion, either the second operation to be executed on the first output sub-portion is issued, if the dependencies are met; or the second operation, to be executed on the first output sub-portion is stalled, if the dependencies are not met. This is repeated for each subsequent portion.
-
公开(公告)号:US20150015593A1
公开(公告)日:2015-01-15
申请号:US13940716
申请日:2013-07-12
Applicant: ARM LIMITED
Inventor: Alexander Eugene CHALFIN , Bradley Albert GRANTHAM
IPC: G06T1/20
CPC classification number: G06T1/20 , G06F9/3851 , G06T15/005 , G06T15/04 , G09G5/363
Abstract: A graphics processing pipeline includes a vertex transformation stage 14 having a vertex transformation cache 20. If a request to transform vertex data is received and the vertex transformation cache 20 indicates that the transformed vertex data for that received request has already been generated, then a pointer to that previously generated transformed vertex data is output within a result data stream in place of the transform vertex data. The transform vertex data is stored to a memory 10 before being retrieved as required by a rasterization stage 16.
Abstract translation: 图形处理流水线包括具有顶点变换高速缓存20的顶点变换阶段14.如果接收到转换顶点数据的请求,并且顶点变换高速缓存20指示已经生成了针对该接收到的请求的经变换的顶点数据,则指针 到先前生成的变换顶点数据在结果数据流中输出,而不是变换顶点数据。 变换顶点数据在被光栅化阶段16所要求的被检索之前被存储到存储器10中。
-
-
-