-
公开(公告)号:US20240193089A1
公开(公告)日:2024-06-13
申请号:US18063478
申请日:2022-12-08
Applicant: Arm Limited
Inventor: Jens OLSON , Jared Corey SMOLENS
IPC: G06F12/0875
CPC classification number: G06F12/0875 , G06F2212/1024 , G06F2212/221 , G06F2212/452
Abstract: A processor comprising a first storage managed as a circular buffer to store a plurality of data structures. Each data structure comprises: an identifier, a size indicator and first data associated with instructions for execution of a task. The processor is configured for searching for a data structure in the first storage. A data structure subsequent to the tail data structure can be located using a storage address in the first storage of a tail data structure and the size indicator of all data structures preceding the second data structure among the plurality of data structures. When a data structure is found, the task may be executed based at least in part on the first data of the found data structure.
-
公开(公告)号:US20230084603A1
公开(公告)日:2023-03-16
申请号:US17474568
申请日:2021-09-14
Applicant: Arm Limited , Apical Limited
Inventor: Eric KUNZE , Jared Corey SMOLENS , Aaron DEBATTISTA , Elliot Maurice Simon ROSEMARINE
Abstract: Aspects of the present disclosure relate to apparatus comprising execution circuitry comprising at least one execution unit to execute program instructions, and control circuitry. The control circuitry receives a stream of processing instructions, and issues each received instruction to one of said at least one execution unit. Responsive to determining that a first type of context switch is to be performed from an initial context to a new context, issuing continues until a pre-emption point in the stream of processing instructions is reached. Responsive to reaching the pre-emption point, state information is stored, and the new context is switched to. Responsive to determining that a context switch is to be performed to return from the new context to the initial context, the processing status is restored from the state information, and the stream of processing instructions is continued.
-
公开(公告)号:US20210303307A1
公开(公告)日:2021-09-30
申请号:US16834833
申请日:2020-03-30
Applicant: Arm Limited
Inventor: Jens OLSON , John Wakefield BROTHERS, III , Jared Corey SMOLENS , Chi-wen CHENG , Daren CROXFORD , Sharjeel SAEED , Dominic Hugo SYMES
Abstract: Herein described is a method of operating an accumulation process in a data processing apparatus. The accumulation process comprises a plurality of accumulations which output a respective plurality of accumulated values, each based on a stored value and a computed value generated by a data processing operation. The method comprises storing a first accumulated value, the first accumulated value being one of said plurality of accumulated values, into a first storage device comprising a plurality of single-bit storage elements; determining that a predetermined trigger has been satisfied with respect to the accumulation process; and in response to the determining, storing at least a portion of a second accumulated value, the second accumulated value being one of said plurality of accumulated values, into a second storage device.
-
公开(公告)号:US20210027148A1
公开(公告)日:2021-01-28
申请号:US16518444
申请日:2019-07-22
Applicant: Arm Limited
Inventor: Lingchuan MENG , John Wakefield BROTHERS, III , Jens OLSON , Jared Corey SMOLENS , Eric KUNZE , Ian Rudolf BRATT
Abstract: A processor arranged to compress neural network activation data comprising an input module for obtaining neural network activation data. The processor also comprises a block creation module arranged to split the neural network activation data into a plurality of blocks; and a metadata generation module for generating metadata associated with at least one of the plurality of blocks. Based on the metadata generated a selection module selects a compression scheme for each of the plurality of blocks, and a compression module for applying the selected compression scheme to the corresponding block to produce compressed neural network activation data. An output module is also provided for outputting the compressed neural network activation data.
-
公开(公告)号:US20240248755A1
公开(公告)日:2024-07-25
申请号:US18099595
申请日:2023-01-20
Applicant: Arm Limited
Inventor: Rune HOLM , Jens OLSON , Jared Corey SMOLENS , Dominic Hugo SYMES , Elliot Maurice Simon ROSEMARINE
CPC classification number: G06F9/4881 , G06F9/3555
Abstract: A processor comprising: a handling unit; a plurality of components each configured to execute a function. The handling unit can receive a task comprising operations on data in a coordinate space having N dimensions, receive a data structure describing execution of the task and comprising a partially ordered set of data items each associated with instructions usable by the plurality of components when executing the task, each data item is associated with a component among the plurality of components, each data item indicates dimensions of the coordinates space for which changes of coordinate causes the function of the associated component to execute, and dimensions of the coordinate space for which changes of coordinate causes the function of the associated component to store data ready to be used by another component. The handling unit iterates over the coordinate space and executes the task using the partially ordered set of data items.
-
公开(公告)号:US20240248754A1
公开(公告)日:2024-07-25
申请号:US18099594
申请日:2023-01-20
Applicant: Arm Limited
Inventor: Elliot Maurice Simon ROSEMARINE , Jared Corey SMOLENS , Rune HOLM , John Wakefield BROTHERS, III , Jens OLSON
IPC: G06F9/48
CPC classification number: G06F9/4881
Abstract: A processor to generate position data indicative of a position within a compressed data stream, wherein, previously, in executing a task, data of the compressed data stream ending at the position has been read by the processor from storage storing the compressed data stream. After reading the data, the processor reads further data of the compressed data stream from the storage, in executing the task, the further data located beyond the position within the compressed data stream. After reading the further data, the processor reads, based on the position data, a portion of the compressed data stream from the storage, in executing the task, starting from the position within the compressed data stream. The processor decompresses the portion of the compressed data stream to generate decompressed data, in executing the task.
-
公开(公告)号:US20230077301A1
公开(公告)日:2023-03-09
申请号:US17469311
申请日:2021-09-08
Applicant: Apical Limited , Arm Limited
Inventor: Aaron DEBATTISTA , Jared Corey SMOLENS
Abstract: Systems, apparatuses and methods suitable for optimizing synchronization mechanisms for multi-core processors are provided. The synchronizing mechanisms may be optimized by receiving a command stream which comprises a plurality of commands including one or more wait commands, wherein each wait command has an associated state and one or more associated conditions; sequentially processing each command in the command stream until a wait command is reached; checking the state associated with the wait command to be processed, wherein if said state is a blocking state, further processing of commands in the command stream is paused until each of said wait command's associated conditions are met, and wherein if said state is a non-blocking state, the next command in the command stream is retrieved and processed.
-
-
-
-
-
-