Abstract:
An electronic device including an array of addressable registers storing data. An input register connected to the array stores an input command parameter ( e.g an opcode of a command) and its associated operands in one or more input registers connected to the addressable register array. A single instance of a command accesses the at least one register of the array. Based on the input command parameter, the command for all of the address operands: reads a datum of the data previously stored in at least one register, updates the datum thereby producing an updated datum, and writes the updated datum into at least one register. The command has multiple address operands referencing the one or more registers and supports two or more of the address operands being identical. The device includes logic circuitry which provides a logical output signal to the processing circuitry indicating which, if any, of the address operands are identical. The processing circuitry based on the logical output, processes first any identical address operands prior to writing the updated datum into the at least one register so that a new instance of the command begins processing by the processing circuitry on a consecutive clock pulse and the command throughput is one command per clock pulse.
Abstract:
A method and apparatus for processing computer instructions. The method comprises forming multiple instruction matrices, wherein each instruction matrix specifies parallel and dependent instructions, and executing the multiple instruction matrices simultaneously in multiple execution units within a fixed number of cycles, wherein each execution unit supports both parallel and serial instruction execution.
Abstract:
Methods and apparatus for controlling a cache memory (110, 112) are described in which the overwwritting of floating point data into any cache line (200) of the cache memory (110, 112) is prohibited when the data stored in such cache line (200) are valid, integer data.
Abstract:
An improved superscalar processor. The processor includes multiple lanes, allowing multiple instructions in a bundle to be executed in parallel. In vector mode, the parallel lanes may be used to execute multiple instances of a bundle, representing multiple iterations of the bundle in a vector run. Scheduling logic determines whether, for each bundle, multiple instances can be executed in parallel. If multiple instances can be executed in parallel, coupling circuitry couples an instance of the bundle from one lane into one or more other lanes. In each lane, register addresses are renamed to ensure proper execution of the bundles in the vector run. Additionally, the processor may include a register bank separate from the architectural register file. Renaming logic can generate addresses to this separate register bank that are longer than used to address architectural registers, allowing longer vectors and more efficient processor operation.
Abstract:
A method and apparatus are provided for processing multiple streams of data on a plurality of execution threads. Data is selected from a plurality of data sources. An address in the data storage device is allocated for the selected data which is loaded into the allocated address. An execution task including the selected data source, the data address and an execution address is constructed and the data task is queued with previously constructed tasks. A determination is made as to which processing resources are required for each task and tasks are selected for execution in dependence on the determination. Tasks selected for execution are distributed across a plurality of processing threads.
Abstract:
The present invention relates an access method for flash memory in an embedded system, including the following steps: divide the flash memory into a main program zone and a storage zone; provide an access circuit having a command register for receiving commands and a microcontroller status buffer; the access circuit detects if the microcontroller executes the command to access the storage zone and, if not, continues to detect; when the microcontroller executes the command to access the storage zone, the access circuit stores the command in the command register for receiving commands, stores the status of the microcontroller in the microcontroller status buffer, and executes the read/write command for the storage zone; the access circuit stores the results of the read/write commands in the command register for receiving commands; and the access circuit retrieves the status of the microcontroller from the microcontroller status buffer to return the microcontroller to its original status to continuously execute access commands. In addition, the present discloses an access circuit which can be used by a microcontroller of an embedded system to read/write external flash memory.
Abstract:
An apparatus for performing multiply-accumulate operations in a microprocessor comprising operand input registers for receiving data to be operated on an adder and a multiplier for performing operations on the data, a result output port for presenting results to the microprocessor, a multiplexer for storing results, an accumulator cache for storing an accumulator value internal to the apparatus, and control circuitry for controlling the operation of the apparatus.