摘要:
In order to provide a computation device such that declining computational efficiency, increasing circuit size, and increasing power consumption are resolved, a computation device according to the present invention comprises: first data storage means for storing data to be computed; computation processing means for executing a computation, using the data; data sort means for selecting data which is instructed from the data to be computed which is stored in the first data storage means and data which is computed with the computation processing means, appending a prescribed delay, on the basis of a parameter, to the data among the instructed data whereof a delay instruction is received, executing a sort of the instructed data on the basis of the parameter without delaying the data whereof the delay instruction is not received, and outputting the data which is computed in the computation processing means and the computation result data of the computation processing means; and second data storage means for storing the computation result data.
摘要:
DSP architectures having improved performance are described. In an exemplary architecture, a DSP includes two MAC units and two ALUs, where one of the ALUs replaces an adder for one of the two MAC units. This DSP may be configured to operate in a dual-MAC/single-ALU configuration, a single-MAC/dual-ALU configuration, or a dual-MAC/dual-ALU configuration. This flexibility allows the DSP to handle various types of signal processing operations and improves utilization of the available hardware. The DSP architectures further includes pipeline registers that break up critical paths and allow operations at a higher clock speed for greater throughput.
摘要:
A programmable signal processing circuit is used to (de-)interleave a data stream. Data from the signal stream is stored in a data memory (28) and read in a different sequence. The programmable signal processing circuit is used for computing addresses, for use in said storing and/or reading. The programmable signal processing circuit has an instruction set that contains an instruction to compute the addresses from preceding addresses that have been used for said storing and/or reading. In response to the instruction the programmable signal processing circuit permutes positions of a plurality of bits from the old address operand and forms of a bit of the new address result as a logic function of a combination of bits from the old address operand. Successive addresses are formed by means of repeated execution of a program loop that contains an address update instruction for computing the addresses.
摘要:
A processor having a unidirectional rotator configured to shift or rotate data in one direction is disclosed. The processor also includes a control unit having logic configured to modify a shift value specified by a registered-based shift, or rotate, instruction in an opposite direction, the modified shift value being usable by the rotator to shift, or rotate, the data in the one direction, and thereby, generate the same result as if the data in the rotator had otherwise been shifted, or rotated, in the opposite direction by the shift value originally specified by the registered-based instruction. The control unit is further configured to bypass the logic and provide to the rotator a shift value specified by a register-based instruction to shift, or rotate, the data in the one direction.
摘要:
A bit manipulation processor, system and method are provided which reduces the number of operations performed during data processing. An additional register is used as a buffer. The buffer has a bit length which is preferably greater than the address boundaries in a memory or register address. A bitstream can be processed using the buffer by itself or in combination with a standard register, depending upon the particular function to be implemented.
摘要:
Digital processor apparatus (1904) having an instruction set architecture (ISA) with instruction words of varying length. In the exemplary embodiment, the processor comprises an extended user-configurable RISC processor with four-stage pipeline (fetch, decode, execute, and writeback) and associated logic (1902, 1908 and 1906) that is adapted to decode and process both 32-bit and 16-bit instruction words present in a single program, thereby increasing the flexibility of the instruction set, and allowing for greater code compression and reduced memory overhead. Free-form use of the different length instructions is provided with no required mode shift. An improved instruction aligner (1908) and code compression architecture is also disclosed.
摘要:
An integrated circuit device (100), comprising an input for receiving an initial data argument (D[31:0]) comprising a plurality of bits. The device also includes circuitry for providing a first shift argument (L[4:0]) indicating a number of shift positions in a first direction, the first shift argument comprising a plurality of bits, and circuitry for providing a second shift argument (R[4:0]) indicating a number of shift positions in a second direction, the second shift argument comprising a plurality of bits. The device also includes a plurality of rotate stages (ROTATE STAGE n ), each comprising an input and an output. One rotate stage (ROTATE STAGE 1), in the plurality of rotate stages, is coupled to receive the initial data argument. Each rotate stage, in the plurality of rotate stages, other than the one rotate stage, is coupled to receive a data argument from an output of another one of the rotate stages in the plurality of rotate stages. Further, each rotate stage, in the plurality of rotate stages, is operable to rotate the data argument input into the corresponding rotate stage in response to less than all bits of at least one of the first and second shift arguments. At least one rotate stage, in the plurality of rotate stages, is operable to rotate the data argument input into the corresponding rotate stage in response to a sum of respective bit positions of the first and second shift arguments.
摘要:
A data processing system (2) is provided which supports shift-and-insert instructions SLI, SRI which serve to shift a source data value by a specified shift amount and then insert bits from that shifted value other than the shifted-in bits into a destination value with the remaining bits within that destination value being unaltered.
摘要:
A method comprising loading a plurality of data bytes from a data cache (304) in response to a load instruction, determining the most significant bit of at least one of the data bytes using a first logic, arranging at least some of the data bytes onto a data bus (318) using a second logic substantially coupled in parallel with the first logic, and performing a sign extension on the data bus using the second logic.