摘要:
A single instruction stream multiple data steam data flow signal processor and method for implementing macro data flow control, in which a plurality of data processors have contentionless access to global memory data. Data processing tasks for the data processors, such as primitive executions together with graph scheduling processes, are developed asynchronously and in parallel from a command program and a macro data flow control graph.
摘要:
A plurality of control processors distribute data to be processed to a corresponding plurality of arithmetic processors using a dynamic allocation arrangement. The arithmetic processors process blocks of information and communicate the blocks of processed information to the control processors for subsequent processing as necessary. The control processors are implemented to efficiently allocate tasks to the arithmetic processors and to partition blocks of data for allocation as separate partitioned tasks to the arithmetic processors. This partitioning of blocks of data to be processed as partitioned tasks significantly reduces the delay until the results are available by distributing the task over a plurality of arithmetic processors. It also significantly improves utilization of processing resources by assigning the task to a plurality of processors that may be available.
摘要:
A data driven method for coordinating the processing of arithmetic tasks in a multiple computer system having a multiplicity of arithmetic processors by determining whether an arithmetic task is in a blocked condition or is in an execution ready condition. A source distributed processor stores data in a local memory for processing by a local processor and then transfers the processed data to a global memory for buffering in preparation for subsequent processing by a destination distributed processor. The source distributed processor generates a produce message to a destination distributed processor to indicate that the data to be transferred is available in a buffer in the global memory. The destination distributed processor loads the data to be transferred from the buffer in the global memory and then generates a consume message to the source distributed processor to indicate that the data has been transferred from the global memory and the buffer in the global memory is now available.