Abstract:
A fast and a scalable approach for computing the forward and inverse DPRT that uses: (i) a parallel array of fixed-point adder trees to compute the additions, (ii) circular shift registers to remove the need for accessing external memory components, (iii) an image block-based approach to DPRT computation that can fit the proposed architecture to available resources, and (iv) fast transpositions that are computed in one or a few clock cycles that do not depend on the size of the input image.
Abstract:
Fast and scalable architectures and methods adaptable to available resources, that (1) compute 2-D convolutions using 1-D convolutions, (2) provide fast transposition and accumulation of results for computing fast cross-correlations or 2-D convolutions, and (3) provide parallel computations using pipelined 1-D convolvers. Additionally, fast and scalable architectures and methods that compute 2-D linear convolutions using Discrete Periodic Radon Transforms (DPRTs) including the use of scalable DPRT, Fast DPRT, and fast 1-D convolutions.
Abstract:
Fast and a scalable algorithms and methods adaptable to available resources for computing (1) the DPRT on multicore CPUs by distributing the computation of the DPRT primary directions among the different cores, and (2) the DPRT on GPUs using parallel, distributed, and synchronized ray computations among the GPU cores with “ray” referring to one of the sums required for computing the DPRT or its inverse along a prime direction.
Abstract:
A fast and a scalable approach for computing the forward and inverse DPRT that uses: (i) a parallel array of fixed-point adder trees to compute the additions, (ii) circular shift registers to remove the need for accessing external memory components, (iii) an image block-based approach to DPRT computation that can fit the proposed architecture to available resources, and (iv) fast transpositions that are computed in one or a few clock cycles that do not depend on the size of the input image.