Abstract:
A deconvolution can be decomposed into multiple convolutions. Results of the convolutions constitute an output of the deconvolution. Zeros may be added to an input tensor of the deconvolution to generate an upsampled input tensor. Subtensors having the same size as the kernel of the deconvolution may be identified from the upsampled input tensor. A subtensor may include one or more input activations and one or more zeros. Subtensors having the same distribution pattern of input activations may be used to generate a reduced kernel. The reduced kernel includes a subset of the weights of the kernel. The position of a weight in the kernel may be the same as the position of an input activation in the subtensor. Multiple reduced kernels may be generated based on multiple subtensors having different distribution patterns of activations. Each of the convolutions may use the input tensor and a different one of the reduced kernels.
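The decomposition described above can be illustrated in one dimension. The following is a minimal sketch, not the claimed implementation: it assumes a 1-D input, a correlation-style sliding window, and zero insertion between activations with a given stride. The function names `deconv_reference` and `deconv_decomposed` are hypothetical.

```python
import numpy as np

def deconv_reference(x, w, stride):
    """Deconvolution via the upsampled input: insert zeros, pad, slide the full kernel."""
    k = len(w)
    up = np.zeros((len(x) - 1) * stride + 1)
    up[::stride] = x                      # zeros between input activations
    up = np.pad(up, k - 1)                # zeros around the border
    return np.array([up[i:i + k] @ w for i in range(len(up) - k + 1)])

def deconv_decomposed(x, w, stride):
    """Same result from several small convolutions on the original input."""
    n, k = len(x), len(w)
    out = np.zeros((n - 1) * stride + k)
    for r in range(stride):               # one activation pattern per phase r
        reduced = w[r::stride]            # weights that line up with activations
        if reduced.size == 0:             # pattern contains only zeros
            continue
        kr = len(reduced)
        xp = np.pad(x, kr - 1)
        conv = np.array([xp[t:t + kr] @ reduced for t in range(n + kr - 1)])
        start = k - 1 - r - (kr - 1) * stride
        out[start::stride] = conv         # interleave the partial results
    return out
```

In this sketch each phase `r` corresponds to one distribution pattern of activations inside a kernel-sized window, and the reduced kernel `w[r::stride]` never multiplies an inserted zero.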
Abstract:
A human-machine-interface system comprises: a register-file-memory configured to store input-data; a first-processing-element-slice; a second-processing-element-slice; and a controller. Each of the processing-element-slices comprises: a register configured to store register-data; and a processing-element configured to apply an arithmetic and logic operation on the register-data in order to provide convolution-output-data. The controller is configured to: load input-data from the register-file-memory into the first-register as the first-register-data; and load either (i) input-data from the register-file-memory, or (ii) the first-register-data from the first-register, into the second-register as the second-register-data.
Abstract:
In one embodiment, a matrix operation may be performed using a plurality of input matrices, wherein the matrix operation is associated with one or more convolution operations. The plurality of input matrices may be partitioned into a plurality of input partitions, wherein the plurality of input matrices is partitioned based on a number of available processing elements. The plurality of input partitions may be distributed among a plurality of processing elements, wherein each input partition is distributed to a particular processing element of the plurality of processing elements. A plurality of partial matrix operations may be performed using the plurality of processing elements, and partial matrix data may be transmitted between the plurality of processing elements while performing the plurality of partial matrix operations. A result of the matrix operation may be determined based on the plurality of partial matrix operations.
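One way to picture the partitioning and the exchange of partial matrix data is a ring schedule for a matrix product. The sketch below is an illustration under assumptions that are not in the abstract: the operation is a plain product `a @ b`, both operands are split by rows, and the processing elements are simulated sequentially in one process. The name `ring_matmul` is hypothetical.

```python
import numpy as np

def ring_matmul(a, b, num_pe):
    """Simulate num_pe processing elements computing a @ b with a ring exchange."""
    a_parts = np.array_split(a, num_pe, axis=0)   # one row block of A per element
    b_parts = np.array_split(b, num_pe, axis=0)   # one row block of B per element
    # Column ranges of A that match each row block of B.
    bounds = np.cumsum([0] + [p.shape[0] for p in b_parts])
    held = list(range(num_pe))                    # which B block each element holds
    partial = [np.zeros((p.shape[0], b.shape[1])) for p in a_parts]
    for _ in range(num_pe):
        for pe in range(num_pe):                  # partial matrix operations
            j = held[pe]
            partial[pe] += a_parts[pe][:, bounds[j]:bounds[j + 1]] @ b_parts[j]
        held = held[1:] + held[:1]                # pass B blocks to the neighbour
    return np.vstack(partial)                     # assemble the final result
```

After `num_pe` steps every element has seen every block of `b`, so stacking the partial results gives the full product.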
Abstract:
Methods and systems are disclosed for increased speed of processing operations on data in two-dimensional arrays, and for detecting a feature in an image. A method for detecting a feature in an image comprises storing, in a set of data memories within a parallel processing system, first image data representing pixels of a first image. The method further comprises storing, in a memory of a host processor coupled to the parallel processing system, feature kernel data representing a set of weight matrices. A method for increased speed of processing operations on data in two-dimensional arrays comprises storing, in a set of data memories within a parallel processing system, first array data representing elements of a first array. The method further comprises, for each of multiple selected elements within the first array, performing a processing operation on the selected element to produce an output element corresponding to the selected element.
Abstract:
The present invention pertains to a process for automatically analyzing mixed DNA samples. Specifically, the process comprises the steps of obtaining a mixed DNA sample; amplifying the DNA sample to produce a product; detecting the product to produce a signal; and analyzing the signal to determine information about the composition of the mixed DNA sample. This DNA mixture analysis is useful for finding criminals and convicting them. This mixture analysis provides high quality estimates, and can determine genotypes, mixture weights, and likelihood ratios. This analysis provides confidence measures in the results it computes, and generates reports and intuitive visualizations. The process automates a tedious manual procedure, thereby reducing the cost, time, and effort involved in DNA forensic analysis. The system can greatly accelerate the rate of DNA crime analysis, and be used to exonerate innocent people.
Abstract:
A determination target region includes a combination of a red gray image, a green gray image, and a blue gray image. A CPU part transforms the three gray value matrices into a single first data array uniquely associated with a pixel position in the determination target region. The CPU part calculates a normalized correlation value with a single second data array transformed from a model image according to the same predetermined rule, and determines whether or not a match is found with the model image according to whether or not the normalized correlation value exceeds a predetermined threshold value.
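The matching step can be sketched as follows. The abstract does not state the predetermined rule, so this example assumes simple concatenation of the flattened red, green, and blue matrices, and it assumes a zero-mean normalized correlation. The function names and the default threshold of 0.9 are hypothetical.

```python
import numpy as np

def to_single_array(r, g, b):
    """Assumed rule: flatten each gray value matrix and concatenate them."""
    return np.concatenate([np.ravel(r), np.ravel(g), np.ravel(b)]).astype(float)

def normalized_correlation(u, v):
    """Zero-mean normalized correlation of two equal-length arrays."""
    u = u - u.mean()
    v = v - v.mean()
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    return float(u @ v / denom) if denom > 0 else 0.0

def is_match(region_rgb, model_rgb, threshold=0.9):
    """Return (match found, correlation value) for a region and a model image."""
    score = normalized_correlation(
        to_single_array(*region_rgb), to_single_array(*model_rgb)
    )
    return score > threshold, score
```

Because both arrays are built by the same rule, each element of the first array is compared with the element for the same pixel position and colour in the second.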
Abstract:
A computer readable medium stores computer executable instructions for causing a computer programmed thereby to perform a method of extrapolating values of pixels of a video object (402), so as to define values for at least one pixel (412) outside of the perimeter (408) of the video object (402), the at least one pixel (412) outside of the perimeter (408) being within a block boundary (406) around the video object (402). The method comprises: scanning a line of pixels within the block boundary (406), wherein the scanning identifies at least one pixel (412) outside of the perimeter (408), and wherein each identified pixel (412) is part of a segment with two end pixels, at least one of the two end pixels having a perimeter (408) pixel value; and, for each identified pixel (412), if both end pixels of the segment including the identified pixel (412) have perimeter (408) pixel values, assigning to the identified pixel (412) an average of the perimeter (408) pixel values, and, if only one end pixel of the segment including the identified pixel (412) has a perimeter (408) pixel value, assigning to the identified pixel (412) the perimeter (408) pixel value.
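The per-line rule above can be sketched for a single scan line. This is an illustrative sketch only: it assumes the object is given as a boolean mask per line, uses a plain arithmetic mean for the average, and leaves a line untouched when it contains no object pixel. The name `pad_line` is hypothetical.

```python
def pad_line(values, inside):
    """Fill pixels outside the object along one scan line.

    values: pixel values of the line; inside: True where the pixel is in the object.
    """
    out = list(values)
    n = len(out)
    i = 0
    while i < n:
        if inside[i]:
            i += 1
            continue
        j = i
        while j < n and not inside[j]:     # find the end of the outside segment
            j += 1
        left = values[i - 1] if i > 0 else None    # perimeter pixel before segment
        right = values[j] if j < n else None       # perimeter pixel after segment
        if left is not None and right is not None:
            fill = (left + right) / 2              # both ends known: average
        elif left is not None:
            fill = left                            # only one end known: copy it
        else:
            fill = right                           # may be None: nothing to copy
        if fill is not None:
            for t in range(i, j):
                out[t] = fill
        i = j
    return out
```

For example, a segment between perimeter values 20 and 40 is filled with 30, while a segment that touches the block boundary on one side copies the single perimeter value it has.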
Abstract:
A two dimensional video convolver (10) generates visually acceptable images on a standard television receiver and includes a first adder stage (12, 14), a second adder stage (16, 18, 20), and two pixel time period delays (22, 24). The first adder stage simultaneously receives binary data from three adjacent pixels in a predetermined line in a first direction of a picture to be reproduced on a display during each pixel scan period. The binary data of each of the three pixels is processed with a predetermined weight to generate a binary first adder stage output signal representative of a weighted sum of the three pixels in the predetermined line. The second adder stage is responsive to the binary first adder stage output signal during a time period for each of three predetermined adjacent lines in a second orthogonal direction of a picture to be reproduced on a display. The second adder stage processes the binary weighted sum of each of the three lines with a predetermined weight to generate a video convolver output signal representative of a weighted sum of the three pixels in a first direction over three adjacent lines in a second orthogonal direction. Weighting is accomplished in the first and second adder stages by using the binary data bits for a pixel as received when the weight for that pixel is a numerical one, and by shifting the received binary data bits by a predetermined number of y bits in a first direction to provide a multiply function when a respective weight is greater than one and equals 2^y.
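A software model of the two adder stages may help make the shift-based weighting concrete. The sketch below is not the hardware design: it assumes weights of 1, 2, 1 in each direction (shift counts 0, 1, 0), processes a whole image at once rather than pixel by pixel, and ignores the pixel delays. The function names are hypothetical.

```python
import numpy as np

def weighted_sum3(a, b, c, shifts):
    """Weighted sum of three operands where each weight is 2**shift."""
    return (a << shifts[0]) + (b << shifts[1]) + (c << shifts[2])

def convolve_3x3_shift(img, h_shifts=(0, 1, 0), v_shifts=(0, 1, 0)):
    """3x3 convolution built from a horizontal stage followed by a vertical stage."""
    img = np.asarray(img, dtype=np.int64)
    # First stage: three adjacent pixels along each line.
    rows = weighted_sum3(img[:, :-2], img[:, 1:-1], img[:, 2:], h_shifts)
    # Second stage: the first-stage sums of three adjacent lines.
    return weighted_sum3(rows[:-2], rows[1:-1], rows[2:], v_shifts)
```

With the default shift counts the combined kernel is the outer product of (1, 2, 1) with itself, and no multiplier is needed in either stage.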
Abstract:
A method of correlation signal processing, particularly for processing the correlation surfaces used in estimating motion within sequences of video pictures, involves matching an assumed quadratic surface in the region of a stationary value.
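One common way to match a quadratic surface near a stationary value is a least-squares fit over the 3x3 neighbourhood of the sampled peak. The sketch below assumes that approach, which the abstract does not specify, and it does not handle a flat neighbourhood where the fitted surface has no unique stationary point. The name `quadratic_peak` is hypothetical.

```python
import numpy as np

def quadratic_peak(surface, row, col):
    """Fit a + b*x + c*y + d*x^2 + e*x*y + f*y^2 around (row, col).

    Returns the (row, col) position of the stationary point of the fitted surface.
    """
    ys, xs = np.mgrid[-1:2, -1:2]                  # offsets within the 3x3 patch
    x = xs.ravel().astype(float)
    y = ys.ravel().astype(float)
    patch = np.asarray(surface, dtype=float)[row - 1:row + 2, col - 1:col + 2].ravel()
    design = np.column_stack([np.ones(9), x, y, x * x, x * y, y * y])
    a, b, c, d, e, f = np.linalg.lstsq(design, patch, rcond=None)[0]
    # Stationary point: set the gradient of the fitted surface to zero.
    hessian = np.array([[2 * d, e], [e, 2 * f]])
    dx, dy = np.linalg.solve(hessian, np.array([-b, -c]))
    return row + dy, col + dx
```

The returned position has sub-sample precision, which is the usual reason for fitting a surface rather than taking the largest sample of the correlation surface.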
Abstract:
A mechanism for determining the parallax between a first and a second digital image employs an iterative image-pattern search and correlation process. The process rapidly determines the parallax for each pixel that is common to the images captured as a stereo image pair. By first using a sparse sub-pattern of reference pixel points distributed over the whole of a first image pattern, a similarly sparse sub-pattern of tie points can be obtained quickly; these tie points serve as starting points for denser search and correlation steps on the second image pattern. The resulting parallax values can then be subjected to a sub-pixel precision interpolation function. Once the parallax between the first and second image patterns has been established, elevations can be calculated using a conventional parallax-dependent mechanism.