摘要:
A graphical user interface (GUI) of a business process management (BPM) system is provided to construct a process model that is displayed on a graphical display device as a graphical representation comprising nodes representing process events, activities, or decision points and including computer vision (CV) nodes representing video stream processing, with flow connectors defining operational sequences of nodes and data flow between nodes of the process model. The process model is executed to perform a process represented by the process model including executing CV nodes of the process model by performing video stream processing represented by the CV nodes of the process model. The available CV nodes include a set of video pattern detection nodes, and a set of video pattern relation nodes defining a video grammar of relations between video patterns detectable by the video pattern detection nodes.
摘要:
A signal processing module comprises at least one operational unit incorporating computation units, input and output interfaces able to be linked to a bus and a memory storing data destined for the computation units, the memory being organized so that each data word is stored column-wise over several addresses according to an order dependent on the application, a column having a width of one bit, the words being transferred in series to the computation units.
摘要:
The present invention proposes a dynamically reconfigurable multi-level parallel single instruction multiple data array processing system which has a pixel level parallel image processing element array and a row-parallel array processor. The PE array mainly implements a linear operation which is adapted to be executed in parallel in the low and middle levels of image processing and the RP array implements an operation which is adapted to execute in row-parallel in the low and middle levels of image processing or more complex nonlinear operations. In particularly, such a system can be dynamically reconfigured as an SOM neural network at a low cost of area, and the neural network supports high level of image processing such as a high speed online neural network training and image feature recognition, and completely overcomes a defect that a high level of image processing can't be done by pixel-level parallel processing array in the existing programmable vision chips and parallel vision processors, and facilitates an intelligent and portable real time on-chip vision image system with a complete function at low device cost and low power consumption.
摘要:
A computer-implemented method for method for detecting features in an image. The method includes receiving first and second images at one or more processors. The method also includes processing the first and second images to detect one or more features within the first and second images respectively. The method further includes generating a third image based on processed portions of the first and second images and outputting the third image to another processor. A mobile computing device and GPU are also provided.
摘要:
Large-area electronics (LAE) enables the formation of a large number of sensors capable of spanning dimensions on the order of square meters. An example is X-ray imagers, which have been scaling both in dimension and number of sensors, today reaching millions of pixels. However, processing of the sensor data requires interfacing thousands of signals to CMOS ICs, because the implementation of complex functions in LAE has proven unviable due to the low electrical performance and inherent variability of the active devices available, namely amorphous silicon (a-Si) thin-film transistors (TFTs) on glass. Envisioning applications that perform sensing on even greater scales, disclosed is an approach whereby high-quality image detection is performed directly in the LAE domain using TFTs. The high variability and number of process defects affecting both the TFTs and sensors are overcome using a machine-learning algorithm, known as Error-Adaptive Classifier Boosting (EACB), to form an embedded classifier. Through EACB, the high-dimensional sensor data can be reduced to a small number of weak-classifier decisions, which can then be combined in the CMOS domain to generate a strong-classifier decision.
摘要:
A method for calculating a feature descriptor on a single instruction, multiple data (SIMD) processor is described. The method includes generating histogram bin indexes in a first register. The method also includes generating weights in a second register. The method further includes updating an entire histogram table in a register file based on the histogram bin indexes and the weights without storing any histogram bin to memory. Histogram bins are updated in parallel with a single instruction.
摘要:
A novel data-parallel algorithm is presented for topic modeling on a highly-parallel hardware architectures. The algorithm is a Markov-Chain Monte Carlo algorithm used to estimate the parameters of the LDA topic model. This algorithm is based on a highly parallel partially-collapsed Gibbs sampler, but replaces a stochastic step that draws from a distribution with an optimization step that computes the mean of the distribution directly and deterministically. This algorithm is correct, it is statistically performant, and it is faster than state-of-the art algorithms because it can exploit the massive amounts of parallelism by processing the algorithm on a highly-parallel architecture, such as a GPU. Furthermore, the partially-collapsed Gibbs sampler converges about as fast as the collapsed Gibbs sampler and identifies solutions that are as good, or even better, as the collapsed Gibbs sampler.
摘要:
A processing method and a processing apparatus for a single-channel convolution layer, and a processing method and apparatus for a multi-channel convolution layer are provided. The processing method for a single-channel convolution layer includes following steps. Data to be processed is divided into a plurality of data blocks, the plurality of data blocks are read by a plurality of graphic processor thread groups into local memories of the plurality of graphic processor thread groups, a plurality of filters are read by the plurality of graphic processor thread groups into the local memories of the plurality of graphic processor thread groups and convolutions of corresponding data points in the plurality of data blocks and the plurality of filters simultaneously are calculated by a plurality of threads in the plurality of graphic processor thread groups.
摘要:
There has been a significant advance in the capabilities of electro-optical sensors to search wide areas and provide data streams that contain information critical to system operators. The problem being addressed by this invention is the accurate and timely interpretation of the observations made by these sensor suites and the instantiation of the processing on practical low power, high throughput processors which enable deployment on a wide variety of platforms. The interpretation of sensor observations will also depend upon a) the general situation, e.g. level of hostility, and b) collateral data, e.g. normal or abnormal operations of the platforms themselves. Can accurate and timely situation awareness be achieved? Yes, humans do it all the time. Can it be done on small, ultra-low power, ultra-high throughput processors? Yes 3D stacked analog ASIC circuits enable such processors.
摘要:
This invention enables effective corner detection of pixels of an image using the FAST algorithm using a vector SIMD processor. This invention loads an 8×8 pixel block that includes four 7×7 pixel blocks including the 16 peripheral pixels to be tested for each of four center pixels. This invention rearranges the 64 pixels of the 8×8 block to form a 16 element array for each center pixel preferably using a vector permutation instruction. This invention uses vector SIMD subtraction and compare and vector SIMD addition and compare to make the FAST algorithm comparisons. The N consecutive pixels determinations of the FAST algorithm are made from the results of plural shift and AND operations. The corresponding center pixel is marked a corner or not a corner dependent upon of the results of plural shift and AND operations.