Abstract:
A method and an apparatus for recognizing an object are disclosed. The apparatus may extract a plurality of features from an input image using a single recognition model and recognize an object in the input image based on the extracted features. The single recognition model may include at least one compression layer configured to compress input information and at least one decompression layer configured to decompress the compressed information to determine the features.
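The compress-then-decompress structure described above can be illustrated with a minimal sketch. It assumes fully connected layers with ReLU activations and hand-picked weights; the function names `dense` and `extract_features`, the layer sizes, and the weights are all hypothetical and are not taken from the disclosure.

```python
def dense(inputs, weights, biases):
    """Fully connected layer followed by ReLU (an assumed layer type)."""
    return [
        max(0.0, sum(x * w for x, w in zip(inputs, row)) + b)
        for row, b in zip(weights, biases)
    ]


def extract_features(pixels, compress_w, compress_b, decompress_w, decompress_b):
    """Run one compression layer, then one decompression layer.

    The compression layer maps the input to fewer values; the
    decompression layer expands them again to produce the features.
    """
    compressed = dense(pixels, compress_w, compress_b)
    features = dense(compressed, decompress_w, decompress_b)
    return compressed, features


# Hypothetical 4 -> 2 -> 4 model with illustrative weights.
pixels = [1.0, 2.0, 3.0, 4.0]
compress_w = [[1, 0, 1, 0], [0, 1, 0, 1]]
compress_b = [0.0, 0.0]
decompress_w = [[1, 0], [0, 1], [1, 1], [1, -1]]
decompress_b = [0.0, 0.0, 0.0, 0.0]

compressed, features = extract_features(
    pixels, compress_w, compress_b, decompress_w, decompress_b
)
```

In a trained model the weights would be learned, and the resulting features would be passed to a recognition stage that is not sketched here.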
Abstract:
A method of identifying a distracting element in an image (e.g., 1100) is disclosed. A visual attention map (e.g., 1120) is determined for the image (1100), the visual attention map (1120) representing one or more regions of the image, at least one of the regions corresponding to at least a portion of a subject of the image. A salient region map (e.g., 1110) is determined for the image (1100), the salient region map comprising a distribution of visual attraction values defining one or more further regions of the image (1100), the one or more further regions being categorised as salient. An intersection between the visual attention map (1120) and the salient region map (1110) is determined to identify a distracting element in the image (1100). The distracting element corresponds to at least one of the salient regions.
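The intersection step can be sketched as follows. The sketch assumes both maps are same-sized 2D grids of values in [0, 1] and that each is binarised with a threshold. It also assumes that salient cells falling outside the intersection with the attended subject are the distractor candidates; the abstract does not spell out this rule, so it is an interpretation. All function names and threshold values are hypothetical.

```python
def threshold(grid, level):
    """Binarise a 2D grid: True where the value reaches the level."""
    return [[value >= level for value in row] for row in grid]


def intersect(mask_a, mask_b):
    """Cell-wise intersection of two boolean masks of equal shape."""
    return [
        [a and b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(mask_a, mask_b)
    ]


def distractor_candidates(attention, saliency, attention_level=0.5, saliency_level=0.5):
    """Flag salient cells that do not overlap the attended subject.

    Assumption: a salient cell outside the intersection is treated
    as a candidate distracting element.
    """
    attended = threshold(attention, attention_level)
    salient = threshold(saliency, saliency_level)
    overlap = intersect(attended, salient)
    return [
        [s and not o for s, o in zip(row_s, row_o)]
        for row_s, row_o in zip(salient, overlap)
    ]


attention_map = [[0.9, 0.1], [0.8, 0.0]]
salient_map = [[0.7, 0.9], [0.2, 0.1]]
candidates = distractor_candidates(attention_map, salient_map)
```

Here the top-right cell is salient but not attended, so it is the only candidate flagged.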
Abstract:
The invention concerns a method for creating a saliency map of an image, comprising a step of hierarchical decomposition of the image into frequential sub-bands. According to the invention, the method comprises the steps of: movement estimation between the current image (I(t)) and a preceding image (I(t-nT)), calculated from the decomposition into frequential sub-bands, estimating a dominant movement for the image and a local movement for each pixel of the current image; and creation of a temporal saliency map obtained from the global and local movement estimation.
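One way to combine a dominant movement with per-pixel local movement is sketched below. It assumes the local movement is already available as a grid of (dx, dy) vectors, estimates the dominant movement as the component-wise median (a stand-in for whatever global motion model the invention uses), and takes the normalised magnitude of the difference as temporal saliency. The sub-band decomposition is not sketched, and the function name is hypothetical.

```python
import math
from statistics import median


def temporal_saliency(flow):
    """Temporal saliency from local motion relative to dominant motion.

    flow: 2D grid of (dx, dy) local motion vectors, one per pixel.
    Assumption: the dominant movement is the component-wise median.
    """
    vectors = [vector for row in flow for vector in row]
    dominant_dx = median(v[0] for v in vectors)
    dominant_dy = median(v[1] for v in vectors)

    # Magnitude of each pixel's motion once the dominant motion is removed.
    relative = [
        [math.hypot(dx - dominant_dx, dy - dominant_dy) for dx, dy in row]
        for row in flow
    ]

    peak = max(max(row) for row in relative)
    if peak == 0:
        return [[0.0 for _ in row] for row in relative]
    return [[value / peak for value in row] for row in relative]


# Three pixels follow the camera pan; one moves independently.
flow = [[(1, 0), (1, 0)], [(1, 0), (4, 4)]]
saliency = temporal_saliency(flow)
```

Pixels that move with the dominant motion score zero, while the independently moving pixel scores highest.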
Abstract:
The invention concerns a device and a method for creating a saliency map of an image. It comprises the steps of: projection (E1) of said image according to the luminance (A) component and, if said image is a color image, according to the luminance (A) component and according to the chrominance components (Cr1, Cr2); perceptual sub-bands decomposition (E3, T2, T'2, T''2) of said components (A, Cr1, Cr2) according to the visibility threshold of a human eye; extraction (E7) of the salient elements of the sub-bands related to the luminance (A) component; contour enhancement (E8, T7) of said salient elements in each sub-band related to the luminance (A) component; calculation (T7) of a saliency map from the contour enhancement, for each sub-band related to the luminance (A) component; and creation (T8) of the saliency map as a function of the saliency maps obtained for each sub-band.
Abstract:
Described is a system for decoding spiking reservoirs even when the spiking reservoir has continuous synaptic plasticity. The system uses a set of training patterns to train a neural network having a spiking reservoir comprised of spiking neurons. A test pattern duration d is estimated for a set of test patterns P, and each test pattern is presented to the spiking reservoir for a duration of d/P seconds. Output spikes from the spiking reservoir are generated via readout neurons. The output spikes are measured and the measurements are used to compute firing rate codes, each firing rate code corresponding to a test pattern in the set of test patterns P. The firing rate codes are used to decode performance of the neural network by computing a discriminability index (DI) to discriminate between test patterns in the set of test patterns P.
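The firing rate codes mentioned above can be sketched as spike counts per readout neuron divided by the presentation window. The sketch assumes the test patterns are presented back to back in order, each for an equal share of the total duration, and that spikes arrive as (time, readout index) pairs; the function and parameter names are hypothetical. The discriminability index is not sketched because the abstract does not give its formula.

```python
def firing_rate_codes(spikes, total_duration, num_patterns, num_readouts):
    """Compute one firing rate code (spikes per second) per test pattern.

    spikes: iterable of (time_in_seconds, readout_index) pairs.
    Assumption: pattern k occupies the k-th equal slice of total_duration.
    """
    window = total_duration / num_patterns
    counts = [[0] * num_readouts for _ in range(num_patterns)]

    for time, readout in spikes:
        # Clamp so a spike exactly at total_duration lands in the last window.
        pattern = min(int(time // window), num_patterns - 1)
        counts[pattern][readout] += 1

    return [[count / window for count in row] for row in counts]


spikes = [(0.1, 0), (0.2, 0), (0.7, 1), (0.9, 1), (0.95, 0)]
codes = firing_rate_codes(spikes, total_duration=1.0, num_patterns=2, num_readouts=2)
```

Each row of `codes` is the firing rate code for one test pattern, with one entry per readout neuron.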
Abstract:
Certain aspects of the present disclosure support a method and apparatus for analog signal reconstruction and recognition via sub-threshold modulation. Analog waveform recognition in a sub-threshold region of an artificial neuron of an artificial nervous system can be performed by providing a predicted waveform in parallel to an input associated with the artificial neuron. The predicted waveform can be compared with the input, and a signal can be generated based at least in part on the comparison. The signal can be a detection signal that detects matching and mismatching between the input and the predicted waveform.
Abstract:
A method of generating a descriptor of at least part of an image includes receiving image data representing the at least part of the image. The image data is processed to identify at least one texture characteristic of the at least part of the image, thereby generating texture data indicative of a texture of the at least part of the image. The texture data is processed with the image data, thereby generating weighted texture data. A descriptor of the at least part of the image is generated using the weighted texture data.
Abstract:
A computer-implemented technique for feature extraction includes obtaining an electronic image of an object and performing an edge detection algorithm on the electronic image. The technique further includes performing an edge pooling algorithm and sampling edge patches, color patches and texture patches from the electronic image. A set of patches is selected from the edge patches, color patches and texture patches by selecting an (i+1)th patch to be within the set of patches based on a Euclidean distance from an ith patch of the set of patches, for each of the set of edge patches, the set of color patches and the set of texture patches. A part selection algorithm and a part pooling algorithm are performed to obtain parts that are registered to the object.
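The distance-based patch selection can be sketched as a greedy pass over candidate patches. The sketch assumes each patch is a numeric feature vector and that a candidate is accepted as the (i+1)th patch when its Euclidean distance from the ith (most recently selected) patch is at least a threshold. The abstract does not state the exact acceptance rule, so the threshold and the function name are hypothetical.

```python
import math


def select_patches(patches, min_distance):
    """Greedily select patches that differ enough from the previous pick.

    Assumption: a candidate becomes the (i+1)th selected patch when its
    Euclidean distance from the ith selected patch is >= min_distance.
    """
    selected = []
    for patch in patches:
        if not selected or math.dist(patch, selected[-1]) >= min_distance:
            selected.append(patch)
    return selected


candidates = [(0, 0), (0.1, 0), (3, 4), (3, 4.1), (0, 0)]
chosen = select_patches(candidates, min_distance=1.0)
```

Because each candidate is compared only with the most recently selected patch, the final `(0, 0)` is accepted even though it duplicates the first pick. The same routine would be run separately on the edge, color and texture patch sets.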