摘要:
A method and system including a high speed computer algorithm for rotating images stored in a computer memory, such as those captured by a CCD array. The method and system may be used with an over-the-belt optical character recognition (OCR) reader that captures a pixelized image of the surface of a parcel as it moves along a conveyor. A run-length encoded output image is obtained from a run-length encoded input image, wherein the output image is a rotated version of the input image. A predefined pixel mapping sequence for a rotation angle with a rational tangent is used to map image pixel data. Mapping image pixel data using a predefined pixel mapping sequence allows rotation without floating point computations, and allows the use of a reverse pixel mapping technique to map output runs from an input image in order to avoid the creation of holes or artifacts in the output image. Mapped output runs are scaled to preserve the aspect ratio of the rotated image. The order of processing the elements of the input run-length input table is selected so as to directly create a run-length encoded output table.
摘要:
Methods and apparatus for gray image based text identification. A gray image of a document is preferably subsampled to reduce the amount of information to be processed, while retaining sufficient information for successful processing. The subsampled image is subjected to preprocessing to remove horizontal and vertical lines. The image is then subjected to a morphological open operation. The image is then segmented to separate foreground and background information to produce a foreground image. Region filtering and merging are performed on the foreground image. Region features are then extracted and region identification performed. Homogenous regions are grouped and noise elimination performed, resulting in a number of small regions of known types. Optical character recognition can then be performed on each of the regions. The use of the information provided by variations in pixel lightness and darkness enables text identification to proceed quickly and efficiently.
摘要:
This invention provides a system and method for capturing, detecting and extracting features of an ID, such as a 1D barcode, that employs an efficient processing system based upon a CPU-controlled vision system on a chip (VSoC) architecture, which illustratively provides a linear array processor (LAP) constructed with a single instruction multiple data (SIMD) architecture in which each pixel of the rows of the pixel array are directed to individual processors in a similarly wide array. The pixel data are processed in a front end (FE) process that performs rough finding and tracking of regions of interest (ROIs) that potentially contain ID-like features. The ROI-finding process occurs in two parts so as to optimize the efficiency of the LAP in neighborhood operations—a row-processing step that occurs during image pixel readout from the pixel array and an image-processing step that occurs typically after readout occurs. The relative motion of the ID-containing ROI with respect to the pixel array is tracked and predicted. An optional back end (BE) process employs the predicted ROI to perform feature-extraction after image capture. The feature extraction derives candidate ID features that are verified by a verification step that confirms the ID, creates a refined ROI, angle of orientation and feature set. These are transmitted to a decoding processor or other device.
摘要:
A system and method for tracking movement of objects, such as people, through a scene. A background image comprising image information representing a background of the scene is generated. Image information for a video frame is compared with image information of the background image to generate regions in a video frame which potentially represent objects to be tracked in the scene. Region clusters are formed by combinations of regions if such combination satisfy predetermined criteria, and the region clusters are hypotheses of objects to be tracked. Each region cluster is evaluated to determine whether each represents an object to be tracked, based on object model information and correspondence to region clusters in a prior video frame. A confidence value is generated for each region cluster that represents a likelihood that the region represents an object to be tracked. A correspondence is determined between region clusters in consecutive video frames. Tracks of objects through the scene over time are constructed based upon the correspondence determined between region clusters in consecutive video frames.
摘要:
A classification method and apparatus for classifying an input into one of a plurality of possible outputs. Information representative of the input is compared to a neuron, where the neuron comprises a boundary defined by two or more neuron axes of different length. One of the possible outputs is then selected as corresponding to the input in accordance with that comparison. The invention is also a training method and apparatus for creating a new neuron or adjusting an existing neuron. A feature vector representative of a training input is generated, where the training input corresponds to one of a plurality of possible outputs. If no existing neuron corresponding to the training input encompasses the feature vector, then a new neuron is created, where the new neuron comprises a boundary defined by two or more neuron axes of different length. If the neuron encompasses the feature vector and if the neuron does not correspond to the training input, then the neuron is adjusted spatially, where the adjusted neuron comprises a boundary defined by two or more adjusted neuron axes of different length.
摘要:
A method and apparatus for decoding codes applied to objects for use with a camera and a conveyor system wherein the camera includes an image sensor having a two dimensional field of view (FOV) and the conveyor system moves objects in a first direction of travel through the FOV such that objects enter the FOV along an entry edge and exit the FOV along an exit edge, the method comprising the steps of providing a processor programmed to perform the steps of obtaining images of the FOV, for each image identifying code candidates in at least portions of the image, ordering at least a subset of the code candidates for decoding in a candidate order wherein the candidate order is determined at least in part as a function of the first direction of travel through the FOV, attempting to decode code candidates in the order specified by the direction of travel and when a new image event occurs, foregoing attempts to decode at least a portion of the identified code candidates.
摘要:
A handheld device and method using the device, the device comprising a sensor receiving light from within a field of view (FOV) to generate a plurality of consecutive images of the FOV, a structured light source that is controllable to generate a plurality of light patterns, the source arranged to project at least one light patterns into the FOV where at least a portion of a pattern reflects from an object and is captured by the sensor and a processor to receive images, the processor programmed to control the source to project a pattern into the FOV, locate the pattern in at least one of the generated images, locate discontinuities in the pattern and use the discontinuities to measure at least one dimension.
摘要:
This invention provides a system and method for processing discrete image data within an overall set of acquired image data based upon a focus of attention within that image. The result of such processing is to operate upon a more limited subset of the overall image data to generate output values required by the vision system process. Such output value can be a decoded ID or other alphanumeric data. The system and method is performed in a vision system having two processor groups, along with a data memory that is smaller in capacity than the amount of image data to be read out from the sensor array. The first processor group is a plurality of SIMD processors and at least one general purpose processor, co-located on the same die with the data memory. A data reduction function operates within the same clock cycle as data-readout from the sensor to generate a reduced data set that is stored in the on-die data memory. At least a portion of the overall, unreduced image data is concurrently (in the same clock cycle) transferred to the second processor while the first processor transmits at least one region indicator with respect to the reduced data set to the second processor. The region indicator represents at least one focus of attention for the second processor to operate upon.
摘要:
This invention provides a system and method for capturing, detecting and extracting features of an ID, such as a 1D barcode, that employs an efficient processing system based upon a CPU-controlled vision system on a chip (VSoC) architecture, which illustratively provides a linear array processor (LAP) constructed with a single instruction multiple data (SIMD) architecture in which each pixel of the rows of the pixel array are directed to individual processors in a similarly wide array. The pixel data are processed in a front end (FE) process that performs rough finding and tracking of regions of interest (ROIs) that potentially contain ID-like features. The ROI-finding process occurs in two parts so as to optimize the efficiency of the LAP in neighborhood operations—a row-processing step that occurs during image pixel readout from the pixel array and an image-processing step that occurs typically after readout occurs. The relative motion of the ID-containing ROI with respect to the pixel array is tracked and predicted. An optional back end (BE) process employs the predicted ROI to perform feature-extraction after image capture. The feature extraction derives candidate ID features that are verified by a verification step that confirms the ID, creates a refined ROI, angle of orientation and feature set. These are transmitted to a decoding processor or other device.
摘要:
A system is provided for removing a background pattern from a binary image. The system comprises a device for storing the binary image, and a processing device. The processing device may perform various steps, including (1) dividing the binary image stored in the storage device into a plurality of stored image regions, (2) for each of the plurality of stored image regions, calculating the image density value associated with such stored image region, (3) counting the number of stored image regions associated with each image density value, (4) identifying the image density values having the maximum number of associated stored image regions, (5) selecting as a threshold density value a less dense value from the identified image density values, and (6) for each stored image region stored in the storage device having a density value less than the threshold density value, resetting the stored image region to a zero value.