摘要:
Methods, apparatuses, systems, and computer-readable media for rejecting false positive detection and tracking of image objects are presented. According to one or more aspects, a computing device may implement embodiments of the invention to use the movement of the mobile device for distinguishing false positives from true movement of the mobile device depicted in the field of view of the camera. In one embodiment, the actual movement of the mobile device may be measured using multi-modal sensor data from inertial sensors such as accelerometers and gyroscopes. In another embodiment, the actual movement of the device is calculated using the global movement of the mobile phone with reference to other objects in the field of view of the camera.
摘要:
Methods, apparatuses, systems, and computer-readable media for rejecting false positive detection and tracking of image objects are presented. According to one or more aspects, a computing device may implement embodiments of the invention to use the movement of the mobile device for distinguishing false positives from true movement of the mobile device depicted in the field of view of the camera. In one embodiment, the actual movement of the mobile device may be measured using multi-modal sensor data from inertial sensors such as accelerometers and gyroscopes. In another embodiment, the actual movement of the device is calculated using the global movement of the mobile phone with reference to other objects in the field of view of the camera.
摘要:
A method for recognizing a text block in an object is disclosed. The text block includes a set of characters. A plurality of images of the object are captured and received. The object in the received images is then identified by extracting a pattern in one of the object images and comparing the extracted pattern with predetermined patterns. Further, a boundary of the object in each of the object images is detected and verified based on predetermined size information of the identified object. Text blocks in the object images are identified based on predetermined location information of the identified object. Interim sets of characters in the identified text blocks are generated based on format information of the identified object. Based on the interim sets of characters, a set of characters in the text block in the object is determined.
摘要:
A method for processing a multi-channel image is disclosed. The method includes generating a plurality of grayscale images from the multi-channel image. At least one text region is identified in the plurality of grayscale images and text region information is determined from the at least one text region. The method generates text information of the multi-channel image based on the text region information. If the at least one text region includes a plurality of text regions, text region information from the plurality of text regions is merged to generate the text information. The plurality of the grayscale images is processed in parallel. In identifying the at least one text region, at least one candidate text region may be identified in the plurality of grayscale images and the at least one text region may be identified in the identified candidate text region.
摘要:
A method for processing a multi-channel image is disclosed. The method includes generating a plurality of grayscale images from the multi-channel image. At least one text region is identified in the plurality of grayscale images and text region information is determined from the at least one text region. The method generates text information of the multi-channel image based on the text region information. If the at least one text region includes a plurality of text regions, text region information from the plurality of text regions is merged to generate the text information. The plurality of the grayscale images is processed in parallel. In identifying the at least one text region, at least one candidate text region may be identified in the plurality of grayscale images and the at least one text region may be identified in the identified candidate text region.
摘要:
A method of scanning an image of a document with a portable electronic device includes interactively indicating in substantially real time on a user interface of the portable electronic device, an instruction for capturing at least one portion of an image to enhance quality. The indication is in response to identifying degradation associated with the portion(s) of the image. The method also includes capturing the portion(s) of the image with the portable electronic device according to the instruction. The method further includes stitching the captured portion(s) of the image in place of a degraded portion of a reference image corresponding to the document, to create a corrected stitched image of the document.
摘要:
Techniques are described for identifying blurred images and recognizing text. One or more images of text may be captured. A change of movement associated with each image of the one or more images may be calculated. The change of movement associated with an image of the one or more images represents a change in an amount of acceleration of the device used to capture the image while the image was being captured. A steady image may be selected from the one or more images to use for text recognition. The steady image can be selected using the variances of acceleration associated with each image of the one or more images.
摘要:
A method includes receiving an indication of a set of image regions identified in image data. The method further includes, selecting image regions from the set of image regions for text extraction at least partially based on image region stability.
摘要:
A method includes tracking an object in each of a plurality of frames of video data to generate a tracking result. The method also includes performing object processing of a subset of frames of the plurality of frames selected according to a multi-frame latency of an object detector or an object recognizer. The method includes combining the tracking result with an output of the object processing to produce a combined output.
摘要:
A method for taking a panorama mosaic photograph includes displaying a partial image of a previously taken image as a guide image on a viewer of an image to be currently taken and taking a number of images constituting the panorama mosaic photograph according to a photography operation; projecting the taken images onto a common cylindrically curved surface; and joining the projected images into a single image.