Text region detection system and method
    1.
    发明授权
    Text region detection system and method 有权
    文本区域检测系统及方法

    公开(公告)号:US08867828B2

    公开(公告)日:2014-10-21

    申请号:US13324282

    申请日:2011-12-13

    IPC分类号: G06K9/62 G06K9/32

    摘要: A method for detecting a text region in an image is disclosed. The method includes detecting a candidate text region from an input image. A set of oriented gradient images is generated from the candidate text region, and one or more detection window images of the candidate text region are captured. A sum of oriented gradients is then calculated for a region in one of the oriented gradient images. It is classified whether each detection window image contains text by comparing the associated sum of oriented gradients and a threshold. Based on the classifications of the detection window images, it is determined whether the candidate text region is a true text region.

    摘要翻译: 公开了一种用于检测图像中的文本区域的方法。 该方法包括从输入图像中检测候选文本区域。 从候选文本区域生成一组定向梯度图像,并且捕获候选文本区域的一个或多个检测窗口图像。 然后针对一个定向梯度图像中的区域计算定向梯度的总和。 通过比较相关联的归一化梯度和阈值来分类每个检测窗口图像是否包含文本。 基于检测窗口图像的分类,确定候选文本区域是否是真实文本区域。

    TEXT REGION DETECTION SYSTEM AND METHOD
    2.
    发明申请
    TEXT REGION DETECTION SYSTEM AND METHOD 有权
    文本区域检测系统和方法

    公开(公告)号:US20120224765A1

    公开(公告)日:2012-09-06

    申请号:US13324282

    申请日:2011-12-13

    IPC分类号: G06K9/62 G06K9/34

    摘要: A method for detecting a text region in an image is disclosed. The method includes detecting a candidate text region from an input image. A set of oriented gradient images is generated from the candidate text region, and one or more detection window images of the candidate text region are captured. A sum of oriented gradients is then calculated for a region in one of the oriented gradient images. It is classified whether each detection window image contains text by comparing the associated sum of oriented gradients and a threshold. Based on the classifications of the detection window images, it is determined whether the candidate text region is a true text region.

    摘要翻译: 公开了一种用于检测图像中的文本区域的方法。 该方法包括从输入图像中检测候选文本区域。 从候选文本区域生成一组定向梯度图像,并且捕获候选文本区域的一个或多个检测窗口图像。 然后针对一个定向梯度图像中的区域计算定向梯度的总和。 通过比较相关联的归一化梯度和阈值来分类每个检测窗口图像是否包含文本。 基于检测窗口图像的分类,确定候选文本区域是否是真实文本区域。

    Text detection using image regions
    3.
    发明授权
    Text detection using image regions 有权
    使用图像区域进行文本检测

    公开(公告)号:US08942484B2

    公开(公告)日:2015-01-27

    申请号:US13412853

    申请日:2012-03-06

    IPC分类号: G06K9/00 G06K9/32 G06K9/46

    摘要: A method includes receiving an indication of a set of image regions identified in image data. The method further includes, selecting image regions from the set of image regions for text extraction at least partially based on image region stability.

    摘要翻译: 一种方法包括接收在图像数据中标识的一组图像区域的指示。 该方法还包括:至少部分地基于图像区域稳定性从用于文本提取的图像区域集合中选择图像区域。

    MOBILE FAX MACHINE WITH IMAGE STITCHING AND DEGRADATION REMOVAL PROCESSING
    4.
    发明申请
    MOBILE FAX MACHINE WITH IMAGE STITCHING AND DEGRADATION REMOVAL PROCESSING 审中-公开
    移动传真机与图像缝合和降解处理

    公开(公告)号:US20130027757A1

    公开(公告)日:2013-01-31

    申请号:US13194872

    申请日:2011-07-29

    IPC分类号: H04N1/387

    摘要: A method of scanning an image of a document with a portable electronic device includes interactively indicating in substantially real time on a user interface of the portable electronic device, an instruction for capturing at least one portion of an image to enhance quality. The indication is in response to identifying degradation associated with the portion(s) of the image. The method also includes capturing the portion(s) of the image with the portable electronic device according to the instruction. The method further includes stitching the captured portion(s) of the image in place of a degraded portion of a reference image corresponding to the document, to create a corrected stitched image of the document.

    摘要翻译: 使用便携式电子设备扫描文档的图像的方法包括在便携式电子设备的用户界面上基本上实时地交互地指示用于捕获图像的至少一部分以提高质量的指令。 该指示是响应于识别与图像的部分相关联的退化。 该方法还包括根据该指令用便携式电子设备捕获图像的部分。 该方法还包括将图像的拍摄部分拼接代替对应于文档的参考图像的劣化部分,以创建文档的经过校正的拼接图像。

    Object tracking and processing
    5.
    发明授权
    Object tracking and processing 有权
    对象跟踪和处理

    公开(公告)号:US09349066B2

    公开(公告)日:2016-05-24

    申请号:US13567412

    申请日:2012-08-06

    摘要: A method includes tracking an object in each of a plurality of frames of video data to generate a tracking result. The method also includes performing object processing of a subset of frames of the plurality of frames selected according to a multi-frame latency of an object detector or an object recognizer. The method includes combining the tracking result with an output of the object processing to produce a combined output.

    摘要翻译: 一种方法包括跟踪视频数据的多个帧中的每一个中的对象以产生跟踪结果。 该方法还包括执行根据对象检测器或对象识别器的多帧等待时间选择的多个帧的帧的子集的对象处理。 该方法包括将跟踪结果与对象处理的输出组合以产生组合输出。

    Method to reject false positives detecting and tracking image objects
    6.
    发明授权
    Method to reject false positives detecting and tracking image objects 有权
    拒绝检测和跟踪图像对象的误报的方法

    公开(公告)号:US08836799B2

    公开(公告)日:2014-09-16

    申请号:US13550531

    申请日:2012-07-16

    IPC分类号: H04N5/228

    摘要: Methods, apparatuses, systems, and computer-readable media for rejecting false positive detection and tracking of image objects are presented. According to one or more aspects, a computing device may implement embodiments of the invention to use the movement of the mobile device for distinguishing false positives from true movement of the mobile device depicted in the field of view of the camera. In one embodiment, the actual movement of the mobile device may be measured using multi-modal sensor data from inertial sensors such as accelerometers and gyroscopes. In another embodiment, the actual movement of the device is calculated using the global movement of the mobile phone with reference to other objects in the field of view of the camera.

    摘要翻译: 提出了用于拒绝图像对象的假阳性检测和跟踪的方法,装置,系统和计算机可读介质。 根据一个或多个方面,计算设备可以实现本发明的实施例,以使用移动设备的移动来区分在相机视野中描绘的移动设备的真实移动的假阳性。 在一个实施例中,可以使用来自诸如加速度计和陀螺仪的惯性传感器的多模态传感器数据来测量移动设备的实际移动。 在另一个实施例中,使用移动电话参照摄像机视野中的其他对象的全局移动来计算设备的实际移动。

    Apparatus and method for separating music and voice using independent component analysis algorithm for two-dimensional forward network
    8.
    发明授权
    Apparatus and method for separating music and voice using independent component analysis algorithm for two-dimensional forward network 有权
    用于二维前向网络的独立分量分析算法分离音乐和声音的装置和方法

    公开(公告)号:US07122732B2

    公开(公告)日:2006-10-17

    申请号:US10859469

    申请日:2004-06-02

    IPC分类号: G10H7/00

    摘要: Provided is an apparatus and method for separating music and voice using an independent component analysis method for a two-dimensional forward network. The apparatus of separating music and voice can separate voice signal and a music signal, each of which are independently recorded, from a mixed signal, in a short convergence time by using the independent component analysis method, which estimates a signal mixing process according to a difference in record positions of sensors. Thus, users can easily select accompaniment from their own compact discs(CDs), digital video discs(DVDs), or audio cassette tapes, or FM radio, and listen to music of improved quality in real time. Accordingly, the users can just enjoy the music or sing along. Furthermore, since the independent component analysis method in the apparatus of separating music and voice is simple and time taken to perform the method is not long, the method can be easily used in a digital signal processor (DSP) chip, a microprocessor, or the like.

    摘要翻译: 提供了一种用于使用用于二维前向网络的独立分量分析方法来分离音乐和语音的装置和方法。 分离音乐和声音的装置可以通过使用独立分量分析方法将混合信号中的每个独立记录的语音信号和音乐信号在短的收敛时间内分离,该独立分量分析方法根据 传感器记录位置差异。 因此,用户可以容易地从其自己的光盘(CD),数字视频光盘(DVD)或音频盒式磁带或FM收音机中选择伴奏,并且实时地听音质提高质量。 因此,用户可以欣赏音乐或唱歌。 此外,由于分离音乐和语音的装置中的独立分量分析方法简单,并且执行该方法所花费的时间不长,所以该方法可以容易地用于数字信号处理器(DSP)芯片,微处理器或 喜欢。

    Parallel processing method and apparatus for determining text information from an image
    9.
    发明授权
    Parallel processing method and apparatus for determining text information from an image 有权
    用于从图像确定文本信息的并行处理方法和装置

    公开(公告)号:US09202127B2

    公开(公告)日:2015-12-01

    申请号:US13539797

    申请日:2012-07-02

    摘要: A method for processing a multi-channel image is disclosed. The method includes generating a plurality of grayscale images from the multi-channel image. At least one text region is identified in the plurality of grayscale images and text region information is determined from the at least one text region. The method generates text information of the multi-channel image based on the text region information. If the at least one text region includes a plurality of text regions, text region information from the plurality of text regions is merged to generate the text information. The plurality of the grayscale images is processed in parallel. In identifying the at least one text region, at least one candidate text region may be identified in the plurality of grayscale images and the at least one text region may be identified in the identified candidate text region.

    摘要翻译: 公开了一种用于处理多通道图像的方法。 该方法包括从多声道图像生成多个灰度图像。 在多个灰度图像中识别至少一个文本区域,并且从至少一个文本区域确定文本区域信息。 该方法基于文本区域信息生成多通道图像的文本信息。 如果至少一个文本区域包括多个文本区域,则合并来自多个文本区域的文本区域信息以生成文本信息。 多个灰度图像并行处理。 在识别至少一个文本区域时,可以在多个灰度图像中识别至少一个候选文本区域,并且可以在识别的候选文本区域中识别至少一个文本区域。

    Blurred image detection for text recognition
    10.
    发明授权
    Blurred image detection for text recognition 失效
    模糊图像检测用于文本识别

    公开(公告)号:US08665338B2

    公开(公告)日:2014-03-04

    申请号:US13039875

    申请日:2011-03-03

    IPC分类号: H04N5/228

    摘要: Techniques are described for identifying blurred images and recognizing text. One or more images of text may be captured. A change of movement associated with each image of the one or more images may be calculated. The change of movement associated with an image of the one or more images represents a change in an amount of acceleration of the device used to capture the image while the image was being captured. A steady image may be selected from the one or more images to use for text recognition. The steady image can be selected using the variances of acceleration associated with each image of the one or more images.

    摘要翻译: 描述了识别模糊图像和识别文本的技术。 可以捕获一个或多个文本图像。 可以计算与一个或多个图像的每个图像相关联的移动变化。 与一个或多个图像的图像相关联的运动的变化表示在拍摄图像时用于捕获图像的装置的加速度的变化。 可以从用于文本识别的一个或多个图像中选择稳定的图像。 可以使用与一个或多个图像的每个图像相关联的加速度的方差来选择稳定图像。