METHOD AND DEVICE FOR ACQUIRING KEYWORDS
    51.
    发明申请
    METHOD AND DEVICE FOR ACQUIRING KEYWORDS 审中-公开
    获取关键词的方法和设备

    公开(公告)号:US20120288203A1

    公开(公告)日:2012-11-15

    申请号:US13466538

    申请日:2012-05-08

    IPC分类号: G06K9/46

    摘要: Locating text areas in an image and recognizing text contents in the text areas through optical character recognition, OCR; selecting a first class of pending keywords from the recognized text contents to search for webpages; extracting a second class of pending keywords from the retrieved webpages; and determining one or more keywords corresponding to the image from at least the second class of pending keywords. With the embodiment, both OCR and webpage searching can be combined so that the webpages can be retrieved based upon the first class of pending keywords recognized and selected through OCR to ensure convergence of the keywords and then the second class of pending keywords can be selected from the retrieved webpages to ensure correctness of the keywords.

    摘要翻译: 在图像中定位文本区域,并通过光学字符识别OCR识别文本区域中的文本内容; 从识别的文本内容中选择一类待处理的关键字来搜索网页; 从检索的网页中提取第二类待处理的关键字; 以及从至少所述第二类未决关键字确定与所述图像相对应的一个或多个关键字。 利用该实施例,可以组合OCR和网页搜索,使得可以基于通过OCR识别和选择的第一类未决关键字来检索网页,以确保关键字的收敛,然后可以从第 检索的网页以确保关键字的正确性。

    Video text processing apparatus
    52.
    发明授权
    Video text processing apparatus 有权
    视频文本处理装置

    公开(公告)号:US07929765B2

    公开(公告)日:2011-04-19

    申请号:US12778336

    申请日:2010-05-12

    IPC分类号: G06K9/18

    CPC分类号: G06K9/3266 G06K2209/01

    摘要: Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.

    摘要翻译: 通过删除冗余帧和非文本帧,从给定的视频帧中选择包含文本区域的视频帧,通过删除假笔划来选择所选帧中的文本区域,文本区域中的文本行被提取和二值化。

    VIDEO TEXT PROCESSING APPARATUS
    53.
    发明申请
    VIDEO TEXT PROCESSING APPARATUS 有权
    视频文字处理设备

    公开(公告)号:US20100220930A1

    公开(公告)日:2010-09-02

    申请号:US12778336

    申请日:2010-05-12

    IPC分类号: G06K9/46

    CPC分类号: G06K9/3266 G06K2209/01

    摘要: Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.

    摘要翻译: 通过删除冗余帧和非文本帧,从给定的视频帧中选择包含文本区域的视频帧,通过删除假笔划来选择所选帧中的文本区域,文本区域中的文本行被提取和二值化。

    Video text processing apparatus
    54.
    发明授权
    Video text processing apparatus 有权
    视频文本处理装置

    公开(公告)号:US07787705B2

    公开(公告)日:2010-08-31

    申请号:US10737209

    申请日:2003-12-17

    IPC分类号: G06K9/38

    CPC分类号: G06K9/3266 G06K2209/01

    摘要: Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.

    摘要翻译: 通过删除冗余帧和非文本帧,从给定的视频帧中选择包含文本区域的视频帧,通过删除假笔划来选择所选帧中的文本区域,文本区域中的文本行被提取和二值化。

    Degraded character image generation method and apparatus

    公开(公告)号:US20060056697A1

    公开(公告)日:2006-03-16

    申请号:US11200202

    申请日:2005-08-10

    IPC分类号: G06K9/18

    摘要: A method and apparatus for generating a degraded character image at various levels of degradation automatically is presented in this invention. The method comprises rendering the character image on a scene plane; translating and rotating the scene plane according to various parameters; determining a projection region of the character image on an image plane according to various parameters; generating a pixel region mask; and generating a final degraded image by super-sampling. Thus various degraded character images are generated on various conditions of degradation. The generated synthetic characters can be used for performance evaluation and training data augmentation in optical character recognition (OCR).

    Video text processing apparatus
    56.
    发明申请
    Video text processing apparatus 有权
    视频文本处理装置

    公开(公告)号:US20050201619A1

    公开(公告)日:2005-09-15

    申请号:US10737209

    申请日:2003-12-17

    CPC分类号: G06K9/3266 G06K2209/01

    摘要: Video frames that contain text areas are selected from given video frames by removing redundant frames and non-text frames, the text areas in the selected frames are located by removing false strokes, and text lines in the text areas are extracted and binarized.

    摘要翻译: 通过删除冗余帧和非文本帧,从给定的视频帧中选择包含文本区域的视频帧,通过删除假笔划来选择所选帧中的文本区域,文本区域中的文本行被提取和二值化。

    Pattern re-recognizing table generating device and pattern recognizing
device to improve a reliability for a recognition of a pattern
overlapping or intersecting a line in an image
    59.
    发明授权
    Pattern re-recognizing table generating device and pattern recognizing device to improve a reliability for a recognition of a pattern overlapping or intersecting a line in an image 有权
    图案重新识别表生成装置和图案识别装置,用于提高识别与图像中的线重叠或相交的图案的可靠性

    公开(公告)号:US6052480A

    公开(公告)日:2000-04-18

    申请号:US228139

    申请日:1999-01-11

    CPC分类号: G06K9/346 G06K2209/01

    摘要: A character box extracting unit extracts a line forming a character box. Then, the character box intersection calculating unit calculates the intersection of the character box with a character pattern. An intersection corresponding unit associates intersections with each other based on the directional property of character lines, distance between the character lines, etc. An in-box character extracting unit extracts a virtual image according to the association information between the intersections. A character size evaluating unit obtains from an optional character string an average character size of a character including the virtual image, and extracts a true character pattern by removing a redundant virtual image based on the average character size. A character structure analyzing and evaluating unit obtains from a prepared table a true image corresponding to the virtual image and extracts a true character pattern, thereby correctly extracting the pattern from the image in which the line crosses the pattern.11

    摘要翻译: 字符盒提取单元提取形成字符框的行。 然后,字符框交点计算单元计算字符框与字符模式的交集。 交叉对应单元基于字符行的方向属性,字符行之间的距离等将交点相互关联。盒内字符提取单元根据交叉点之间的关联信息提取虚拟图像。 字符尺寸评估单元从可选字符串获得包括虚拟图像的字符的平均字符大小,并且通过基于平均字符大小去除冗余虚拟图像来提取真实字符图案。 字符结构分析和评估单元从准备好的表中获取与虚拟图像相对应的真实图像,并提取真实的字符图案,从而从图形中正确地提取图案。

    Method apparatus for assigning temporary and true labels to digital image
    60.
    发明授权
    Method apparatus for assigning temporary and true labels to digital image 失效
    用于将临时和真实标签分配给数字图像的方法装置

    公开(公告)号:US5909507A

    公开(公告)日:1999-06-01

    申请号:US843187

    申请日:1997-04-14

    摘要: A method and apparatus for assigning a temporary label to each connected area in an image by scanning the image by using a window which has a size of two pixels in the vertical direction and of a plurality of pixels in the horizontal direction. A set of values of pixels contained in the above window is obtained and one of predetermined temporary label assignment rules corresponding to the obtained set of pixel values is selected. A temporary label is assigned to each pixel contained in the window, based on the above one of the temporary label assignment rules determined as above, and on temporary labels of pixels in the second group in the window at the above each location. In addition, the temporary labels are converted to true labels, by scanning the image pixel within the at least one circumscribing area only, where each circumscribing area is predetermined so that the at least one circumscribing area contains all pixels which do not belong to a background area in the image.

    摘要翻译: 一种用于通过使用在垂直方向上具有两个像素的大小和在水平方向上的多个像素的窗口扫描图像来将临时标签分配给图像中的每个连接区域的方法和装置。 获得包含在上述窗口中的一组像素值,并且选择与获得的像素值集合对应的预定临时标签分配规则之一。 基于上述确定的上述临时标签分配规则和在上述每个位置的窗口中的第二组中的像素的临时标签,将临时标签分配给包含在窗口中的每个像素。 另外,临时标签被转换为真实的标签,通过仅扫描至少一个限定区域内的图像像素,其中每个限定区域是预定的,使得至少一个限定区域包含不属于背景的所有像素 图像中的区域。