Straightening out distorted perspective on images
    91.
    发明授权
    Straightening out distorted perspective on images 有权
    矫正图像扭曲的视角

    公开(公告)号:US08885972B2

    公开(公告)日:2014-11-11

    申请号:US13561242

    申请日:2012-07-30

    IPC分类号: G06K9/40 G06T3/00 G06K9/32

    摘要: Methods for correcting distortions in an image including text, or an image of a page that includes text, are disclosed. The methods include identifying reliable and substantially straight lines from elements in the image. Vanishing points are determined from the lines. Parameters associated with a rectangle are determined. A coordinate conversion is performed.

    摘要翻译: 公开了用于校正包括文本的图像或包括文本的页面的图像中的失真的方法。 这些方法包括从图像中的元素识别可靠且基本上直的线。 消失点是从线上确定的。 确定与矩形相关联的参数。 执行坐标转换。

    Image reflow at word boundaries
    92.
    发明授权
    Image reflow at word boundaries 有权
    图像回流在字边界

    公开(公告)号:US08855413B2

    公开(公告)日:2014-10-07

    申请号:US13107084

    申请日:2011-05-13

    申请人: Ding-Yuan Tang

    发明人: Ding-Yuan Tang

    IPC分类号: G06K9/00 G06K9/48

    摘要: Described is a method for identifying text or other information in one or more images and reflowing images of individual elements of text at a word boundary or character boundary on devices of different sizes. The text may be rescaled while retaining the look and feel of the original text. The size may be scaled according to one or more parameters. Text may be captured in a plurality of images and merged together to form a single document or document-like collection. Text may be fully recognized, indexed, sorted and/or be made searchable. Text may be wrapped around objects and features identified as non-text or non-informational elements in an image. Borders or edges between successive elements of text may be smoothed, combined, overlapped and/or blended. Backgrounds of text may be adjusted to make the appearance of successive elements aesthetically pleasing or as close to the original as possible. Fonts may be automatically generated for display of the text on any device in its original form—with breaks in the text at a word or other natural language boundary not found in the original representation of the text.

    摘要翻译: 描述了用于在一个或多个图像中识别文本或其他信息的方法,以及在不同大小的设备上的字边界或字符边界处的文本的各个元素的回流图像。 文本可以重新缩放,同时保留原始文本的外观和感觉。 可以根据一个或多个参数来缩放大小。 文本可以被捕获在多个图像中并且合并在一起以形成单个文档或类文件集合。 文本可以被完全识别,索引,排序和/或可搜索。 文本可以包裹在图像中被识别为非文本或非信息元素的对象和特征。 文本的连续元素之间的边界或边缘可以被平滑化,组合,重叠和/或混合。 可以调整文本的背景,使连续元素的外观在美观上或尽可能接近原始。 可以自动生成字体,以便在其原始形式的任何设备上显示文本,文本中的文本中的文字或文本原始表示中未找到的其他自然语言边界。

    Dictionary Markup System and Method
    93.
    发明申请
    Dictionary Markup System and Method 审中-公开
    词典标注系统和方法

    公开(公告)号:US20140188456A1

    公开(公告)日:2014-07-03

    申请号:US13728885

    申请日:2012-12-27

    IPC分类号: G06F17/27

    CPC分类号: G06F17/2735

    摘要: A method for providing the appropriate meaning of an entry in a text is described. The method includes the steps of determining if there are alternative meanings of the entry in an electronic dictionary and if there are alternative meanings determining the dictionary markup theme associated with each of the alternative meanings of the entry. Also, the theme associated with the text is determined. For a hierarchical structure associated with themes of entries in the electronic dictionary, the distance between the theme of the text with the dictionary markup theme of the alternative meanings of the entry is compared. Based on the distance between the theme of the text and the dictionary markup theme of the alternative meanings of the entry, the appropriate meaning is selected.

    摘要翻译: 描述了用于提供文本中条目的适当含义的方法。 该方法包括以下步骤:确定在电子词典中是否存在条目的替代含义,以及如果存在确定与条目的每个替代含义相关联的字典标记主题的替代含义。 此外,确定与文本相关联的主题。 对于与电子词典中的条目的主题相关联的层次结构,比较文本的主题与条目的替代含义的字典标记主题之间的距离。 基于文本主题与词条标注主题之间的距离,选择适当的含义。

    Methods of object search and recognition
    94.
    发明授权
    Methods of object search and recognition 有权
    对象搜索和识别方法

    公开(公告)号:US08571262B2

    公开(公告)日:2013-10-29

    申请号:US12877954

    申请日:2010-09-08

    IPC分类号: G06K9/00 G06K9/54

    摘要: Embodiments of the invention disclose techniques for processing of machine-readable forms of unfixed or flexible format. An auxiliary brief description may be optionally specified to determine the spatial orientation of the image. A method of searching for elements of a document comprises the following main operations in addition to the operations of preliminary image processing: selecting the varieties of structural description from several available variants, determining the orientation of the image, selecting the text objects, where the text must be recognized, and determining the minimal required volume of recognition, recognizing the text objects, searching for elements of the form. Searching for elements of the form comprises the following actions: selecting a searched element in the structural description, gaining the algorithm of search constraints from the structural description, searching for the element, testing the obtained variants.

    摘要翻译: 本发明的实施例公开了用于处理未固定或灵活格式的机器可读形式的技术。 可以可选地指定辅助简要描述以确定图像的空间取向。 搜索文档的元素的方法除了初步图像处理的操作之外还包括以下主要操作:从几个可用变体中选择结构描述的品种,确定图像的取向,选择文本对象,其中文本 必须被识别,并确定最小所需的识别量,识别文本对象,搜索表单的元素。 搜索表单的元素包括以下动作:在结构描述中选择搜索到的元素,从结构描述中获取搜索约束的算法,搜索元素,测试获得的变体。

    Defining a layout of text lines of CJK and non-CJK characters
    95.
    发明授权
    Defining a layout of text lines of CJK and non-CJK characters 有权
    定义CJK和非CJK字符的文本行的布局

    公开(公告)号:US08559718B1

    公开(公告)日:2013-10-15

    申请号:US13457968

    申请日:2012-04-27

    申请人: Yuri Chulinin

    发明人: Yuri Chulinin

    IPC分类号: G06K9/00

    摘要: A method is described for creating a scheme for dividing a text line of Chinese, Japanese or Korean (CJK) characters into character cells prior to applying classifiers and recognizing individual characters. Gaps between characters are found as a window is moved down the length of a text line. A histogram is built based on distances from the start of the window to a respective gap as the window is moved. The window is moved to the end of each gap after each gap is found and distances measured. This is repeated until the window reaches the end of the text line. A linear division graph (LDG) is constructed according to the detected gaps. Penalties for certain distances are applied. An optimum path is one with a minimal penalty sum and can be used as a scheme for dividing a text line into character cells.

    摘要翻译: 描述了一种用于创建在应用分类器并识别单个字符之前将中文,日文或韩文(CJK)字符的文本行分割成字符单元的方案。 当窗口沿文本行的长度向下移动时,会发现字符之间的间隙。 当窗口移动时,基于从窗口开始到相应间隙的距离构建直方图。 找到每个间隙并测量距离后,窗口移动到每个间隙的末端。 直到窗口到达文本行的末尾为止。 根据检测到的间隙构建线性分割图(LDG)。 适用于某些距离的罚则。 最佳路径是具有最小惩罚总和的路径,可用作将文本行划分为字符单元格的方案。

    Enhanced multilayer compression of image files using OCR systems
    96.
    发明授权
    Enhanced multilayer compression of image files using OCR systems 有权
    使用OCR系统增强图像文件的多层压缩

    公开(公告)号:US08548241B2

    公开(公告)日:2013-10-01

    申请号:US13545917

    申请日:2012-07-10

    IPC分类号: G06K9/34 G06K9/46

    CPC分类号: H04N1/41 G06K9/00456

    摘要: Described herein is a method for segmenting a document image into a picture component, a special or significant picture component, and a non-picture component. The non-picture component is compressed and may include character blocks. Separately, picture components are compressed with a lossy algorithm or with a preliminary defined compression ratio. Subsequently, the compressed picture component, significant picture component and the compressed non-picture component are saved in memory or in a storage location so that the document image may be recomposed based on the compressed picture component or compressed significant picture component and the compressed non-picture component.

    摘要翻译: 这里描述了一种用于将文档图像分割为图像分量,特殊或有效图像分量和非图像分量的方法。 非图像分量被压缩并且可以包括字符块。 另外,图像分量用有损算法或初步定义的压缩比进行压缩。 随后,压缩图像分量,有效图像分量和压缩非图像分量被保存在存储器或存储位置中,使得文档图像可以基于压缩图像分量或压缩的有效图像分量和压缩的非图像分量重新构成, 图片组件。