Detection of lists in vector graphics documents
    1.
    Patent application
    Detection of lists in vector graphics documents (status: in force)

    Publication No.: US20070185837A1

    Publication date: 2007-08-09

    Application No.: US11351065

    Filing date: 2006-02-09

    IPC classification: G06F17/30 G06F17/00

    Abstract: Various technologies and techniques detect lists in vector graphics based documents and use them in meaningful ways. The system detects at least one list in a vector graphics based document using a set of rules. Pattern detection logic identifies characters, symbols, numbers, letters, and/or images that may start a list. Additional pattern detection logic determines if a list exists. The system can identify and parse bulleted lists, numbered or lettered lists, and nested lists that are any combination of both. Once identified, the content is translated into a modified format. The content can be output to a destination application in the modified format that is more suitable for output or use by the destination application.
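The two-stage idea in this abstract (first spot lines that may start a list item, then decide whether a list actually exists) can be illustrated with a minimal Python sketch. The marker patterns, the `min_items` threshold, and the function names are assumptions made for illustration; the abstract does not state the actual rules, and this sketch works on plain text lines rather than vector graphics content and does not handle nested lists.

```python
import re

# Hypothetical marker patterns (assumptions, not taken from the patent).
BULLET = re.compile(r"^\s*([\u2022\u25E6\u25AA*\-])\s+(.*)$")
ORDERED = re.compile(r"^\s*(\d+|[A-Za-z])[.)]\s+(.*)$")


def classify(line):
    """Stage 1: return (kind, text) if the line starts with a list marker, else None."""
    m = BULLET.match(line)
    if m:
        return ("bullet", m.group(2))
    m = ORDERED.match(line)
    if m:
        return ("ordered", m.group(2))
    return None


def detect_lists(lines, min_items=2):
    """Stage 2: accept a run of same-kind marker lines as a list if it is long enough."""
    lists, current = [], None
    for line in lines:
        hit = classify(line)
        if hit and current and current["kind"] == hit[0]:
            current["items"].append(hit[1])
        else:
            # The run ended: keep it only if it has enough items to count as a list.
            if current and len(current["items"]) >= min_items:
                lists.append(current)
            current = {"kind": hit[0], "items": [hit[1]]} if hit else None
    if current and len(current["items"]) >= min_items:
        lists.append(current)
    return lists


sample = ["Intro", "\u2022 apples", "\u2022 pears", "Body text",
          "1. first", "2. second", "3. third", "- lonely"]
found = detect_lists(sample)
```

Requiring at least two consecutive items is one simple way to avoid treating a stray dash or a sentence that begins with an initial as a list.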

    Analyzing lines to detect tables in documents
    2.
    Patent application
    Analyzing lines to detect tables in documents (status: in force)

    Publication No.: US20070186152A1

    Publication date: 2007-08-09

    Application No.: US11350614

    Filing date: 2006-02-09

    IPC classification: G06F17/00

    CPC classification: G06F17/211

    Abstract: Various technologies and techniques detect tables in vector graphics based documents and use them in meaningful ways. The system detects at least one table in a vector graphics based document using a set of rules. The rules include analyzing a set of content representing horizontal and vertical lines to find intersections and identifying table cells based on the intersections. Once identified, the table content is translated into a modified format. The content can be output to a destination application in the modified format that is more suitable for output or use by the destination application.
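As a rough illustration of finding intersections between horizontal and vertical lines and deriving cells from them, the following Python sketch may help. The line representation, the tolerance `tol`, and the corner-only cell test are assumptions for illustration; the abstract does not describe the actual rules, and this sketch ignores merged cells and does not verify that the edges between corners are actually drawn.

```python
def intersections(h_lines, v_lines, tol=0.5):
    """Return the set of (x, y) points where a horizontal and a vertical line cross.

    h_lines: iterable of (y, x_start, x_end)
    v_lines: iterable of (x, y_start, y_end)
    tol: slack allowed when a line stops slightly short of the other (assumption)
    """
    points = set()
    for y, x0, x1 in h_lines:
        for x, y0, y1 in v_lines:
            if x0 - tol <= x <= x1 + tol and y0 - tol <= y <= y1 + tol:
                points.add((x, y))
    return points


def cells(points):
    """Return (x0, y0, x1, y1) rectangles whose four corners are all intersections."""
    xs = sorted({x for x, _ in points})
    ys = sorted({y for _, y in points})
    found = []
    for y0, y1 in zip(ys, ys[1:]):          # adjacent rows of intersections
        for x0, x1 in zip(xs, xs[1:]):      # adjacent columns of intersections
            corners = {(x0, y0), (x1, y0), (x0, y1), (x1, y1)}
            if corners <= points:
                found.append((x0, y0, x1, y1))
    return found


# A 2 x 2 grid: three horizontal and three vertical rules.
h = [(0, 0, 20), (10, 0, 20), (20, 0, 20)]
v = [(0, 0, 20), (10, 0, 20), (20, 0, 20)]
pts = intersections(h, v)
grid = cells(pts)
```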

    Creation of semantic objects for providing logical structure to markup language representations of documents
    3.
    Patent application
    Creation of semantic objects for providing logical structure to markup language representations of documents (status: lapsed)

    Publication No.: US20070136660A1

    Publication date: 2007-06-14

    Application No.: US11302639

    Filing date: 2005-12-14

    IPC classification: G06F17/00 G06F7/00

    Abstract: Semantic objects are created that provide a structure for markup language representations of documents. The semantic objects include text runs that are produced from the markup language representation and that are placed into semantic blocks that group text runs according to how text is logically structured in the document being represented. The text runs of each semantic block are ordered to correspond to the logical order of the document being represented. The semantic blocks corresponding to each page of the document being represented are ordered to correspond to the logical order of the document being represented. The ordered semantic blocks including the ordered text runs are saved as a semantic object which can then be utilized to make use of the logical structure of the document being represented by the markup language.
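The ordering described in this abstract (text runs ordered within each semantic block, then blocks ordered per page) can be sketched in Python as follows. The class names, the coordinate fields, and the top-to-bottom, left-to-right reading order are assumptions for illustration only; the abstract does not say how logical order is determined.

```python
from dataclasses import dataclass, field


@dataclass
class TextRun:
    text: str
    x: float  # left edge (assumed coordinate)
    y: float  # top edge, growing downward (assumed coordinate)


@dataclass
class SemanticBlock:
    page: int
    runs: list = field(default_factory=list)

    def top_left(self):
        """Position of the block, taken from its topmost and leftmost runs."""
        return (min(r.y for r in self.runs), min(r.x for r in self.runs))


def build_semantic_object(blocks):
    """Order runs inside each block, then order blocks by page and position."""
    for block in blocks:
        block.runs.sort(key=lambda r: (r.y, r.x))
    return sorted(blocks, key=lambda b: (b.page, *b.top_left()))


body = SemanticBlock(page=1, runs=[TextRun("world", 50, 10), TextRun("Hello", 0, 10)])
title = SemanticBlock(page=0, runs=[TextRun("Title", 0, 0)])
semantic_object = build_semantic_object([body, title])
```

A real document would need a less naive order (multi-column pages, for example), which is why the reading-order rule is kept in one small key function that could be swapped out.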

    Creation of semantic objects for providing logical structure to markup language representations of documents
    4.
    Granted patent
    Creation of semantic objects for providing logical structure to markup language representations of documents (status: lapsed)

    Publication No.: US07853869B2

    Publication date: 2010-12-14

    Application No.: US11302639

    Filing date: 2005-12-14

    IPC classification: G06F17/00

    Abstract: Semantic objects are created that provide a structure for markup language representations of documents. The semantic objects include text runs that are produced from the markup language representation and that are placed into semantic blocks that group text runs according to how text is logically structured in the document being represented. The text runs of each semantic block are ordered to correspond to the logical order of the document being represented. The semantic blocks corresponding to each page of the document being represented are ordered to correspond to the logical order of the document being represented. The ordered semantic blocks including the ordered text runs are saved as a semantic object which can then be utilized to make use of the logical structure of the document being represented by the markup language.

    Palette-based, multi-tint, named color methods and systems
    5.
    Patent application
    Palette-based, multi-tint, named color methods and systems (status: in force)

    Publication No.: US20060238542A1

    Publication date: 2006-10-26

    Application No.: US11112636

    Filing date: 2005-04-22

    IPC classification: G09G5/02

    CPC classification: G09G5/06 G06F17/3025

    Abstract: Palette-based, multi-tint, named-color methods and systems utilize a pixel-by-pixel indexing technique in which individual index values into a palette of interest can be used in different ways for rendering associated images across different devices. For some devices, the index values are used to index into the palette of interest to ascertain a specific indexed color value that is then used to render that pixel of the associated image. For other devices, the index value is used as a means to compute a color value that these other devices then use to render that pixel of the associated image.
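The two uses of a per-pixel index described in this abstract (direct palette lookup on some devices, computing a color from the index on others) can be contrasted with a small Python sketch. The linear tint formula, the `levels` parameter, and both function names are assumptions for illustration; the abstract does not specify how the computed color is derived.

```python
def render_by_lookup(indices, palette):
    """Device type A: each index selects a ready-made RGB entry from the palette."""
    return [palette[i] for i in indices]


def render_by_computation(indices, base_color, levels):
    """Device type B: treat each index as a tint level of one named color.

    Assumption: index 0 means no ink (white) and index levels - 1 means the
    full base color, with linear interpolation in between.
    """
    pixels = []
    for i in indices:
        t = i / (levels - 1)  # tint fraction in [0, 1]
        pixels.append(tuple(round(255 + (c - 255) * t) for c in base_color))
    return pixels


palette = [(255, 255, 255), (255, 191, 191), (255, 128, 128), (255, 64, 64), (255, 0, 0)]
image = [0, 1, 4]

looked_up = render_by_lookup(image, palette)
computed = render_by_computation(image, base_color=(255, 0, 0), levels=5)
```

The same index data drives both paths, which is the point of the scheme: a device without the named colorant falls back to the stored palette entries, while a device that has it can compute the tint directly.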

    ASSOCIATING OPTICAL CHARACTER RECOGNITION TEXT DATA WITH SOURCE IMAGES
    6.
    Patent application
    ASSOCIATING OPTICAL CHARACTER RECOGNITION TEXT DATA WITH SOURCE IMAGES (status: in force)

    Publication No.: US20100080493A1

    Publication date: 2010-04-01

    Application No.: US12240670

    Filing date: 2008-09-29

    IPC classification: G06K7/10

    CPC classification: G06F17/211 G06K9/00442

    Abstract: A system and method for associating optical character recognition text data with source images are provided. In one embodiment, an association module of a computing system is configured to receive text data from an OCR engine; associate the text data with a source image; and output associated optical character recognition data including the source image, the text data associated with the source image, and a plurality of referrers. Each referrer of the plurality of referrers may indicate a different image reference. The plurality of referrers are configured to cause the viewer application to output the text data associated with the source image to each instance of the source image that is rendered as part of the fixed-layout document in accordance with the multiple image references.

    Associating optical character recognition text data with source images
    7.
    Granted patent
    Associating optical character recognition text data with source images (status: in force)

    Publication No.: US08411956B2

    Publication date: 2013-04-02

    Application No.: US12240670

    Filing date: 2008-09-29

    IPC classification: G06K9/18

    CPC classification: G06F17/211 G06K9/00442

    Abstract: A system and method for associating optical character recognition text data with source images are provided. In one embodiment, an association module of a computing system is configured to receive text data from an OCR engine; associate the text data with a source image; and output associated optical character recognition data including the source image, the text data associated with the source image, and a plurality of referrers. Each referrer of the plurality of referrers may indicate a different image reference. The plurality of referrers are configured to cause the viewer application to output the text data associated with the source image to each instance of the source image that is rendered as part of the fixed-layout document in accordance with the multiple image references.
