Method and apparatus for document processing
    22.
    发明申请
    Method and apparatus for document processing 有权
    文件处理方法和装置

    公开(公告)号:US20050251735A1

    公开(公告)日:2005-11-10

    申请号:US10837043

    申请日:2004-04-30

    CPC分类号: G06F17/2247 G06F17/2288

    摘要: Modular content framework and document format methods and systems are described. The described framework and format define a set of building blocks for composing, packaging, distributing, and rendering document-centered content. These building blocks define a platform-independent framework for document formats that enable software and hardware systems to generate, exchange, and display documents reliably and consistently. The framework and format have been designed in a flexible and extensible fashion. In addition to this general framework and format, a particular format, known as the reach package format, is defined using the general framework. The reach package format is a format for storing paginated documents. The contents of a reach package can be displayed or printed with full fidelity among devices and applications in a wide range of environments and across a wide range of scenarios.

    摘要翻译: 描述了模块化内容框架和文档格式方法和系统。 描述的框架和格式定义了一组构建块,用于组合,打包,分发和呈现以文档为中心的内容。 这些构建模块定义了一个独立于平台的文档格式框架,可以使软件和硬件系统可靠,可靠地生成,交换和显示文档。 框架和格式的设计灵活可扩展。 除了这个一般框架和格式之外,使用通用框架定义了一种称为覆盖包格式的特定格式。 覆盖包格式是用于存储分页文档的格式。 可以在广泛的环境和广泛的场景下,在设备和应用程序之间以完全保真的方式显示或打印覆盖包的内容。

    Search techniques for page-based document layouts
    24.
    发明授权
    Search techniques for page-based document layouts 有权
    基于页面的文档布局的搜索技术

    公开(公告)号:US07647317B2

    公开(公告)日:2010-01-12

    申请号:US11694558

    申请日:2007-03-30

    IPC分类号: G06F17/30 G06F7/00

    摘要: Systems, methods, and/or techniques (“tools”) for improved search techniques for page-based document layouts are described herein. The tools may analyze markup elements defined for pages within source documents, and may determine whether the markup elements for the page may include at least part of a search string.

    摘要翻译: 本文描述了用于基于页面的文档布局的改进的搜索技术的系统,方法和/或技术(“工具”)。 工具可以分析为源文档中的页面定义的标记元素,并且可以确定页面的标记元素是否可以包括搜索字符串的至少一部分。

    Analyzing lines to detect tables in documents
    25.
    发明申请
    Analyzing lines to detect tables in documents 有权
    分析行以检测文档中的表

    公开(公告)号:US20070186152A1

    公开(公告)日:2007-08-09

    申请号:US11350614

    申请日:2006-02-09

    IPC分类号: G06F17/00

    CPC分类号: G06F17/211

    摘要: Various technologies and techniques detect tables in vector graphics based documents and use them in meaningful ways. The system detects at least one table in a vector graphics based document using a set of rules. The rules include analyzing a set of content representing horizontal and vertical lines to find intersections and identifying table cells based on the intersections. Once identified, the table content is translated into a modified format. The content can be output to a destination application in the modified format that is more suitable for output or use by the destination application.

    摘要翻译: 各种技术和技术检测基于矢量图形的文档中的表格,并以有意义的方式使用它们。 系统使用一组规则检测基于矢量图形的文档中的至少一个表。 这些规则包括分析一组表示水平和垂直线的内容,以便根据交点找到交叉点和识别表格单元格。 一旦确定,表格内容将被转换为修改的格式。 内容可以以更适合于目的地应用程序输出或使用的修改格式输出到目标应用程序。

    Search Techniques for Page-Based Document Layouts
    27.
    发明申请
    Search Techniques for Page-Based Document Layouts 有权
    基于页面的文档布局的搜索技术

    公开(公告)号:US20080243814A1

    公开(公告)日:2008-10-02

    申请号:US11694558

    申请日:2007-03-30

    IPC分类号: G06F17/30

    摘要: Systems, methods, and/or techniques (“tools”) for improved search techniques for page-based document layouts are described herein. The tools may analyze markup elements defined for pages within source documents, and may determine whether the markup elements for the page may include at least part of a search string.

    摘要翻译: 本文描述了用于基于页面的文档布局的改进的搜索技术的系统,方法和/或技术(“工具”)。 工具可以分析为源文档中的页面定义的标记元素,并且可以确定页面的标记元素是否可以包括搜索字符串的至少一部分。

    Detection of lists in vector graphics documents
    28.
    发明申请
    Detection of lists in vector graphics documents 有权
    检测矢量图形文件中的列表

    公开(公告)号:US20070185837A1

    公开(公告)日:2007-08-09

    申请号:US11351065

    申请日:2006-02-09

    IPC分类号: G06F17/30 G06F17/00

    摘要: Various technologies and techniques detect lists in vector graphics based documents and use them in meaningful ways. The system detects at least one list in a vector graphics based document using a set of rules. Pattern detection logic identifies characters, symbols, numbers, letters, and/or images that may start a list. Additional pattern detection logic determines if a list exists. The system can identify and parse bulleted lists, numbered or lettered lists, and nested lists that are any combination of both. Once identified, the content is translated into a modified format. The content can be output to a destination application in the modified format that is more suitable for output or use by the destination application.

    摘要翻译: 各种技术和技术检测基于矢量图形的文档中的列表,并以有意义的方式使用它们。 系统使用一组规则来检测基于矢量图形的文档中的至少一个列表。 模式检测逻辑可识别可能启动列表的字符,符号,数字,字母和/或图像。 附加模式检测逻辑确定列表是否存在。 系统可以识别和解析项目符号列表,编号或字母列表,以及两者的任意组合的嵌套列表。 一旦识别,内容将被转换为修改格式。 内容可以以更适合于目的地应用程序输出或使用的修改格式输出到目标应用程序。

    Creation of semantic objects for providing logical structure to markup language representations of documents
    29.
    发明申请
    Creation of semantic objects for providing logical structure to markup language representations of documents 失效
    创建语义对象,以提供逻辑结构来标记文档的语言表示

    公开(公告)号:US20070136660A1

    公开(公告)日:2007-06-14

    申请号:US11302639

    申请日:2005-12-14

    IPC分类号: G06F17/00 G06F7/00

    摘要: Semantic objects are created that provide a structure for markup language representations of documents. The semantic objects include text runs that are produced from the markup language representation and that are placed into semantic blocks that group text runs according to how text is logically structured in the document being represented. The text runs of each semantic block are ordered to correspond to the logical order of the document being represented. The semantic blocks corresponding to each page of the document being represented are ordered to correspond to the logical order of the document being represented. The ordered semantic blocks including the ordered text runs are saved as a semantic object which can they be utilized to make use of the logical structure of the document being represented by the markup language.

    摘要翻译: 创建语义对象,为文档的标记语言表示提供结构。 语义对象包括从标记语言表示产生的文本运行,并且被放置到语义块中,该语义块根据文本在正在表示的文档中的逻辑结构如何运行。 每个语义块的文本运行被排序以对应于正在表示的文档的逻辑顺序。 对应于正在表示的文档的每个页面的语义块被排序以对应于正在表示的文档的逻辑顺序。 包括有序文本运行的有序语义块被保存为语义对象,它们可以被利用来利用由标记语言表示的文档的逻辑结构。

    Methods and systems for providing index data for print job data
    30.
    发明申请
    Methods and systems for providing index data for print job data 审中-公开
    为打印作业数据提供索引数据的方法和系统

    公开(公告)号:US20060209334A1

    公开(公告)日:2006-09-21

    申请号:US11080371

    申请日:2005-03-15

    IPC分类号: G06F3/12

    摘要: Various embodiments develop (and consume), along with rendered print job data, metadata that describes certain characteristics of the print job data. This metadata can be provided, along with the rendered data, from a client device to a print server and can allow the print server to ascertain the nature or context of the print job data. In some embodiments, the metadata can describe such things as page boundaries and state transition data. By ascertaining the nature or context of the print job data, the print server is able to intelligently act upon this information and, in at least some embodiments, implement additional print server features that would not be possible if only rendered data were sent to the print server.

    摘要翻译: 各种实施例开发(和消耗)以及所渲染的打印作业数据,描述打印作业数据的某些特性的元数据。 该元数据可以与呈现的数据一起从客户端设备提供给打印服务器,并且可以允许打印服务器确定打印作业数据的性质或上下文。 在一些实施例中,元数据可以描述诸如页面边界和状态转换数据的事物。 通过确定打印作业数据的性质或上下文,打印服务器能够智能地对该信息采取行动,并且在至少一些实施例中实现如果仅将呈现的数据发送到打印机,那么将不可能实现附加的打印服务器特征 服务器。