Method of searching and extracting text information from drawings
    1.
    Granted invention patent
    Status: no longer in force

    Publication No.: US5995659A

    Publication date: 1999-11-30

    Application No.: US926217

    Filing date: 1997-09-09

    IPC classification: G06K9/20 G06K9/34

    CPC classification: G06K9/00456

    Abstract: A method for recognizing text in graphical drawings includes creating a binarized representation of a drawing to form an electronic image of pixels. Text regions are discriminated from lines in the image by grouping pixels into blocks and comparing the blocks with a predetermined format to identify the text regions. Lines that remain in the text regions are removed to create text-only regions, and the text is recognized in those text-only regions.

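    The block-grouping step in the abstract above can be illustrated with a minimal sketch. The abstract does not say how pixels are grouped or what the "predetermined format" is, so everything here is an assumption made for illustration: blocks are taken to be 4-connected components of foreground pixels, and the format is a hypothetical maximum bounding-box size (`max_height`, `max_width`). The function names are likewise hypothetical and are not taken from the patent.

```python
from collections import deque


def connected_blocks(image):
    """Group foreground pixels (value 1) into 4-connected blocks.

    Returns one bounding box (top, left, bottom, right) per block.
    Assumption: connectivity-based grouping; the patent abstract does not
    specify the grouping rule.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and image[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def classify_blocks(boxes, max_height=5, max_width=5):
    """Split blocks into text candidates and lines by bounding-box size.

    Assumption: a block no larger than max_height x max_width counts as
    text; anything larger is treated as a drawing line.
    """
    text, lines = [], []
    for top, left, bottom, right in boxes:
        height = bottom - top + 1
        width = right - left + 1
        if height <= max_height and width <= max_width:
            text.append((top, left, bottom, right))
        else:
            lines.append((top, left, bottom, right))
    return text, lines


# Binarized toy drawing: a long horizontal line and one small glyph-like blob.
image = [
    [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0, 0, 0, 0, 0],
]
text_blocks, line_blocks = classify_blocks(connected_blocks(image))
```

    In this toy input the ten-pixel-wide row exceeds the assumed size limit and is classified as a line, while the 2x2 blob is kept as a text candidate. A real system would still need the later steps the abstract names: removing lines that cross text regions and running character recognition.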

    Automatic validation method for multimedia product manuals

    Publication No.: US07120642B2

    Publication date: 2006-10-10

    Application No.: US10237453

    Filing date: 2002-09-09

    IPC classification: G06F17/30 G06F17/00

    Abstract: A Product Document Constraint Specification Language (PDCSL) is provided for a document author to represent various types of documentation guidelines that must be enforced within documents or across different documents. A Document Constraint Analyzer (DCA) takes as input a set of document files together with a document constraint specification file, extracts and examines the contents, attributes, and relationships associated with the document objects, and evaluates the logical expressions specified in the document constraints. If a document constraint is not satisfied, an action can be taken to correct the documents or provide an explanation to the document author.
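
    The evaluation loop described in the abstract above can be sketched roughly as follows. The abstract gives no PDCSL syntax, so this sketch does not attempt to reproduce it: constraints are modeled as plain Python predicates, documents as dictionaries of object attributes, and every name here (`Constraint`, `analyze`, the file names, the sample rules) is hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Assumed data model: file name -> {object id -> attribute dict}.
Documents = Dict[str, Dict[str, dict]]


@dataclass
class Constraint:
    name: str
    check: Callable[[Documents], bool]  # logical expression over the documents
    explanation: str                    # shown to the author on failure


def analyze(documents: Documents, constraints: List[Constraint]) -> List[str]:
    """Evaluate each constraint and collect explanations for the failures."""
    return [
        f"{c.name}: {c.explanation}"
        for c in constraints
        if not c.check(documents)
    ]


documents = {
    "manual.xml": {
        "fig1": {"type": "figure", "caption": "Pump assembly", "part": "P-100"},
        "fig2": {"type": "figure", "caption": "", "part": "P-200"},
    },
    "parts.xml": {
        "P-100": {"type": "part"},
    },
}

constraints = [
    # Guideline enforced within one document.
    Constraint(
        name="figure-has-caption",
        check=lambda docs: all(
            obj["caption"]
            for obj in docs["manual.xml"].values()
            if obj["type"] == "figure"
        ),
        explanation="every figure in the manual needs a non-empty caption",
    ),
    # Guideline enforced across different documents.
    Constraint(
        name="part-is-listed",
        check=lambda docs: all(
            obj["part"] in docs["parts.xml"]
            for obj in docs["manual.xml"].values()
        ),
        explanation="every part referenced in the manual must appear in the parts list",
    ),
]

violations = analyze(documents, constraints)
```

    Here both sample constraints fail (`fig2` has an empty caption and refers to a part missing from `parts.xml`), so `violations` holds one explanation per rule. Automatic correction, which the abstract also mentions, is left out of the sketch.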

    Method and apparatus for extracting anchorable information units from complex PDF documents
    3.
    Granted invention patent
    Status: no longer in force

    Publication No.: US07013309B2

    Publication date: 2006-03-14

    Application No.: US09996271

    Filing date: 2001-11-28

    IPC classification: G06F16/30 G06F17/21

    Abstract: A method for extracting Anchorable Information Units (AIUs) from a Portable Document Format (PDF) file, which may be created either with an editor or by scanning in documents. The method includes parsing the PDF document into textual portions and non-text portions, and extracting structure from the textual portions and the non-text portions. The method further includes determining text within the textual portions and text within the non-text portions, and hyperlinking a plurality of keywords within the textual portions and non-text portions to a related document.

    System and method for GUI supported specifications for automating form field extraction with database mapping
    5.
    Granted invention patent
    Status: no longer in force

    Publication No.: US08095871B2

    Publication date: 2012-01-10

    Application No.: US11122867

    Filing date: 2005-05-05

    IPC classification: G06F17/00 G06F17/21

    CPC classification: G06F17/30917

    Abstract: A GUI (Graphical User Interface) supported specification method for form field extraction and database mapping in a computer system. The method includes converting a form file into a fixed electronic document format by using a GUI to specify the form file and conversion parameters; extracting fields from the fixed electronic document format by using a GUI to specify the fields to be extracted; and mapping the fields onto a database schema by using a GUI to specify the mapping between the fields and the database schema.

    Automated systems and methods to support electronic business transactions for spare parts
    6.
    Granted invention patent
    Status: no longer in force

    Publication No.: US07613638B2

    Publication date: 2009-11-03

    Application No.: US11216852

    Filing date: 2005-08-31

    IPC classification: G06F17/30

    Abstract: Systems and methods are provided for implementing electronic business applications for managing and selling spare parts. Electronic catalogs of spare parts are used to present static and/or real-time spare parts data from disparate backend data sources in a uniform, integrated manner. Business logic programs are provided to support transaction activities using the electronic catalogs of spare parts, such as navigating the catalog content, retrieving static and/or real-time spare parts data, and initiating spare parts sales with the backend business information systems and spare parts data source.

    Method for querying XML documents using a weighted navigational index
    7.
    Granted invention patent
    Status: in force

    Publication No.: US07370061B2

    Publication date: 2008-05-06

    Application No.: US11204061

    Filing date: 2005-08-15

    IPC classification: G06F17/00

    Abstract: A technique for optimizing the archival and management of data stored as XML documents is capable of handling mixed data including highly structured data and unstructured data. The technique maps the structured data to a relational database while storing the unstructured data in its native XML format. The data is updated using a rules database that maps updating rules against attributes and classes of elements within the documents. A document checking/validation engine performs the updates based on rule verification. A search engine searches the documents using both a path index table and a weighted content index.

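    The last sentence of the abstract above, searching with both a path index table and a weighted content index, can be illustrated with a small sketch. The abstract does not define how the weights are assigned or how the two indexes are combined, so the scheme below is an assumption: each word occurrence is weighted by the tag of the element containing it (the hypothetical `TAG_WEIGHTS` table), and the path index is used only to restrict results to documents that contain a given element path.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical weighting: words in <title> count three times as much.
TAG_WEIGHTS = {"title": 3.0}


def build_indexes(docs):
    """Build a path index (path -> doc ids) and a weighted content index
    (word -> {doc id -> accumulated weight}) from XML strings."""
    path_index = defaultdict(set)
    content_index = defaultdict(lambda: defaultdict(float))

    def walk(elem, prefix, doc_id):
        path = f"{prefix}/{elem.tag}"
        path_index[path].add(doc_id)
        for word in (elem.text or "").lower().split():
            content_index[word][doc_id] += TAG_WEIGHTS.get(elem.tag, 1.0)
        for child in elem:
            walk(child, path, doc_id)

    for doc_id, xml_text in docs.items():
        walk(ET.fromstring(xml_text), "", doc_id)
    return path_index, content_index


def search(path_index, content_index, term, path=None):
    """Rank documents containing `term` by weight, highest first.

    If `path` is given, keep only documents that contain that element path
    (the term itself may occur anywhere in the document).
    """
    scores = content_index.get(term.lower(), {})
    if path is not None:
        allowed = path_index.get(path, set())
        scores = {d: s for d, s in scores.items() if d in allowed}
    return sorted(scores, key=lambda d: (-scores[d], d))


docs = {
    "a": "<part><title>pump seal</title><note>spare</note></part>",
    "b": "<part><title>valve</title><note>pump housing pump</note></part>",
    "c": "<manual><title>pump guide</title></manual>",
}
path_index, content_index = build_indexes(docs)
ranked = search(path_index, content_index, "pump")
ranked_parts = search(path_index, content_index, "pump", path="/part/note")
```

    With these toy documents, `ranked` is `["a", "c", "b"]` (title hits outweigh two hits in a note, ties broken by document id) and `ranked_parts` is `["a", "b"]`. The relational mapping of structured data and the rule-driven updates described in the abstract are outside the scope of this sketch.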