Statistically driven sentence realizing method and apparatus
    1.
    发明申请
    Statistically driven sentence realizing method and apparatus 失效
    统计驱动句子实现方法和装置

    公开(公告)号:US20050234705A1

    公开(公告)日:2005-10-20

    申请号:US11152352

    申请日:2005-06-14

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2881

    摘要: A method of, and system for, generating a sentence from a semantic representation maps the semantic representation to an unordered set of syntactic nodes. Simplified generation grammar rules and statistical goodness measure values from a corresponding analysis grammar are then used to create a tree structure to order the syntactic nodes. The sentence is then generated from the tree structure. The generation grammar is a simplified (context free) version of a corresponding full (context sensitive) analysis grammar. In the generation grammar, conditions on each rule are ignored except those directly related to the semantic representation. The statistical goodness measure values, which are calculated through an analysis training phase in which a corpus of example sentences is processed using the full analysis grammar, are used to guide the generation choice to prefer substructures most commonly found in a particular syntactic/semantic context during analysis.

    摘要翻译: 用于从语义表示生成句子的方法和系统将语义表示映射到无序的语法节点集合。 然后,使用来自对应分析语法的简化生成语法规则和统计学好的度量值来创建树结构来排序句法节点。 然后从树结构生成句子。 生成语法是相应的完整(上下文相关)分析语法的简化(上下文自由)版本。 在生成语法中,忽略每个规则的条件,除了与语义表示直接相关的条件。 通过使用完整分析语法处理例句的语料库的分析训练阶段计算出的统计学好度量值被用于指导生成选择以优选在特定句法/语义语境中最常见的子结构 分析。

    Creating a document index from a flex- and Yacc-generated named entity recognizer
    2.
    发明申请
    Creating a document index from a flex- and Yacc-generated named entity recognizer 审中-公开
    从flex和Yacc生成的命名实体识别器创建文档索引

    公开(公告)号:US20060047691A1

    公开(公告)日:2006-03-02

    申请号:US10954610

    申请日:2004-09-30

    IPC分类号: G06F17/00

    CPC分类号: G06F17/278

    摘要: Methods of constructing a document index including named entity information generated by at least one tool associated with parsing computer programs are presented. The methods include using a lexical analyzer generator, e.g. Flex, and/or a parser generator, e.g. Yacc, to generate named entity recognizers. The named entity recognizers are used to identify named entities in documents, in particular, very large document sets such as web pages available on the Internet. The identified named entities are stored as named entity annotations in the document index. Also, methods of performing searches using the document index are presented. The searches are performed based on queries that can be received on an application programming interface (API). Relevant documents are obtained using the named entity annotations, which can be returned across the API. Also presented are associated computer readable media.

    摘要翻译: 提出了构建包括由与解析计算机程序相关联的至少一个工具生成的命名实体信息的文档索引的方法。 这些方法包括使用词法分析器生成器,例如, Flex,和/或解析器生成器,例如。 Yacc,以生成命名实体识别器。 命名实体识别器用于识别文档中的命名实体,特别是非常大的文档集,如Internet上可用的网页。 识别的命名实体作为命名实体注释存储在文档索引中。 此外,呈现使用文档索引执行搜索的方法。 搜索是基于可以在应用程序编程接口(API)上接收的查询来执行的。 使用命名实体注释获取相关文档,可以通过API返回。 还提供了相关联的计算机可读介质。

    Rendering tables with natural language commands
    4.
    发明申请
    Rendering tables with natural language commands 有权
    使用自然语言命令渲染表

    公开(公告)号:US20050210061A1

    公开(公告)日:2005-09-22

    申请号:US10804815

    申请日:2004-03-18

    IPC分类号: G06F17/30

    摘要: The present invention relates to a method of manipulating a software application and processing data stored in a data source. The method includes receiving a natural language input and analyze the natural language input to identify semantic information contained therein. Portions of the natural language input are associated with command objects and entity objects of a schema based on the semantic information and the natural language input. The method also includes rendering data from the data source in a table of columns and rows based on the schema and the associated portions of the natural language input.

    摘要翻译: 本发明涉及一种操纵软件应用程序和处理存储在数据源中的数据的方法。 该方法包括接收自然语言输入并分析自然语言输入以识别其中包含的语义信息。 基于语义信息和自然语言输入,自然语言输入的部分与模式的命令对象和实体对象相关联。 该方法还包括基于自然语言输入的模式和相关联的部分在列和行的表中呈现来自数据源的数据。

    SYSTEM FOR NORMALIZING A DISCOURSE REPRESENTATION STRUCTURE AND NORMALIZED DATA STRUCTURE
    7.
    发明申请
    SYSTEM FOR NORMALIZING A DISCOURSE REPRESENTATION STRUCTURE AND NORMALIZED DATA STRUCTURE 有权
    统一代表结构和正则化数据结构的系统

    公开(公告)号:US20080033715A1

    公开(公告)日:2008-02-07

    申请号:US11872435

    申请日:2007-10-15

    IPC分类号: G06F17/27

    CPC分类号: G06F17/279

    摘要: A system and method for normalizing a discourse representation structure (DRS) are provided. The elements of the structure are rewritten and sorted in a way such that structures which may appear different but are nonetheless equivalent can be associated with the same, normalized representation. Various embodiments can also include a data structure for a DRS. The DRS may be represented by an array of boxes, each having a set of elements which in turn has a predefined structure suitable for representing a wide variety of linguistic information.

    摘要翻译: 提供了一种用于规范语篇表示结构(DRS)的系统和方法。 结构的元素被重写和排序,使得可能看起来不同但是等同的结构可以与相同的归一化表示相关联。 各种实施例还可以包括用于DRS的数据结构。 DRS可以由一组框来表示,每个框具有一组元素,这些元素又具有适于表示各种各样的语言信息的预定义结构。

    Method and apparatus for determining unbounded dependencies during syntactic parsing

    公开(公告)号:US07113905B2

    公开(公告)日:2006-09-26

    申请号:US10028228

    申请日:2001-12-20

    IPC分类号: G06F17/27 G06F17/28

    CPC分类号: G06F17/271

    摘要: A method is provided for identifying non-local relationships between licensing elements in a text segment and a word or phrase external to the text segment during a syntactic parse. Under the method, certain syntactic rules for combining words or phrases with text segments indicate that there is a possibility that the word or phrase being combined with the text segment will fill a gap in a relationship within the text segment. Based on this possibility, the text segment is searched to determine if there are any unfilled gaps in the text segment. Under some embodiments, if an unfilled gap is found, the location of the gap and the role the word or phrase plays in the gap are stored in a data structure associated with the syntactic node formed by combining the word or phrase with the text segment.