Selectively deleting clusters of conceptually related words from a generative model for text
    1.
    Granted patent
    Selectively deleting clusters of conceptually related words from a generative model for text (status: in force)

    Publication number: US07877371B1

    Publication date: 2011-01-25

    Application number: US11703582

    Filing date: 2007-02-07

    IPC classification: G06F17/30

    CPC classification: G06F17/3071

    Abstract: One embodiment of the present invention provides a system that selectively deletes clusters of conceptually-related words from a probabilistic generative model for textual documents. During operation, the system receives a current model, which contains terminal nodes representing random variables for words and contains one or more cluster nodes representing clusters of conceptually related words. Nodes in the current model are coupled together by weighted links, so that if an incoming link from a node that has fired causes a cluster node to fire with a probability proportionate to a weight of the incoming link, an outgoing link from the cluster node to another node causes the other node to fire with a probability proportionate to the weight of the outgoing link. Next, the system processes a given cluster node in the current model for possible deletion. This involves determining a number of outgoing links from the given cluster node to terminal nodes or cluster nodes in the current model. If the determined number of outgoing links is less than a minimum value, or if the frequency with which the given cluster node fires is less than a minimum frequency, the system deletes the given cluster node from the current model.

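    Illustrative sketch (not part of the patent record): the deletion test in the abstract above can be written as a small filter over cluster nodes. The class name `ClusterNode`, the functions `should_delete` and `prune_clusters`, and the way firing frequency is stored are all hypothetical; the abstract does not say how the model is represented or how the thresholds are chosen.

```python
from dataclasses import dataclass, field


@dataclass
class ClusterNode:
    """Hypothetical stand-in for a cluster node in the generative model."""
    name: str
    # Outgoing weighted links: target node name -> link weight.
    outgoing: dict = field(default_factory=dict)
    # Assumed to be a precomputed rate at which this cluster fires.
    firing_frequency: float = 0.0


def should_delete(node, min_links, min_frequency):
    """Apply the two conditions named in the abstract.

    A cluster is deleted if it has fewer outgoing links than `min_links`
    OR if it fires less often than `min_frequency`.
    """
    too_few_links = len(node.outgoing) < min_links
    fires_too_rarely = node.firing_frequency < min_frequency
    return too_few_links or fires_too_rarely


def prune_clusters(clusters, min_links, min_frequency):
    """Return a new dict holding only the clusters that survive the test."""
    return {
        name: node
        for name, node in clusters.items()
        if not should_delete(node, min_links, min_frequency)
    }


clusters = {
    "pets": ClusterNode("pets", {"dog": 0.5, "cat": 0.2, "vet": 0.1}, 0.05),
    "sparse": ClusterNode("sparse", {"dog": 0.9}, 0.5),
    "rare": ClusterNode("rare", {"a": 0.1, "b": 0.2, "c": 0.3}, 0.0001),
}
survivors = prune_clusters(clusters, min_links=2, min_frequency=0.001)
print(sorted(survivors))
```

    This sketch leaves links that point at a deleted cluster untouched; the abstract does not describe how such links are handled, so a real model would need its own cleanup step.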

    OPEN ENTITY EXTRACTION SYSTEM
    2.
    Patent application
    OPEN ENTITY EXTRACTION SYSTEM (status: under examination, published)

    Publication number: US20100131529A1

    Publication date: 2010-05-27

    Application number: US12324737

    Filing date: 2008-11-26

    IPC classification: G06F7/06 G06F17/30

    CPC classification: G06F16/9577 G06F16/951

    Abstract: Methods, computer program products, and systems related to providing gadgets that generate content based on entities extracted according to patterns defined by extractors are provided. A plurality of distinct extractors that define patterns for identifying entities in text are received from a plurality of users. The extractors are stored in a repository. The pattern defined by each of the extractors is processed into a pattern matching engine. The extractors are made available for subscription from a first user subscribing to a first extractor. A modification indication is received from a composition program regarding a first document of a first user, and in response to receiving the modification indication, the pattern matching engine corresponding to the first extractor is applied to the first document and identifies a first entity. The first entity is provided to a first software gadget that presents information relating to the first entity to the user.

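    Illustrative sketch (not part of the patent record): the flow in the abstract above (store extractors, turn each pattern into a matching engine, let users subscribe, run the subscribed engines when a document changes, hand entities to a gadget) might look like the following. Using Python regular expressions as the "pattern matching engine" is an assumption, as are the names `ExtractorRepository`, `on_document_modified`, and `flight_gadget`.

```python
import re


class ExtractorRepository:
    """Hypothetical repository of user-contributed extractors."""

    def __init__(self):
        self._engines = {}        # extractor name -> compiled regex (assumed engine)
        self._subscriptions = {}  # user -> set of extractor names

    def register(self, name, pattern):
        # "Processing the pattern into a pattern matching engine" is
        # modelled here as compiling a regular expression.
        self._engines[name] = re.compile(pattern)

    def subscribe(self, user, name):
        if name not in self._engines:
            raise KeyError(f"unknown extractor: {name}")
        self._subscriptions.setdefault(user, set()).add(name)

    def on_document_modified(self, user, text):
        """Run the user's subscribed engines and return (extractor, entity) pairs."""
        found = []
        for name in sorted(self._subscriptions.get(user, ())):
            for match in self._engines[name].finditer(text):
                found.append((name, match.group(0)))
        return found


def flight_gadget(entity):
    """Hypothetical gadget: presents information about one extracted entity."""
    return f"Flight status lookup for {entity}"


repo = ExtractorRepository()
repo.register("flight", r"\b[A-Z]{2}\d{3,4}\b")
repo.subscribe("alice", "flight")

entities = repo.on_document_modified("alice", "Meet me after UA123 lands")
gadget_output = [flight_gadget(entity) for _, entity in entities]
print(gadget_output)
```

    Keying subscriptions by user keeps one user's extractors from running on another user's documents, which matches the "first user, first extractor, first document" pairing in the abstract.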

    Database replication
    4.
    Granted patent
    Database replication (status: in force)

    Publication number: US09002793B1

    Publication date: 2015-04-07

    Application number: US13645989

    Filing date: 2012-10-05

    IPC classification: G06F17/30 G06F12/08

    Abstract: A write request is received at a database server from a client application for writing data to persistent data storage. In response to receiving the write request, the database server selects a set of multiple replication servers. The data is sent from the database server to the selected set of multiple replication servers for writing to the persistent data storage. Confirmation is received at the database server from replication servers in the selected set of multiple replication servers. In response to receiving confirmation from the replication servers in the selected set of multiple replication servers, the database server sends to the client application information indicating success of the write request.

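    Illustrative sketch (not part of the patent record): the write path in the abstract above can be simulated in memory. Two points are assumptions rather than anything the abstract states: the replication set is picked at random, and success is reported only when every selected server confirms. The class names `ReplicationServer` and `DatabaseServer` are hypothetical.

```python
import random


class ReplicationServer:
    """Hypothetical replica that writes to its own in-memory 'persistent' store."""

    def __init__(self, name, healthy=True):
        self.name = name
        self.healthy = healthy
        self.storage = {}

    def write(self, key, value):
        """Store the value and return True as confirmation; False if unhealthy."""
        if not self.healthy:
            return False
        self.storage[key] = value
        return True


class DatabaseServer:
    """Hypothetical database server that fans a write out to a replica set."""

    def __init__(self, replicas, set_size, rng=None):
        if not 1 <= set_size <= len(replicas):
            raise ValueError("set_size must be between 1 and the replica count")
        self.replicas = list(replicas)
        self.set_size = set_size
        self._rng = rng or random.Random()

    def handle_write(self, key, value):
        # Step 1: select a set of multiple replication servers (random: assumed).
        selected = self._rng.sample(self.replicas, self.set_size)
        # Step 2: send the data to each selected server and collect confirmations.
        confirmations = [server.write(key, value) for server in selected]
        # Step 3: report success only if all selected servers confirmed (assumed).
        return all(confirmations)


replicas = [ReplicationServer(f"r{i}") for i in range(3)]
db = DatabaseServer(replicas, set_size=2, rng=random.Random(0))
ok = db.handle_write("user:1", "Ada")
copies = sum(1 for r in replicas if r.storage.get("user:1") == "Ada")
print(ok, copies)
```

    A quorum rule (success once a majority of the selected set confirms) would fit the abstract's wording equally well; the all-confirm rule is used here only because it is the simplest to read.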