Abstract:
One embodiment of the present invention provides a system that selectively deletes clusters of conceptually related words from a probabilistic generative model for textual documents. During operation, the system receives a current model, which contains terminal nodes representing random variables for words and one or more cluster nodes representing clusters of conceptually related words. Nodes in the current model are coupled together by weighted links: an incoming link from a node that has fired causes a cluster node to fire with a probability proportionate to the weight of the incoming link, and once the cluster node has fired, an outgoing link from the cluster node to another node causes the other node to fire with a probability proportionate to the weight of the outgoing link. Next, the system processes a given cluster node in the current model for possible deletion. This involves determining the number of outgoing links from the given cluster node to terminal nodes or cluster nodes in the current model. If the determined number of outgoing links is less than a minimum value, or if the frequency with which the given cluster node fires is less than a minimum frequency, the system deletes the given cluster node from the current model.
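The deletion test described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the `ClusterNode` class, the `firing_frequency` field, and the threshold parameter names are hypothetical, and the sketch assumes the firing frequency has already been measured elsewhere. It also leaves any links that pointed at a deleted cluster untouched, since the abstract does not say how those are handled.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class ClusterNode:
    """Hypothetical stand-in for a cluster node in the current model."""
    name: str
    # Target node name -> link weight; targets may be terminal or cluster nodes.
    outgoing: Dict[str, float] = field(default_factory=dict)
    # Assumed statistic: how often this cluster fires, measured elsewhere.
    firing_frequency: float = 0.0


def should_delete(node: ClusterNode, min_links: int, min_frequency: float) -> bool:
    """Apply the two criteria from the abstract: too few links OR fires too rarely."""
    too_few_links = len(node.outgoing) < min_links
    fires_too_rarely = node.firing_frequency < min_frequency
    return too_few_links or fires_too_rarely


def prune_clusters(clusters: Dict[str, ClusterNode],
                   min_links: int,
                   min_frequency: float) -> Dict[str, ClusterNode]:
    """Return a new mapping containing only the cluster nodes that survive."""
    return {
        name: node
        for name, node in clusters.items()
        if not should_delete(node, min_links, min_frequency)
    }
```

For example, with `min_links=2` and `min_frequency=0.01`, a cluster with a single outgoing link is dropped regardless of how often it fires, and a cluster with many links is dropped if its firing frequency is 0.001.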
Abstract:
Methods, computer program products, and systems related to providing gadgets that generate content based on entities extracted according to patterns defined by extractors are provided. A plurality of distinct extractors that define patterns for identifying entities in text are received from a plurality of users. The extractors are stored in a repository. The pattern defined by each of the extractors is processed into a pattern matching engine. The extractors are made available for subscription, and a subscription to a first extractor is received from a first user. A modification indication is received from a composition program regarding a first document of the first user, and in response to receiving the modification indication, the pattern matching engine corresponding to the first extractor is applied to the first document and identifies a first entity. The first entity is provided to a first software gadget that presents information relating to the first entity to the user.
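The flow above (store extractors, compile each pattern into a matching engine, accept subscriptions, then run the subscribed engines when a document changes) might be sketched as follows. This is only an illustration under assumptions: it treats each extractor pattern as a regular expression compiled with Python's `re` module, and the `ExtractorRepository` class, the `order_gadget` function, and the `ORD-` identifier format are all hypothetical names that do not come from the abstract.

```python
import re
from collections import defaultdict


class ExtractorRepository:
    """Hypothetical repository of user-contributed extractors."""

    def __init__(self):
        self._engines = {}                      # extractor name -> compiled pattern
        self._subscriptions = defaultdict(set)  # user -> subscribed extractor names

    def register(self, name, pattern):
        # "Processing the pattern into a pattern matching engine" is assumed
        # here to mean compiling a regular expression.
        self._engines[name] = re.compile(pattern)

    def subscribe(self, user, name):
        if name not in self._engines:
            raise KeyError(f"unknown extractor: {name}")
        self._subscriptions[user].add(name)

    def on_document_modified(self, user, text):
        """Apply the user's subscribed engines; return (extractor, entity) pairs."""
        found = []
        for name in sorted(self._subscriptions[user]):
            for match in self._engines[name].finditer(text):
                found.append((name, match.group(0)))
        return found


def order_gadget(entity):
    """Hypothetical gadget: presents information relating to the entity."""
    return f"Order details for {entity}"


repo = ExtractorRepository()
repo.register("order_id", r"\bORD-\d{6}\b")
repo.subscribe("alice", "order_id")
entities = repo.on_document_modified("alice", "Please check ORD-123456 today.")
gadget_output = [order_gadget(entity) for _, entity in entities]
```

In a real system the modification indication would come from the composition program (for instance an editor) rather than a direct method call, and the gadget would render richer content than a string.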
Abstract:
A system and method of voice communication with a machine are provided. The system includes a guide for containing at least one input element disposed in an arrangement, the arrangement having a coordinate system for locating the input element, and a processor for processing a user selection of the input element.
Abstract:
A write request is received at a database server from a client application for writing data to persistent data storage. In response to receiving the write request, the database server selects a set of multiple replication servers. The data is sent from the database server to the selected set of multiple replication servers for writing to the persistent data storage. Confirmation is received at the database server from replication servers in the selected set of multiple replication servers. In response to receiving confirmation from the replication servers in the selected set of multiple replication servers, the database server sends to the client application information indicating success of the write request.
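The write path described above can be sketched in a few lines of Python. This is a simplified, in-memory illustration and not the claimed system: the class names are hypothetical, replicas are chosen at random (the abstract does not state a selection policy), and success is reported only when every selected replica confirms, which is one possible reading of "confirmation from the replication servers in the selected set".

```python
import random


class ReplicationServer:
    """Hypothetical replica; a dict stands in for persistent data storage."""

    def __init__(self, name):
        self.name = name
        self.storage = {}

    def write(self, key, value):
        self.storage[key] = value
        return True  # confirmation sent back to the database server


class DatabaseServer:
    """Hypothetical database server that fans a write out to selected replicas."""

    def __init__(self, replicas, replication_factor, rng=None):
        if replication_factor > len(replicas):
            raise ValueError("replication_factor exceeds number of replicas")
        self._replicas = list(replicas)
        self._replication_factor = replication_factor
        self._rng = rng or random.Random()

    def handle_write(self, key, value):
        # Step 1: select a set of multiple replication servers (random here,
        # which is an assumption of this sketch).
        selected = self._rng.sample(self._replicas, self._replication_factor)
        # Step 2: send the data to each selected replica and collect confirmations.
        confirmations = [replica.write(key, value) for replica in selected]
        # Step 3: tell the client whether the write succeeded.
        return {
            "success": all(confirmations),
            "replicas": [replica.name for replica in selected],
        }
```

A client would call `handle_write("user:1", "data")` and inspect the `success` field of the result. A production design would also need timeouts, retries, and handling of partial confirmation, none of which the abstract specifies.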