摘要:
After receipt of a training and execution plan, a trainer operator is automatically trained based on specified training documents so as to generate a new trained operator for extracting information from documents. The new trained operator is a new version of the trainee operator. Both trainee operators are automatically retained for later use in extracting information from one or more unknown documents. After receipt of the training and execution plan, the new trained operator is automatically executed on one or more unknown documents so as to extract information from such one or more unknown documents.
摘要:
Disclosed are methods and apparatus for extracting information from one or more documents. A training and execution plan is received, and such plan specifies invocation of a trainer operator for initiating training of a trainee operator based on a set of training documents so as to generate a new trained operator that is to then be invoked so as to extract information from one or more unknown documents. The trainee operator is configured to extract information from one or more unknown documents, and each training document is associated with classified information. After receipt of the training and execution plan, the trainer operator is automatically executed to train the trainee operator based on the specified training documents so as to generate a new trained operator for extracting information from documents. The new trained operator is a new version of the trainee operator. After receipt of the training and execution plan, both the trainee operator are automatically retained for later use in extracting information from one or more unknown documents and the new trained operator for later use in extracting information from one or more unknown documents. After receipt of the training and execution plan, the new trained operator is automatically executed on one or more unknown documents so as to extract information from such one or more unknown documents.
摘要:
A system and method is described for large scale entity-specific classification of each entity-specific set of candidates in a collection of candidates for each specific entity in a collection of entities. The collection of entities may comprise a specific category or domain of entities (e.g. schools, restaurants, manufacturers, products, events, people). Candidates may comprise webpages or other resources with resource identifiers. Entity specific sets of candidates may be found by leveraging search engine query results and user interaction therewith for queries based on entity-specific attributes. The relationship(s) or class(es) for which candidate resources are being classified relative to a specific entity may comprise an authoritative, official home page (OHP), or other class (e.g. fan page, review, aggregator) relative to a specific entity. A feature generator generates entity-specific features for candidates. In accordance with its features, one or more classifiers rank each candidate for a specific class for a specific entity.
摘要:
A system and method is described for large scale entity-specific classification of each entity-specific set of candidates in a collection of candidates for each specific entity in a collection of entities. The collection of entities may comprise a specific category or domain of entities (e.g. schools, restaurants, manufacturers, products, events, people). Candidates may comprise webpages or other resources with resource identifiers. Entity specific sets of candidates may be found by leveraging search engine query results and user interaction therewith for queries based on entity-specific attributes. The relationship(s) or class(es) for which candidate resources are being classified relative to a specific entity may comprise an authoritative, official home page (OHP), or other class (e.g. fan page, review, aggregator) relative to a specific entity. A feature generator generates entity-specific features for candidates. In accordance with its features, one or more classifiers rank each candidate for a specific class for a specific entity.
摘要:
Method, system, and programs for providing one or more explanations. An inquiry is received via a communication platform where the inquiry is about how a set of entities are related. Information is retrieved from a knowledge storage in accordance with the set of entities and such information describes a plurality of entities and relationships existing among the plurality of entities. Based on such retrieved information, one or more explanations with respect to each relationship by which the set of entities are connected are generated. The one or more explanations are then transmitted as a response to the inquiry.
摘要:
A method for hierarchically controlling an access multicast group is disclosed, which divides the access authority control hierarchies of the multicast group and configures control rules for each authority control hierarchy. The method includes: performing authority control on an accessing user in a present authority control hierarchy according to the configured control rules, and if the user does not pass the authority control, then rejecting the user accessing the multicast group requested by the user; if the user passes the authority control, then going into the next authority control hierarchy to perform the authority control on the accessing user until accessing all the configured authority control hierarchies. Accordingly, a device for hierarchically controlling an access multicast group is provided, which includes: a division module, a control module, and a triggering module. Thus, the method and the device can hierarchically and flexibly control the on-demand multicast group of a user.
摘要:
Methods and apparatus for making recommendations of content items to users of computer systems include compiling a database relating a list of items and corresponding explanations; receiving from a user, through a computer user interface, a request for a recommendation; extracting from the database a preliminary list of items related to the request; identifying distances between the extracted items based on the explanation corresponding to each item; and identifying a subset of the preliminary list to form a recommendation list having a limited number of recommendation results with a desired balance of both high relevancy and high diversity relative to each other.
摘要:
Methods and systems are provided for providing recommendations to users of a computer-based network of items of potential interest to the users. Items and people of potential interest to users may be determined using obtained word-based social behavior information, semantically-sensitive vocabulary taxonomies, and determined implied topic-specific social networks. The user may be presented with a graphical user interface including the recommendation, an explanation of the rationale relating to the recommendation, and an opportunity for the user to provide feedback relating to the recommendation or the rationale. The feedback may be used to improve future recommendations.
摘要:
A method for generating a schema mapping. A provided mapping M12 relates schema S1 to schema S2. A provided mapping M23 relates schema S2 to schema S3. A mapping M13 is generated from schema S1 to schema S3 as a composition of mappings M12 and M23. Mappings M12, M23, and M13 are each expressed in terms of at least one second-order nested tuple-generating dependency (SO nested tgd). Mapping M13 does not expressly recite any element of schema S2. At least one schema of the schemas S1 and S2 may comprise at least one complex type expression nested inside another complex type expression. Mapping M13 may define the composition of the mappings M12 and M23 with respect to a relationship semantics or a transformation semantics.
摘要:
An improved method for information retrieval in web query and recommendation systems, where items that are likely unfamiliar to the users of the system, but potentially relevant, are recommended. In a recommendation system having ratings by a plurality of users for a plurality of items, items are assigned to one or more data regions based on item attributes or user activity. Source regions are identified for each of the data regions. For a given user, data regions with which both the user and the user's social network are unfamiliar are identified. Within a given data region, the relevance of items to the user within such regions is evaluated using ratings provided by other users who have entered ratings similar to the user in source regions for the data region. Items receiving the highest relevance score are recommended to the user.