Apparatus and methods for operator training in information extraction
    1.
    发明授权
    Apparatus and methods for operator training in information extraction 有权
    信息提取操作员训练的装置和方法

    公开(公告)号:US08412652B2

    公开(公告)日:2013-04-02

    申请号:US12398126

    申请日:2009-03-04

    IPC分类号: G06F15/18

    CPC分类号: G09B19/00

    摘要: After receipt of a training and execution plan, a trainer operator is automatically trained based on specified training documents so as to generate a new trained operator for extracting information from documents. The new trained operator is a new version of the trainee operator. Both trainee operators are automatically retained for later use in extracting information from one or more unknown documents. After receipt of the training and execution plan, the new trained operator is automatically executed on one or more unknown documents so as to extract information from such one or more unknown documents.

    摘要翻译: 在接收到训练和执行计划之后,训练员操作员将根据指定的培训文件自动进行培训,以便生成一个新的训练有素的操作员,从文档中提取信息。 新受过训练的操作员是受训操作员的新版本。 两名学员操作员都会自动保留以供以后使用,从一个或多个未知文件中提取信息。 在接收到训练和执行计划之后,新的受过训练的操作者被自动执行一个或多个未知文件,以从这样的一个或多个未知文件中提取信息。

    APPARATUS AND METHODS FOR OPERATOR TRAINING IN INFORMATION EXTRACTION
    2.
    发明申请
    APPARATUS AND METHODS FOR OPERATOR TRAINING IN INFORMATION EXTRACTION 有权
    信息提取中操作员培训的装置和方法

    公开(公告)号:US20100227301A1

    公开(公告)日:2010-09-09

    申请号:US12398126

    申请日:2009-03-04

    IPC分类号: G09B19/00

    CPC分类号: G09B19/00

    摘要: Disclosed are methods and apparatus for extracting information from one or more documents. A training and execution plan is received, and such plan specifies invocation of a trainer operator for initiating training of a trainee operator based on a set of training documents so as to generate a new trained operator that is to then be invoked so as to extract information from one or more unknown documents. The trainee operator is configured to extract information from one or more unknown documents, and each training document is associated with classified information. After receipt of the training and execution plan, the trainer operator is automatically executed to train the trainee operator based on the specified training documents so as to generate a new trained operator for extracting information from documents. The new trained operator is a new version of the trainee operator. After receipt of the training and execution plan, both the trainee operator are automatically retained for later use in extracting information from one or more unknown documents and the new trained operator for later use in extracting information from one or more unknown documents. After receipt of the training and execution plan, the new trained operator is automatically executed on one or more unknown documents so as to extract information from such one or more unknown documents.

    摘要翻译: 公开了用于从一个或多个文档中提取信息的方法和装置。 接收到训练和执行计划,并且该计划规定了基于一组训练文件来引导训练者操作员启动对训练操作员的训练,以便产生一个新的经过训练的操作者,然后被调用以便提取信息 来自一个或多个未知文件。 受训操作员被配置为从一个或多个未知文档中提取信息,并且每个训练文档与分类信息相关联。 在收到培训和执行计划后,培训师操作员将根据指定的培训文件自动执行培训受训操作员,以便生成一个新的训练有素的操作员,从文档中提取信息。 新受过训练的操作员是受训操作员的新版本。 在接收到训练和执行计划之后,训练者操作员将被自动保留以便以后用于从一个或多个未知文件中提取信息,并且新训练的操作者用于随后用于从一个或多个未知文档中提取信息。 在接收到训练和执行计划之后,新的受过训练的操作者被自动执行一个或多个未知文件,以从这样的一个或多个未知文件中提取信息。

    LARGE SCALE ENTITY-SPECIFIC RESOURCE CLASSIFICATION
    3.
    发明申请
    LARGE SCALE ENTITY-SPECIFIC RESOURCE CLASSIFICATION 有权
    大规模实体特定资源分类

    公开(公告)号:US20110264651A1

    公开(公告)日:2011-10-27

    申请号:US12764694

    申请日:2010-04-21

    IPC分类号: G06F17/30 G06F15/18

    CPC分类号: G06F17/30867

    摘要: A system and method is described for large scale entity-specific classification of each entity-specific set of candidates in a collection of candidates for each specific entity in a collection of entities. The collection of entities may comprise a specific category or domain of entities (e.g. schools, restaurants, manufacturers, products, events, people). Candidates may comprise webpages or other resources with resource identifiers. Entity specific sets of candidates may be found by leveraging search engine query results and user interaction therewith for queries based on entity-specific attributes. The relationship(s) or class(es) for which candidate resources are being classified relative to a specific entity may comprise an authoritative, official home page (OHP), or other class (e.g. fan page, review, aggregator) relative to a specific entity. A feature generator generates entity-specific features for candidates. In accordance with its features, one or more classifiers rank each candidate for a specific class for a specific entity.

    摘要翻译: 描述了用于在实体集合中的每个特定实体的候选集合中的每个实体特定的候选者集合的大规模实体特定分类的系统和方法。 实体的收集可以包括实体(例如学校,餐馆,制造商,产品,事件,人)的特定类别或领域。 候选人可以包括具有资源标识符的网页或其他资源。 可以通过利用搜索引擎查询结果和与其进行用户交互来查找基于实体特定属性的查询来找到实体特定的候选者集合。 候选资源相对于特定实体被分类的关系或类可以包括权威的官方主页(OHP)或相对于特定实体的其他类(例如,粉丝专页,评论,聚合者) 实体。 特征生成器为候选者生成实体特定的特征。 根据其特征,一个或多个分类器为特定实体的特定类别的每个候选者排名。

    Large scale entity-specific resource classification
    4.
    发明授权
    Large scale entity-specific resource classification 有权
    大规模实体专有资源分类

    公开(公告)号:US09317613B2

    公开(公告)日:2016-04-19

    申请号:US12764694

    申请日:2010-04-21

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30867

    摘要: A system and method is described for large scale entity-specific classification of each entity-specific set of candidates in a collection of candidates for each specific entity in a collection of entities. The collection of entities may comprise a specific category or domain of entities (e.g. schools, restaurants, manufacturers, products, events, people). Candidates may comprise webpages or other resources with resource identifiers. Entity specific sets of candidates may be found by leveraging search engine query results and user interaction therewith for queries based on entity-specific attributes. The relationship(s) or class(es) for which candidate resources are being classified relative to a specific entity may comprise an authoritative, official home page (OHP), or other class (e.g. fan page, review, aggregator) relative to a specific entity. A feature generator generates entity-specific features for candidates. In accordance with its features, one or more classifiers rank each candidate for a specific class for a specific entity.

    摘要翻译: 描述了用于在实体集合中的每个特定实体的候选集合中的每个实体特定的候选者集合的大规模实体特定分类的系统和方法。 实体的收集可以包括实体(例如学校,餐馆,制造商,产品,事件,人)的特定类别或领域。 候选人可以包括具有资源标识符的网页或其他资源。 可以通过利用搜索引擎查询结果和与其进行用户交互来查找基于实体特定属性的查询来找到实体特定的候选者集合。 候选资源相对于特定实体被分类的关系或类可以包括权威的官方主页(OHP)或相对于特定实体的其他类(例如,粉丝专页,评论,聚合者) 实体。 特征生成器为候选者生成实体特定的特征。 根据其特征,一个或多个分类器为特定实体的特定类别的每个候选者排名。

    Display entity relationship
    5.
    发明授权
    Display entity relationship 有权
    显示实体关系

    公开(公告)号:US09043360B2

    公开(公告)日:2015-05-26

    申请号:US12972179

    申请日:2010-12-17

    IPC分类号: G06F17/30 G06N5/02

    摘要: Method, system, and programs for providing one or more explanations. An inquiry is received via a communication platform where the inquiry is about how a set of entities are related. Information is retrieved from a knowledge storage in accordance with the set of entities and such information describes a plurality of entities and relationships existing among the plurality of entities. Based on such retrieved information, one or more explanations with respect to each relationship by which the set of entities are connected are generated. The one or more explanations are then transmitted as a response to the inquiry.

    摘要翻译: 用于提供一个或多个解释的方法,系统和程序。 通过通信平台接收询问,其中查询是关于一组实体如何相关。 根据该组实体从知识存储器检索信息,并且这样的信息描述了存在于多个实体之间的多个实体和关系。 基于这种检索的信息,生成关于连接该组实体的每个关系的一个或多个解释。 然后作为对查询的响应来发送一个或多个解释。

    Method and Device for Hierarchically Controlling Accessed Multicast Group
    6.
    发明申请
    Method and Device for Hierarchically Controlling Accessed Multicast Group 审中-公开
    层次控制访问组播组的方法和设备

    公开(公告)号:US20120140771A1

    公开(公告)日:2012-06-07

    申请号:US13384321

    申请日:2010-06-08

    申请人: Shuang Liu Cong Yu

    发明人: Shuang Liu Cong Yu

    IPC分类号: H04L12/56

    摘要: A method for hierarchically controlling an access multicast group is disclosed, which divides the access authority control hierarchies of the multicast group and configures control rules for each authority control hierarchy. The method includes: performing authority control on an accessing user in a present authority control hierarchy according to the configured control rules, and if the user does not pass the authority control, then rejecting the user accessing the multicast group requested by the user; if the user passes the authority control, then going into the next authority control hierarchy to perform the authority control on the accessing user until accessing all the configured authority control hierarchies. Accordingly, a device for hierarchically controlling an access multicast group is provided, which includes: a division module, a control module, and a triggering module. Thus, the method and the device can hierarchically and flexibly control the on-demand multicast group of a user.

    摘要翻译: 公开了分层控制接入组播组的方法,其分割了组播组的接入权限控制层次,并为每个权限控制层次配置了控制规则。 该方法包括:根据配置的控制规则对当前权限控制层级中的访问用户执行权限控制,如果用户没有通过权限控制,则拒绝接入用户请求的组播组的用户; 如果用户通过权限控制,则进入下一个权限控制层级以对访问用户执行权限控制,直到访问所有配置的权限控制层次结构。 因此,提供了用于分级控制访问多播组的设备,其包括:分割模块,控制模块和触发模块。 因此,该方法和设备可以分层和灵活地控制用户的点播多播组。

    DIVERSIFYING RECOMMENDATION RESULTS THROUGH EXPLANATION
    7.
    发明申请
    DIVERSIFYING RECOMMENDATION RESULTS THROUGH EXPLANATION 有权
    通过解释推动推荐结果

    公开(公告)号:US20100235317A1

    公开(公告)日:2010-09-16

    申请号:US12403140

    申请日:2009-03-12

    IPC分类号: G06N5/02 G06F17/30 G06F17/10

    CPC分类号: G06F17/30864

    摘要: Methods and apparatus for making recommendations of content items to users of computer systems include compiling a database relating a list of items and corresponding explanations; receiving from a user, through a computer user interface, a request for a recommendation; extracting from the database a preliminary list of items related to the request; identifying distances between the extracted items based on the explanation corresponding to each item; and identifying a subset of the preliminary list to form a recommendation list having a limited number of recommendation results with a desired balance of both high relevancy and high diversity relative to each other.

    摘要翻译: 向计算机系统用户提供内容项目建议的方法和装置包括编辑与项目列表和相应说明相关的数据库; 从用户接收通过计算机用户界面的建议请求; 从数据库提取与请求相关的项目的初步列表; 基于与每个项目相对应的说明来识别所提取的项目之间的距离; 以及识别所述初步列表的子集以形成具有有限数量的推荐结果的推荐列表,其具有相对于彼此的高相关性和高分集之间的期望平衡。

    Recommendation System Using Social Behavior Analysis and Vocabulary Taxonomies
    8.
    发明申请
    Recommendation System Using Social Behavior Analysis and Vocabulary Taxonomies 有权
    推荐系统使用社会行为分析和词汇分类法

    公开(公告)号:US20090164897A1

    公开(公告)日:2009-06-25

    申请号:US11961599

    申请日:2007-12-20

    IPC分类号: G06F3/00

    CPC分类号: G06F17/30867 G06F3/00

    摘要: Methods and systems are provided for providing recommendations to users of a computer-based network of items of potential interest to the users. Items and people of potential interest to users may be determined using obtained word-based social behavior information, semantically-sensitive vocabulary taxonomies, and determined implied topic-specific social networks. The user may be presented with a graphical user interface including the recommendation, an explanation of the rationale relating to the recommendation, and an opportunity for the user to provide feedback relating to the recommendation or the rationale. The feedback may be used to improve future recommendations.

    摘要翻译: 提供了方法和系统,以向用户提供对用户潜在兴趣的项目的基于计算机的网络的建议。 可以使用获得的基于字的社会行为信息,语义敏感的词汇分类法和确定的隐含的话题专用社交网络来确定用户潜在兴趣的项目和人。 可以向用户呈现图形用户界面,包括推荐,与推荐有关的理由的解释,以及用户提供与推荐或理由有关的反馈的机会。 反馈可用于改进今后的建议。

    Sequential composition of schema mappings
    9.
    发明申请
    Sequential composition of schema mappings 失效
    模式映射的顺序组合

    公开(公告)号:US20070168381A1

    公开(公告)日:2007-07-19

    申请号:US11334582

    申请日:2006-01-18

    IPC分类号: G06F7/00

    摘要: A method for generating a schema mapping. A provided mapping M12 relates schema S1 to schema S2. A provided mapping M23 relates schema S2 to schema S3. A mapping M13 is generated from schema S1 to schema S3 as a composition of mappings M12 and M23. Mappings M12, M23, and M13 are each expressed in terms of at least one second-order nested tuple-generating dependency (SO nested tgd). Mapping M13 does not expressly recite any element of schema S2. At least one schema of the schemas S1 and S2 may comprise at least one complex type expression nested inside another complex type expression. Mapping M13 may define the composition of the mappings M12 and M23 with respect to a relationship semantics or a transformation semantics.

    摘要翻译: 一种用于生成模式映射的方法。 提供的映射M 12将模式S 1与模式S 2相关联。 提供的映射M 23将模式S 2与模式S 3相关联。 映射M 13从模式S 1生成到模式S 3作为映射M 12的组合,并且 M 23。 映射M 12,M 23和M 13各自表示为至少一个二阶嵌套元组生成依赖关系(SO 嵌套tgd)。 映射M 不明确地背诵模式S 2的任何元素。 模式S 1和S 2的至少一个模式可以包括嵌套在另一个复杂类型表达式内的至少一个复杂类型表达式。 映射M 13可以相对于关系语义或变换语义定义映射M 12和M 23的组成。

    System and method for finding unexpected, but relevant content in an information retrieval system
    10.
    发明授权
    System and method for finding unexpected, but relevant content in an information retrieval system 有权
    在信息检索系统中发现意外但相关内容的系统和方法

    公开(公告)号:US08204878B2

    公开(公告)日:2012-06-19

    申请号:US12688364

    申请日:2010-01-15

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30867

    摘要: An improved method for information retrieval in web query and recommendation systems, where items that are likely unfamiliar to the users of the system, but potentially relevant, are recommended. In a recommendation system having ratings by a plurality of users for a plurality of items, items are assigned to one or more data regions based on item attributes or user activity. Source regions are identified for each of the data regions. For a given user, data regions with which both the user and the user's social network are unfamiliar are identified. Within a given data region, the relevance of items to the user within such regions is evaluated using ratings provided by other users who have entered ratings similar to the user in source regions for the data region. Items receiving the highest relevance score are recommended to the user.

    摘要翻译: 一种改进的Web查询和推荐系统中的信息检索方法,推荐系统用户可能不熟悉的项目。 在具有用于多个项目的多个用户的评级的推荐系统中,基于项目属性或用户活动将项目分配给一个或多个数据区域。 为每个数据区域识别源区域。 对于给定的用户,识别用户和用户的社交网络不熟悉的数据区域。 在给定的数据区域内,使用在数据区域的源区域中输入与用户类似的评级的其他用户提供的评级来评估项目对这些区域内的用户的相关性。 建议用户接收到最高相关性分数的项目。