APPARATUS AND METHODS FOR CONCEPT-CENTRIC INFORMATION EXTRACTION
    1.
    Invention application
    Status: under examination (published)

    Publication No.: US20100241639A1

    Publication date: 2010-09-23

    Application No.: US12408450

    Filing date: 2009-03-20

    IPC classification: G06F17/30

    CPC classification: G06F16/345 G06F16/313

    Abstract: Disclosed are methods and apparatus for extracting (or annotating) structured information from web content. Web content of interest from a particular domain is represented as one or more tree instances having a plurality of branching nodes that each correspond to a web object such that the tree instances correspond to one or more structured data instances. The particular domain is associated with domain knowledge that includes one or more presentation rulesets that each specify a particular structure for a set of data instances, a domain-specific concept labeler, one or more specified properties of the web objects in the tree instances, and a concept schema that specifies a representation of the data to be extracted from the web content. A structured data instance that conforms to the concept schema is extracted from the one or more tree instances based on the domain knowledge for the particular domain. Extraction of the structured data instances is accomplished by (i) using the domain-specific concept labeler to annotate a subset of nodes of the tree instances; and (ii) using a locally adaptive concept annotator to extract the structured data instances based on the annotated segments and the local properties associated with such annotated segments. The extracted structured data instance is stored as structured output records in a database.
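
    The two extraction steps in the abstract above can be illustrated with a minimal sketch. Everything named here (the Node class, the restaurant-style SCHEMA, the regex PATTERNS, and the rule that the first unlabeled sibling is the name) is a hypothetical stand-in chosen for illustration, not the labeler or annotator of the application itself.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """One web object in a tree instance (hypothetical minimal form)."""
    text: str = ""
    children: List["Node"] = field(default_factory=list)
    label: Optional[str] = None  # filled in by the concept labeler


# Hypothetical concept schema: the fields every extracted record must have.
SCHEMA = ("name", "phone", "zip")

# Hypothetical domain-specific concept labeler: regexes for easy concepts.
PATTERNS = {
    "phone": re.compile(r"^\(?\d{3}\)?[ -]\d{3}-\d{4}$"),
    "zip": re.compile(r"^\d{5}$"),
}


def label_nodes(node):
    """Step (i): annotate the subset of nodes whose text matches a concept."""
    for concept, pattern in PATTERNS.items():
        if pattern.match(node.text):
            node.label = concept
    for child in node.children:
        label_nodes(child)


def extract_records(node, out=None):
    """Step (ii): build one record per parent that has labeled children.

    Local property used here (an assumption): the first unlabeled sibling
    with text, next to the labeled nodes, is taken to be the name.
    """
    out = [] if out is None else out
    labeled = {c.label: c.text for c in node.children if c.label}
    if labeled:
        record = {concept: labeled.get(concept) for concept in SCHEMA}
        unlabeled = [c.text for c in node.children if c.label is None and c.text]
        if unlabeled:
            record["name"] = unlabeled[0]
        out.append(record)
    for child in node.children:
        extract_records(child, out)
    return out
```

    Calling label_nodes on a page tree and then extract_records on the same tree returns a list of dictionaries keyed by the schema fields, which could then be written to a database as output records.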

    System for opinion reconciliation
    4.
    Granted invention patent
    Status: in force

    Publication No.: US07895149B2

    Publication date: 2011-02-22

    Application No.: US11957779

    Filing date: 2007-12-17

    IPC classification: G06N5/00

    CPC classification: G06N5/04

    Abstract: A system is disclosed for reconciling opinions generated by agents with respect to one or more predicates. The disclosed system may use observed variables and a probabilistic model including latent parameters to estimate a truth score associated with each of the predicates. The truth score, as well as one or more of the latent parameters of the probabilistic model, may be estimated based on the observed variables. The truth score generated by the disclosed system may enable publishers to reliably represent the truth of a predicate to interested users.
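
    As a rough illustration of estimating truth scores and latent parameters together, the sketch below uses a simple one-parameter-per-agent model fitted with an EM-style loop. This is a swapped-in textbook technique, assumed for illustration only. The patent's actual probabilistic model is not given in the abstract, and the function name reconcile, the 0.7 starting accuracy, and the uniform prior are all hypothetical.

```python
def reconcile(opinions, n_iter=20):
    """Estimate truth scores and agent accuracies from boolean opinions.

    opinions maps predicate -> {agent: bool}. The observed variables are the
    opinions; the latent parameter (an assumption) is one accuracy per agent.
    """
    agents = {a for votes in opinions.values() for a in votes}
    accuracy = {a: 0.7 for a in agents}  # initial guess breaks the symmetry
    truth = {}
    for _ in range(n_iter):
        # E-step: truth score of each predicate under a uniform prior.
        for pred, votes in opinions.items():
            like_true = like_false = 1.0
            for agent, says_true in votes.items():
                acc = accuracy[agent]
                like_true *= acc if says_true else 1.0 - acc
                like_false *= 1.0 - acc if says_true else acc
            truth[pred] = like_true / (like_true + like_false)
        # M-step: accuracy is the expected share of opinions that were right.
        for agent in agents:
            agree = total = 0.0
            for pred, votes in opinions.items():
                if agent in votes:
                    agree += truth[pred] if votes[agent] else 1.0 - truth[pred]
                    total += 1.0
            # Clamp so that no likelihood ever becomes exactly zero.
            accuracy[agent] = min(max(agree / total, 0.01), 0.99)
    return truth, accuracy
```

    With two agents that usually agree and a third that usually contradicts them, the loop assigns high truth scores to the majority's predicates and a lower accuracy to the dissenting agent.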

    Methods and apparatus for mapping source schemas to a target schema using schema embedding
    5.
    Granted invention patent
    Status: in force

    Publication No.: US07921072B2

    Publication date: 2011-04-05

    Application No.: US11141357

    Filing date: 2005-05-31

    IPC classification: G06F7/00 G06F17/00

    CPC classification: G06F17/3092

    Abstract: Methods and apparatus are provided for mapping XML source documents to target documents using schema embeddings. According to one aspect of the invention, one or more edges in the one or more source schemas are mapped to one or more paths in at least one target schema. The disclosed mapping techniques ensure that (i) one or more source documents that conform to one or more of the source schemas can be recovered from one or more target documents that conform to the at least one target schema, if a mapping exists between the one or more of the source schemas and the at least one target schema; (ii) queries on one or more source documents that conform to one or more of the source schemas in a given query language can be answered on one or more target documents that conform to the at least one target schema; and (iii) the one or more target documents conform to a target schema.
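
    The idea of mapping source-schema edges to target-schema paths can be sketched as follows. The schemas, element names, and the translate helper are hypothetical, and the sketch covers only path rewriting for simple root-to-node queries, not the recoverability guarantees described in the abstract.

```python
# Schemas as sets of (parent, child) edges. All element names are made up.
SOURCE = {("db", "class"), ("class", "title")}
TARGET = {
    ("school", "courses"),
    ("courses", "course"),
    ("course", "basic"),
    ("basic", "name"),
}

# The embedding: the source root maps to a target root, and every source
# edge maps to a path (a list of child steps) in the target schema.
ROOT = {"db": "school"}
EMBEDDING = {
    ("db", "class"): ["courses", "course"],
    ("class", "title"): ["basic", "name"],
}


def path_exists(schema, start, steps):
    """Check that following `steps` from `start` uses only schema edges."""
    node = start
    for step in steps:
        if (node, step) not in schema:
            return False
        node = step
    return True


def translate(source_path):
    """Rewrite a root-to-node source path into the matching target path."""
    target = [ROOT[source_path[0]]]
    for parent, child in zip(source_path, source_path[1:]):
        if (parent, child) not in SOURCE:
            raise ValueError(f"{parent}/{child} is not a source edge")
        steps = EMBEDDING[(parent, child)]
        if not path_exists(TARGET, target[-1], steps):
            raise ValueError(f"path {steps} is not in the target schema")
        target.extend(steps)
    return target
```

    Under these assumptions, a query for db/class/title on a source document is answered on the target document by following the translated path instead.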

    Method and apparatus for composing XSL transformations with XML publishing views
    7.
    Granted invention patent
    Status: in force

    Publication No.: US09152735B2

    Publication date: 2015-10-06

    Application No.: US10626835

    Filing date: 2003-07-24

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30914

    Abstract: A method and apparatus are provided for composing XSL transformations with XML publishing views. XSL transformations are performed on XML documents defined as views of relational databases. A portion of a relational database can be exported to an XML document. An initial view query defines an XML view on the relational database and an XSLT stylesheet specifies at least one transformation. The initial view query is modified to account for an effect of the transformation and the modified view query is applied to the relational database to obtain the XML document. When the modified view query is evaluated on a relational database instance, the same XML document is obtained as would be obtained by evaluating the XSLT stylesheet on the original XML view.
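
    The equivalence claimed in the last sentence of the abstract can be shown on a toy case. In the sketch below the table, the publish helper, and the transform function (a Python stand-in for an XSLT stylesheet) are all hypothetical, and the modified view query is written by hand, whereas the patent describes deriving it from the stylesheet.

```python
import sqlite3
import xml.etree.ElementTree as ET


def publish(conn, columns, where="1=1"):
    """Evaluate a view query and publish the rows as a <books> document."""
    root = ET.Element("books")
    sql = f"SELECT {', '.join(columns)} FROM book WHERE {where} ORDER BY title"
    for row in conn.execute(sql):
        book = ET.SubElement(root, "book")
        for column, value in zip(columns, row):
            ET.SubElement(book, column).text = str(value)
    return root


def transform(view):
    """Stand-in for the stylesheet: keep cheap books, output titles only."""
    out = ET.Element("books")
    for book in view.findall("book"):
        if float(book.findtext("price")) < 20:
            cheap = ET.SubElement(out, "book")
            ET.SubElement(cheap, "title").text = book.findtext("title")
    return out


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE book (title TEXT, price REAL, year INTEGER)")
conn.executemany(
    "INSERT INTO book VALUES (?, ?, ?)",
    [("A", 12.5, 2001), ("B", 30.0, 2003), ("C", 8.0, 1999)],
)

# Naive route: materialize the full XML view, then transform it.
naive = transform(publish(conn, ["title", "price", "year"]))

# Composed route: a modified view query that already reflects the transform.
composed = publish(conn, ["title"], where="price < 20")

same_document = ET.tostring(naive) == ET.tostring(composed)
```

    The composed route never builds the price and year elements or the rows that the transformation would discard, which is the practical benefit of pushing the transformation into the view query.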

    METHOD AND SYSTEM FOR RENDERING SIMPLIFIED POINT FINDING MAPS
    8.
    Invention application
    Status: under examination (published)

    Publication No.: US20090112455A1

    Publication date: 2009-04-30

    Application No.: US11923378

    Filing date: 2007-10-24

    IPC classification: G01C21/34

    CPC classification: G06Q10/047

    Abstract: A method and system for rendering simplified point finding maps are provided. The method may include defining a boundary area and a target point within a target area, on a map that comprises multiple road segments. A plurality of routes that follow the road segments and go from the boundary area to the target point may be selected. Road segments that are not necessary to the routes may be removed from the map.
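
    One way to picture the pruning step is the sketch below. It assumes that the boundary area is reduced to a list of entry points and that each route is a shortest path found with Dijkstra's algorithm. Neither choice is stated in the abstract, and the function names are hypothetical.

```python
import heapq


def shortest_route(roads, start, goal):
    """Dijkstra on an undirected road graph; roads maps (a, b) -> length."""
    graph = {}
    for (a, b), length in roads.items():
        graph.setdefault(a, []).append((b, length))
        graph.setdefault(b, []).append((a, length))
    best = {start: 0.0}
    prev = {}
    queue = [(0.0, start)]
    while queue:
        dist, node = heapq.heappop(queue)
        if node == goal:
            break
        if dist > best.get(node, float("inf")):
            continue  # stale queue entry
        for nxt, length in graph.get(node, []):
            cand = dist + length
            if cand < best.get(nxt, float("inf")):
                best[nxt] = cand
                prev[nxt] = node
                heapq.heappush(queue, (cand, nxt))
    if goal not in best:
        return []  # no route from this entry point
    route, node = [], goal
    while node != start:
        route.append((prev[node], node))
        node = prev[node]
    return route[::-1]


def simplify(roads, boundary_points, target):
    """Keep only the road segments used by some route to the target."""
    keep = set()
    for point in boundary_points:
        for a, b in shortest_route(roads, point, target):
            keep.add(frozenset((a, b)))
    return {seg: length for seg, length in roads.items() if frozenset(seg) in keep}
```

    For a small map with entry points N and S, simplify returns only the segments on the N-to-target and S-to-target routes, so side streets that no route uses disappear from the rendered map.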

    Real-time event processing system with analysis engine using recovery information
    9.
    Granted invention patent
    Status: in force

    Publication No.: US06502133B1

    Publication date: 2002-12-31

    Application No.: US09276221

    Filing date: 1999-03-25

    IPC classification: G06F13/00

    Abstract: Disclosed is a real-time event processing system (EPS) for processing a sequence of events generated by one or more applications. In an illustrative embodiment, the EPS includes a set of real-time analysis engines (RAEs) operating in parallel, e.g., a set of clusters each including one or more RAEs, and one or more mappers for mapping a given input event to a particular one of the clusters. A main-memory database system is coupled to the RAEs, and the RAEs process events associated with input streams from one or more data sources and deliver output streams to one or more data sinks. The data sources and data sinks may be, e.g., network elements, clients, databases, etc. The events are processed in accordance with services implemented in the RAEs, using data stored in a memory portion of the main-memory database system accessible to the RAEs. The data may include, e.g., a subscription table storing subscription information indicating the service or services that should be executed for a given event. The services are generated in a service authoring environment (SAE) in the EPS, using a declarative language. The SAE generates the services in the form of object code components, e.g., dynamically linked libraries, which may be dynamically linked into the RAEs without interrupting event processing. Recovery information regarding a recovery point for a given RAE or set of RAEs in the EPS may be stored in a memory portion of the main-memory database system, and utilized to implement a roll-back of the RAE to the recovery point.
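
    The interplay of mapper, subscription table, and recovery point can be sketched in a few lines. The class and function names below are hypothetical, a plain dictionary stands in for the main-memory database, and the byte-sum routing rule is only an assumed example of a mapper.

```python
import copy


class AnalysisEngine:
    """Toy stand-in for one real-time analysis engine (RAE)."""

    def __init__(self, subscriptions, services):
        self.subscriptions = subscriptions  # subscriber -> service names
        self.services = services            # service name -> callable
        self.state = {}                     # stands in for main-memory data
        self._recovery_point = {}

    def process(self, event):
        """Run every service the subscription table lists for this event."""
        for name in self.subscriptions.get(event["subscriber"], []):
            self.services[name](self.state, event)

    def checkpoint(self):
        """Store recovery information for the current state."""
        self._recovery_point = copy.deepcopy(self.state)

    def roll_back(self):
        """Return the engine to the last recovery point."""
        self.state = copy.deepcopy(self._recovery_point)


def map_event(event, n_clusters):
    """Assumed mapper: a stable hash keeps a subscriber on one cluster."""
    return sum(event["subscriber"].encode()) % n_clusters


def count_calls(state, event):
    """Example service: count events per subscriber."""
    key = event["subscriber"]
    state[key] = state.get(key, 0) + 1


SUBSCRIPTIONS = {"alice": ["count_calls"], "bob": ["count_calls"]}
SERVICES = {"count_calls": count_calls}
engines = [AnalysisEngine(SUBSCRIPTIONS, SERVICES) for _ in range(2)]


def dispatch(event):
    """Send an event to the engine chosen by the mapper."""
    engines[map_event(event, len(engines))].process(event)
```

    Calling checkpoint on an engine, dispatching further events, and then calling roll_back restores the counts held at the checkpoint, which mirrors the roll-back to a recovery point described in the abstract.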

    Large scale entity-specific resource classification
    10.
    Granted invention patent
    Status: in force

    Publication No.: US09317613B2

    Publication date: 2016-04-19

    Application No.: US12764694

    Filing date: 2010-04-21

    IPC classification: G06F17/30

    CPC classification: G06F17/30867

    Abstract: A system and method are described for large scale entity-specific classification of each entity-specific set of candidates in a collection of candidates for each specific entity in a collection of entities. The collection of entities may comprise a specific category or domain of entities (e.g. schools, restaurants, manufacturers, products, events, people). Candidates may comprise webpages or other resources with resource identifiers. Entity specific sets of candidates may be found by leveraging search engine query results and user interaction therewith for queries based on entity-specific attributes. The relationship(s) or class(es) for which candidate resources are being classified relative to a specific entity may comprise an authoritative, official home page (OHP), or other class (e.g. fan page, review, aggregator) relative to a specific entity. A feature generator generates entity-specific features for candidates. In accordance with its features, one or more classifiers rank each candidate for a specific class for a specific entity.
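
    A minimal sketch of the feature-generation and ranking steps for the official-home-page class is given below. The three features, the hand-set WEIGHTS (standing in for a trained classifier), and the click_share field are assumptions made for illustration and are not taken from the patent.

```python
from urllib.parse import urlparse


def features(entity, candidate):
    """Generate entity-specific features for one candidate URL."""
    parsed = urlparse(candidate["url"])
    host = parsed.netloc.lower()
    tokens = entity["name"].lower().split()
    return {
        # Share of the entity's name tokens that appear in the host name.
        "name_in_host": sum(t in host for t in tokens) / len(tokens),
        # Home pages tend to sit at the site root.
        "is_root": 1.0 if parsed.path.strip("/") == "" else 0.0,
        # Share of clicks this result received for entity-based queries.
        "click_share": candidate["click_share"],
    }


# Hand-set weights standing in for a trained classifier (hypothetical).
WEIGHTS = {"name_in_host": 2.0, "is_root": 1.0, "click_share": 1.5}


def rank_candidates(entity, candidates):
    """Rank one entity's candidates for the official-home-page class."""
    def score(candidate):
        feats = features(entity, candidate)
        return sum(WEIGHTS[name] * value for name, value in feats.items())

    return sorted(candidates, key=score, reverse=True)
```

    Because both the features and the ranking are computed per entity, the same scoring function can be applied independently to every entity in a large collection.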
