Phrase matching in documents having nested-structure arbitrary (document-specific) markup
    3.
    发明授权
    Phrase matching in documents having nested-structure arbitrary (document-specific) markup 失效
    具有嵌套结构任意(文档特定)标记的文档中的短语匹配

    公开(公告)号:US07356528B1

    公开(公告)日:2008-04-08

    申请号:US10765675

    申请日:2004-01-27

    IPC分类号: G06F7/00

    摘要: A method of searching a document having nested-structure document-specific markup (such as Extensible Markup Language (XML)) involves 112 receiving a query that designates at least (A) a phrase to be matched in a phrase matching process, and (B) a selective designation of at least a tag or annotation that is to be ignored during the phrase matching process. The method further involves 114 deriving query-specific indices based on query-independent indices that were created specific to each document, and 116 carrying out the phrase matching process using the query-specific indices on the document having the nested-structure document-specific markup.

    摘要翻译: 搜索具有嵌套结构文档特定标记(例如可扩展标记语言(XML))的文档的方法涉及112接收在短语匹配处理中至少指定(A)要匹配的短语的查询,并且(B )选择性地指定短语匹配过程中至少要被忽略的标签或注释。 该方法还包括基于针对每个文档特定创建的独立于查询的索引来导出查询特定索引,以及116使用具有嵌套结构文档特定标记的文档上的查询特定索引进行短语匹配处理 。

    PHRASE MATCHING IN DOCUMENTS HAVING NESTED-STRUCTURE ARBITRARY (DOCUMENT-SPECIFIC) MARKUP
    4.
    发明申请
    PHRASE MATCHING IN DOCUMENTS HAVING NESTED-STRUCTURE ARBITRARY (DOCUMENT-SPECIFIC) MARKUP 有权
    具有结构化仲裁(文件特定)标记的文档中的相关匹配

    公开(公告)号:US20080154891A1

    公开(公告)日:2008-06-26

    申请号:US12042287

    申请日:2008-03-04

    IPC分类号: G06F17/30

    摘要: A method of searching a document having nested-structure document-specific markup (such as Extensible Markup Language (XML)) involves 112 receiving a query that designates at least (A) a phrase to be matched in a phrase matching process, and (B) a selective designation of at least a tag or annotation that is to be ignored during the phrase matching process. The method further involves 114 deriving query-specific indices based on query-independent indices that were created specific to each document, and 116 carrying out the phrase matching process using the query-specific indices on the document having the nested-structure document-specific markup.

    摘要翻译: 搜索具有嵌套结构文档特定标记(例如可扩展标记语言(XML))的文档的方法涉及112接收在短语匹配处理中至少指定(A)要匹配的短语的查询,并且(B )选择性地指定短语匹配过程中至少要被忽略的标签或注释。 该方法还包括基于针对每个文档特定创建的独立于查询的索引来导出查询特定索引,以及116使用具有嵌套结构文档特定标记的文档上的查询特定索引进行短语匹配处理 。

    Phrase matching in documents having nested-structure arbitrary (document-specific) markup
    5.
    发明授权
    Phrase matching in documents having nested-structure arbitrary (document-specific) markup 有权
    具有嵌套结构任意(文档特定)标记的文档中的短语匹配

    公开(公告)号:US08549006B2

    公开(公告)日:2013-10-01

    申请号:US12042287

    申请日:2008-03-04

    IPC分类号: G06F7/00

    摘要: A method of searching a document having nested-structure document-specific markup (such as Extensible Markup Language (XML)) involves 112 receiving a query that designates at least (A) a phrase to be matched in a phrase matching process, and (B) a selective designation of at least a tag or annotation that is to be ignored during the phrase matching process. The method further involves 114 deriving query-specific indices based on query-independent indices that were created specific to each document, and 116 carrying out the phrase matching process using the query-specific indices on the document having the nested-structure document-specific markup.

    摘要翻译: 搜索具有嵌套结构文档特定标记(例如可扩展标记语言(XML))的文档的方法涉及112接收在短语匹配处理中至少指定(A)要匹配的短语的查询,并且(B )选择性地指定短语匹配过程中至少要被忽略的标签或注释。 该方法还包括基于针对每个文档特定创建的独立于查询的索引来导出查询特定索引,以及116使用具有嵌套结构文档特定标记的文档上的查询特定索引进行短语匹配处理 。

    User-powered recommendation system
    6.
    发明授权
    User-powered recommendation system 有权
    用户推荐系统

    公开(公告)号:US08943081B2

    公开(公告)日:2015-01-27

    申请号:US12616892

    申请日:2009-11-12

    摘要: Recommendation systems are widely used in Internet applications. In current recommendation systems, users only play a passive role and have limited control over the recommendation generation process. As a result, there is often considerable mismatch between the recommendations made by these systems and the actual user interests, which are fine-grained and constantly evolving. With a user-powered distributed recommendation architecture, individual users can flexibly define fine-grained communities of interest in a declarative fashion and obtain recommendations accurately tailored to their interests by aggregating opinions of users in such communities. By combining a progressive sampling technique with data perturbation methods, the recommendation system is both scalable and privacy-preserving.

    摘要翻译: 推荐系统广泛应用于互联网应用。 在目前的推荐系统中,用户只能发挥被动的作用,对推荐生成过程的控制有限。 因此,这些系统提出的建议和实际用户兴趣之间经常存在很大的不匹配,这些建议是细粒度和不断发展的。 通过用户分配的推荐体系结构,个人用户可以灵活地定义精细的社区,并以声明方式定义感兴趣的社区,通过汇总用户在这些社区的意见,获得准确定制的兴趣建议。 通过将逐行采样技术与数据扰动方法相结合,推荐系统既可扩展又保密。

    Processing data using sequential dependencies
    7.
    发明授权
    Processing data using sequential dependencies 有权
    使用顺序依赖来处理数据

    公开(公告)号:US08645309B2

    公开(公告)日:2014-02-04

    申请号:US12592586

    申请日:2009-11-30

    CPC分类号: G06N7/00 G06N5/00

    摘要: The specification describes data processes for analyzing large data steams for target anomalies. “Sequential dependencies” (SDs) are chosen for ordered data and present a framework for discovering which subsets of the data obey a given sequential dependency. Given an interval G, an SD on attributes X and Y, written as X→G Y, denotes that the distance between the Y-values of any two consecutive records, when sorted on X, are within G. SDs may be extended to Conditional Sequential Dependencies (CSDs), consisting of an underlying SD plus a representation of the subsets of the data that satisfy the SD. The conditional approximate sequential dependencies may be expressed as pattern tableaux, i.e., compact representations of the subsets of the data that satisfy the underlying dependency.

    摘要翻译: 该规范描述了用于分析目标异常的大型数据流的数据处理。 为有序数据选择“顺序依赖”(SDs),并提供一个框架,用于发现数据的哪些子集服从给定的顺序依赖。 给定一个间隔G,写入X> GY的属性X和Y上的SD表示当在X上排序时,任何两个连续记录的Y值之间的距离在G之内可以扩展到条件 顺序依赖性(CSD)由基础SD加上满足SD的数据子集的表示组成。 条件近似顺序依赖性可以表示为模式表,即满足基础依赖性的数据子集的紧凑表示。

    Online Data Fusion
    8.
    发明申请
    Online Data Fusion 有权
    在线数据融合

    公开(公告)号:US20130144843A1

    公开(公告)日:2013-06-06

    申请号:US13311034

    申请日:2011-12-05

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30634

    摘要: An online data fusion system receives a query, probes a first source for an answer to the query, returns the answer from the first source, refreshes the answer while probing an additional source, and applies fusion techniques on data associated with an answer that is retrieved from the additional source. For each retrieved answer, the online data fusion system computes the probability that the answer is correct and stops retrieving data for the answer after gaining enough confidence that data retrieved from the unprocessed sources are unlikely to change the answer. The online data fusion system returns correct answers and terminates probing additional sources in an expeditious manner without sacrificing the quality of the answers.

    摘要翻译: 在线数据融合系统接收查询,探索第一个来源以获得查询的答案,从第一个源返回答案,在探索附加的源时刷新答案,并对与检索到的答案相关联的数据应用融合技术 从额外的来源。 对于每个检索到的答案,在线数据融合系统计算出答案正确的概率,并且在获得足够的信心从而从未处理的源中检索的数据不太可能改变答案之后,停止检索答案数据。 在线数据融合系统返回正确的答案,并以迅速的方式终止探测附加来源,而不牺牲答案的质量。

    Methods and systems to store state used to forward multicast traffic
    9.
    发明授权
    Methods and systems to store state used to forward multicast traffic 有权
    存储用于转发组播流量的状态的方法和系统

    公开(公告)号:US08295203B2

    公开(公告)日:2012-10-23

    申请号:US12060709

    申请日:2008-04-01

    IPC分类号: H04L12/28

    摘要: Methods and systems are described to store state used to forward multicast traffic. The system includes a receiving module to receive request to add a first node to a membership tree. The membership tree includes a first plurality of nodes associated with a multicast group. The system further includes a processing module to identify a second node in the first plurality of nodes and to communicate a node identifier that identifies the first node over a network to the second node. The node identifier is to be stored at the second node to add the first node to the membership tree. The node identifier is further to be stored in the membership tree exclusively at the second node to enable the second node to forward the multicast traffic to the first node.

    摘要翻译: 描述了用于存储用于转发组播流量的状态的方法和系统。 该系统包括接收模块,用于接收向成员树添加第一个节点的请求。 隶属树包括与多播组相关联的第一多个节点。 所述系统还包括处理模块,用于识别所述第一多个节点中的第二节点,并将通过网络识别所述第一节点的节点标识符传送到所述第二节点。 节点标识符将存储在第二个节点,以将第一个节点添加到成员树中。 节点标识符进一步被存储在专属于第二节点的成员树中,以使得第二节点能够将多播业务转发到第一节点。

    Selectivity estimation of set similarity selection queries
    10.
    发明授权
    Selectivity estimation of set similarity selection queries 失效
    集合相似性选择查询的选择性估计

    公开(公告)号:US08161046B2

    公开(公告)日:2012-04-17

    申请号:US12274546

    申请日:2008-11-20

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30469

    摘要: The invention relates to a system and/or methodology for selectivity estimation of set similarity queries. More specifically, the invention relates to a selectivity estimation technique employing hashed sampling. The invention providing for samples constructed a priori that can efficiently and quickly provide accurate estimates for arbitrary queries, and can be updated efficiently as well.

    摘要翻译: 本发明涉及用于组合相似性查询的选择性估计的系统和/或方法。 更具体地,本发明涉及采用散列采样的选择性估计技术。 本发明提供了可以有效地和快速地为任意查询提供准确估计的先验构建的样本,并且还可以有效地更新。