Methods and apparatus for selective workload off-loading across multiple data centers
    1.
    Type: Granted invention patent
    Status: In force

    Publication No.: US07853953B2

    Publication date: 2010-12-14

    Application No.: US11140429

    Filing date: 2005-05-27

    IPC classification: G06F9/46

    CPC classification: G06F9/5027 G06F2209/509

    Abstract: Techniques for selectively off-loading workload across multiple data centers are disclosed. For example, a technique for processing a user request in accordance with a multiple data center environment comprises the following steps/operations. A user request is obtained at a first data center. The user request is classified based on one or more classification criteria. At least a portion of a workload associated with the classified user request is off-loaded to at least a second data center to be processed, wherein the off-loading step/operation is at least based on one or more administrative policies associated with at least one of the first data center and the second data center. The workload is processed such that the user request is at least substantially satisfied in accordance with a service level agreement for the requesting user.
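The classify-then-off-load flow in this abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the `service_class` field as the classification criterion, the `offloadable` set and per-site `accepts` set as stand-ins for administrative policies, and the simple capacity check.

```python
from dataclasses import dataclass


@dataclass
class Request:
    user: str
    service_class: str  # hypothetical classification criterion, e.g. "gold"
    cost: float         # estimated work units for this request


@dataclass
class DataCenter:
    name: str
    capacity: float
    load: float = 0.0
    # Hypothetical policy: classes this site agrees to accept from peers.
    accepts: frozenset = frozenset()

    def has_room(self, cost: float) -> bool:
        return self.load + cost <= self.capacity


def route(request, local, remotes, offloadable):
    """Pick the data center that processes the request, or None.

    `offloadable` is the (assumed) local policy listing the service
    classes that may leave the first data center.
    """
    chosen = None
    if local.has_room(request.cost):
        chosen = local
    elif request.service_class in offloadable:
        for dc in remotes:
            if request.service_class in dc.accepts and dc.has_room(request.cost):
                chosen = dc
                break
    if chosen is not None:
        chosen.load += request.cost
    return chosen
```

In this sketch a request stays local while capacity allows; only when the first data center is full are the policies of both sites consulted, which mirrors the abstract's "at least one of the first data center and the second data center".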


    Information theory based result merging for searching hierarchical entities across heterogeneous data sources
    3.
    Type: Granted invention patent
    Status: In force

    Publication No.: US09251208B2

    Publication date: 2016-02-02

    Application No.: US13462995

    Filing date: 2012-05-03

    IPC classification: G06F17/30

    Abstract: A method, system, and computer program product are disclosed for merging search results. In one embodiment, the method comprises identifying a query, splitting the query into sub-queries, and calculating information content for each of the sub-queries. This method also comprises executing each of the sub-queries to obtain a plurality of search results, and combining the search results based on the information content calculated for the sub-queries. In an embodiment, the execution of each of the sub-queries includes identifying a multitude of search results for at least one of the sub-queries; and the combining includes grouping said multitude of search results into a plurality of clusters, and computing a relevance score for each of said clusters. In this embodiment, the combining further includes merging the clusters based on the relevance scores computed for the clusters as well as the information content calculated for the sub-queries.
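One way to picture merging by information content is the Python sketch below. It is an illustration under stated assumptions, not the patented method: information content is taken to be the self-information `-log2(p)` of a sub-query, where `p` is approximated by the fraction of the corpus the sub-query matches, and results are combined by a weighted sum of per-document relevance. The clustering step from the abstract is omitted, and the names `information_content` and `merge` are hypothetical.

```python
import math
from collections import defaultdict


def information_content(matches: int, corpus_size: int) -> float:
    """Self-information in bits: rarer sub-queries carry more information."""
    return -math.log2(matches / corpus_size)


def merge(sub_results: dict, corpus_size: int) -> list:
    """Combine per-sub-query hits into one ranked list of document ids.

    sub_results maps sub-query -> {doc_id: relevance in [0, 1]}.
    Assumption: the number of hits stands in for the match count.
    """
    merged = defaultdict(float)
    for hits in sub_results.values():
        if not hits:
            continue  # an empty sub-query contributes nothing
        weight = information_content(len(hits), corpus_size)
        for doc_id, relevance in hits.items():
            merged[doc_id] += weight * relevance
    return sorted(merged, key=merged.get, reverse=True)
```

With a corpus of 8 documents, a sub-query matching 4 of them weighs 1 bit while one matching a single document weighs 3 bits, so a document hit by the rare sub-query rises to the top of the merged list.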


    Using historical information to improve search across heterogeneous indices
    4.
    Type: Granted invention patent
    Status: In force

    Publication No.: US08996561B2

    Publication date: 2015-03-31

    Application No.: US12535330

    Filing date: 2009-08-04

    IPC classification: G06F17/30

    Abstract: A method, system and computer program product are disclosed for searching for data. In one embodiment, the invention provides a method comprising identifying a query and a search scope including a set of specified entities; and for each of these entities, estimating a number of documents that would be identified in a search through the entity to answer the query. On the basis of this estimating, a subset of the entities is formed. The query and this subset of entities are sent to a search engine to search the subset of entities to answer the query. In one embodiment, the estimating includes collecting statistical information from queries to build up a historical cache using heuristics or machine learning techniques, wherein the query includes a keyword and a scope, and the historical cache contains a maximum number of returned results for an entity given the queries executed.
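The historical cache idea can be sketched in a few lines of Python. This is a simplified illustration, not the patent's implementation: the cache is keyed by a hypothetical `(keyword, entity)` pair, the estimate is simply the largest result count seen so far, and entities with no history are kept in scope (an assumption; the abstract does not say how unseen entities are handled).

```python
class HistoricalCache:
    """Remembers the maximum number of results an entity returned per keyword."""

    def __init__(self):
        self._max_hits = {}

    def record(self, keyword: str, entity: str, returned: int) -> None:
        key = (keyword, entity)
        self._max_hits[key] = max(returned, self._max_hits.get(key, 0))

    def estimate(self, keyword: str, entity: str):
        """Return the cached maximum, or None if the pair was never queried."""
        return self._max_hits.get((keyword, entity))


def prune_scope(cache: HistoricalCache, keyword: str, scope: list) -> list:
    """Form the subset of entities worth sending to the search engine."""
    kept = []
    for entity in scope:
        estimate = cache.estimate(keyword, entity)
        # Keep entities with no history (assumed policy) or a positive estimate.
        if estimate is None or estimate > 0:
            kept.append(entity)
    return kept
```

After earlier queries have shown that an entity never returns documents for a keyword, `prune_scope` drops it from the scope, so the search engine only visits entities that are expected to contribute results.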


    INFORMATION THEORY BASED RESULT MERGING FOR SEARCHING HIERARCHICAL ENTITIES ACROSS HETEROGENEOUS DATA SOURCES
    5.
    Type: Invention patent application
    Status: Lapsed

    Publication No.: US20110082859A1

    Publication date: 2011-04-07

    Application No.: US12575210

    Filing date: 2009-10-07

    IPC classification: G06F17/30

    Abstract: A method, system, and computer program product are disclosed for merging search results. In one embodiment, the method comprises identifying a query, splitting the query into sub-queries, and calculating information content for each of the sub-queries. This method also comprises executing each of the sub-queries to obtain a plurality of search results, and combining the search results based on the information content calculated for the sub-queries. In an embodiment, the execution of each of the sub-queries includes identifying a multitude of search results for at least one of the sub-queries; and the combining includes grouping said multitude of search results into a plurality of clusters, and computing a relevance score for each of said clusters. In this embodiment, the combining further includes merging the clusters based on the relevance scores computed for the clusters as well as the information content calculated for the sub-queries.


    Analyzing anticipated value and effort in using cloud computing to process a specified workload
    7.
    Type: Granted invention patent
    Status: In force

    Publication No.: US08229999B2

    Publication date: 2012-07-24

    Application No.: US12652546

    Filing date: 2010-01-05

    IPC classification: G06F15/16

    Abstract: Workloads that are and are not appropriate for transformation and processing on a cloud computing environment are identified. A user entity is provided with a measurement of at least one of the value that will be realized and the effort that will be required. A plurality of attributes are specified, and input data represents the extent to which the user entity processing system possesses each attribute. The input data for each attribute is used to derive a corresponding numerical parameter value, and these values are used to compute a final score.
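The attribute-to-score pipeline described in this abstract can be pictured with a small Python sketch. The details are assumptions for illustration only: input data is taken to be a rating from 0 to `scale_max`, each rating is normalized to a parameter value in [0, 1], and the final score is a weighted average. The patent abstract does not specify the scale, the weights, or the combination rule.

```python
def parameter_value(rating: float, scale_max: float = 5) -> float:
    """Map a raw attribute rating onto a numerical parameter in [0, 1]."""
    if not 0 <= rating <= scale_max:
        raise ValueError(f"rating {rating} is outside 0..{scale_max}")
    return rating / scale_max


def final_score(ratings: dict, weights: dict) -> float:
    """Weighted average of the parameter values (assumed combination rule).

    ratings maps attribute name -> raw rating; weights maps the same
    attribute names -> relative importance.
    """
    total_weight = sum(weights[name] for name in ratings)
    weighted = sum(weights[name] * parameter_value(r) for name, r in ratings.items())
    return weighted / total_weight
```

A workload rated 5 on one attribute and 0 on another, with equal weights, scores 0.5 under this scheme; a threshold on that score could then separate workloads that suit the cloud from those that do not.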


    INFORMATION THEORY BASED RESULT MERGING FOR SEARCHING HIERARCHICAL ENTITIES ACROSS HETEROGENEOUS DATA SOURCES
    8.
    Type: Invention patent application
    Status: Under examination (published)

    Publication No.: US20120221542A1

    Publication date: 2012-08-30

    Application No.: US13462995

    Filing date: 2012-05-03

    IPC classification: G06F17/30

    Abstract: A method, system, and computer program product are disclosed for merging search results. In one embodiment, the method comprises identifying a query, splitting the query into sub-queries, and calculating information content for each of the sub-queries. This method also comprises executing each of the sub-queries to obtain a plurality of search results, and combining the search results based on the information content calculated for the sub-queries. In an embodiment, the execution of each of the sub-queries includes identifying a multitude of search results for at least one of the sub-queries; and the combining includes grouping said multitude of search results into a plurality of clusters, and computing a relevance score for each of said clusters. In this embodiment, the combining further includes merging the clusters based on the relevance scores computed for the clusters as well as the information content calculated for the sub-queries.


    USING HISTORICAL INFORMATION TO IMPROVE SEARCH ACROSS HETEROGENEOUS INDICES
    9.
    Type: Invention patent application
    Status: In force

    Publication No.: US20110035399A1

    Publication date: 2011-02-10

    Application No.: US12535330

    Filing date: 2009-08-04

    IPC classification: G06F17/30

    Abstract: A method, system and computer program product are disclosed for searching for data. In one embodiment, the invention provides a method comprising identifying a query and a search scope including a set of specified entities; and for each of these entities, estimating a number of documents that would be identified in a search through the entity to answer the query. On the basis of this estimating, a subset of the entities is formed. The query and this subset of entities are sent to a search engine to search the subset of entities to answer the query. In one embodiment, the estimating includes collecting statistical information from queries to build up a historical cache using heuristics or machine learning techniques, wherein the query includes a keyword and a scope, and the historical cache contains a maximum number of returned results for an entity given the queries executed.


    Using historical information to improve search across heterogeneous indices
    10.
    Type: Granted invention patent
    Status: In force

    Publication No.: US08909663B2

    Publication date: 2014-12-09

    Application No.: US13435978

    Filing date: 2012-03-30

    IPC classification: G06F17/30

    Abstract: A method, system and computer program product are disclosed for searching for data. In one embodiment, the invention provides a method comprising identifying a query and a search scope including a set of specified entities; and for each of these entities, estimating a number of documents that would be identified in a search through the entity to answer the query. On the basis of this estimating, a subset of the entities is formed. The query and this subset of entities are sent to a search engine to search the subset of entities to answer the query. In one embodiment, the estimating includes collecting statistical information from queries to build up a historical cache using heuristics or machine learning techniques, wherein the query includes a keyword and a scope, and the historical cache contains a maximum number of returned results for an entity given the queries executed.
