Computer data storage system with parallelization migration plan generator
    1.
    发明授权
    Computer data storage system with parallelization migration plan generator 失效
    具有并行迁移计划生成器的计算机数据存储系统

    公开(公告)号:US06571258B1

    公开(公告)日:2003-05-27

    申请号:US09394646

    申请日:1999-09-13

    IPC分类号: G06F1200

    摘要: A data storage system is provided in which a migration plan generates migration plans for reaching a goal configuration from an initial configuration in a predetermined period of time. The migration plan initially makes terminal moves until no further terminal moves can be made. A shunt move is then made based on reducing the contention (the total size of the data stores that need to be moved onto a particular data stores device divided by the amount of excess capacity on the data storage device). The shunt is selected to lead to the lowest contention for the data storage system. Subsequently, the migration plan returns to making terminal moves to develop a migration plan. By determining existence and utilization dependencies of the various moves, independent moves are determined to be implemented in parallel with moves having dependencies. This produces parallelized migration plans which run much faster than sequential migration plans.

    摘要翻译: 提供了一种数据存储系统,其中迁移计划在预定时间段内从初始配置生成用于达到目标配置的迁移计划。 迁移计划最初使终端移动,直到不再进一步终端移动。 然后,基于减少争用(需要移动到特定数据存储设备上的数据存储器的总大小除以数据存储设备上的超量容量)来进行分流移动。 分流器被选择为导致数据存储系统的最低竞争力。 随后,迁移计划返回到终端移动制定迁移计划。 通过确定各种移动的存在和利用依赖性,确定独立移动与具有依赖性的移动并行地实现。 这产生并行迁移计划,其运行速度比顺序迁移计划快得多。

    Computer data storage system with migration plan generator
    2.
    发明授权
    Computer data storage system with migration plan generator 有权
    具有迁移计划生成器的计算机数据存储系统

    公开(公告)号:US06381619B1

    公开(公告)日:2002-04-30

    申请号:US09394221

    申请日:1999-09-13

    IPC分类号: G06F1730

    摘要: A data storage system is provided in which a migration plan generates migration plans for reaching a goal configuration from an initial configuration in a predetermined period of time. The migration plan initially makes terminal moves until no further terminal moves can be made. A shunt move is then made based on reducing the contention (the total size of the data stores that need to be moved onto a particular data stores device divided by the amount of excess capacity on the data storage device). The shunt is selected to lead to the lowest contention for the data storage system. Subsequently, the migration plan returns to making terminal moves to develop a migration plan. Further migration plans are provided.

    摘要翻译: 提供了一种数据存储系统,其中迁移计划在预定时间段内从初始配置生成用于达到目标配置的迁移计划。 迁移计划最初使终端移动,直到不再进一步终端移动。 然后,基于减少争用(需要移动到特定数据存储设备上的数据存储器的总大小除以数据存储设备上的超量容量)来进行分流移动。 分流器被选择为导致数据存储系统的最低竞争力。 随后,迁移计划返回到终端移动制定迁移计划。 提供进一步的移民计划。

    Scalable Query Infrastructure
    4.
    发明申请
    Scalable Query Infrastructure 有权
    可扩展查询基础设施

    公开(公告)号:US20080082628A1

    公开(公告)日:2008-04-03

    申请号:US11537476

    申请日:2006-09-29

    IPC分类号: G06F15/16

    CPC分类号: G06F17/30442

    摘要: Methods of querying a large number of endsystems are described in which metadata is replicated between endsystems. When a query is injected, an available endsystem receives a message relating to the query which identifies a range of endsystems for which that available endsystem is responsible. The available endsystem then generates completeness data for the range of endsystems based on data stored at the endsystem and this completeness data is transmitted to the sender of the message. The methods may be implemented using device-executable instructions which may be stored on device readable media.

    摘要翻译: 描述了查询大量终端系统的方法,其中元数据在终端系统之间复制。 当注入查询时,可用的末端系统接收与该查询有关的消息,其识别该可用端系统所负责的终端系统的范围。 可用的终端系统然后基于存储在终端系统中的数据生成用于终端系统的范围的完整性数据,并且该完整性数据被发送到消息的发送者。 可以使用可以存储在设备可读介质上的设备可执行指令来实现这些方法。

    Resource Optimization for Online Services
    5.
    发明申请
    Resource Optimization for Online Services 有权
    在线服务的资源优化

    公开(公告)号:US20120158858A1

    公开(公告)日:2012-06-21

    申请号:US12969963

    申请日:2010-12-16

    IPC分类号: G06F15/16 G06F15/173

    CPC分类号: H04L51/22

    摘要: Resource optimization for online services is described. In one example, objects (such as mailboxes or other data associated with an online service) are assigned to network elements (such as servers) by inferring a relationship graph from log data relating to usage of the online service. The graph has a node for each object, and connections between each pair of objects having data items in common. Each connection has a weight relating to the number of common data items. The graph is partitioned into a set of clusters, such that each cluster has nodes joined by connections with a high weight relative to the weight of connections between nodes in different clusters. The objects are then distributed to the network elements such that objects corresponding to nodes in the same cluster are located on the same network element.

    摘要翻译: 描述了在线服务的资源优化。 在一个示例中,通过从与在线服务的使用相关的日志数据推断关系图,将对象(诸如邮箱或与在线服务相关联的其他数据)分配给网络元件(例如服务器)。 该图具有每个对象的节点,以及具有共同数据项的每对对象之间的连接。 每个连接具有与公共数据项的数量相关的权重。 该图被划分成一组集群,使得每个集群具有通过具有相对于不同集群中的节点之间的连接的权重的高权重的连接而连接的节点。 然后将对象分发到网络元件,使得与相同集群中的节点相对应的对象位于同一网络元件上。

    PREDICTING DATABASE SYSTEM PERFORMANCE
    6.
    发明申请
    PREDICTING DATABASE SYSTEM PERFORMANCE 审中-公开
    预测数据库系统性能

    公开(公告)号:US20110288847A1

    公开(公告)日:2011-11-24

    申请号:US13187211

    申请日:2011-07-20

    IPC分类号: G06F9/455

    摘要: A prediction system may perform capacity planning for one or more resources of a database systems, such as by understanding how different workloads are using the system resources and/or predicting how the performance of the workloads will change when the hardware configuration of the resource is changed and/or when the workload changes. The prediction system may use a detailed, low-level tracing of a live database system running an application workload to monitor the performance of the current database system. In this manner, the current monitoring traces and analysis may be combined with a simulation to predict the workload's performance on a different hardware configuration. More specifically, performance may be indicated as throughput and/or latency, which may be for all transactions, for a particular transaction type, and/or for an individual transaction. Database system performance prediction may include instrumentation and tracing, demand trace extraction, cache simulation, disk scaling, CPU scaling, background activity prediction, throughput analysis, latency analysis, visualization, optimization, and the like.

    摘要翻译: 预测系统可以对数据库系统的一个或多个资源执行容量规划,例如通过了解不同的工作负载如何使用系统资源和/或预测当资源的硬件配置被改变时工作负载的性能将如何改变 和/或工作负载变化时。 预测系统可以使用运行应用程序工作负载的实时数据库系统的详细的低级跟踪来监视当前数据库系统的性能。 以这种方式,当前的监控跟踪和分析可以与模拟结合,以预测不同硬件配置上的工作负载性能。 更具体地,可以将性能指示为吞吐量和/或延迟,其可以针对特定交易类型和/或单个交易的所有交易。 数据库系统性能预测可能包括仪器跟踪,需求跟踪提取,缓存模拟,磁盘缩放,CPU缩放,后台活动预测,吞吐量分析,延迟分析,可视化,优化等。

    Resource optimization for online services
    7.
    发明授权
    Resource optimization for online services 有权
    在线服务资源优化

    公开(公告)号:US08819236B2

    公开(公告)日:2014-08-26

    申请号:US12969963

    申请日:2010-12-16

    IPC分类号: G06F15/16 G06F15/173

    CPC分类号: H04L51/22

    摘要: Resource optimization for online services is described. In one example, objects (such as mailboxes or other data associated with an online service) are assigned to network elements (such as servers) by inferring a relationship graph from log data relating to usage of the online service. The graph has a node for each object, and connections between each pair of objects having data items in common. Each connection has a weight relating to the number of common data items. The graph is partitioned into a set of clusters, such that each cluster has nodes joined by connections with a high weight relative to the weight of connections between nodes in different clusters. The objects are then distributed to the network elements such that objects corresponding to nodes in the same cluster are located on the same network element.

    摘要翻译: 描述了在线服务的资源优化。 在一个示例中,通过从与在线服务的使用相关的日志数据推断关系图,将对象(诸如邮箱或与在线服务相关联的其他数据)分配给网络元件(例如服务器)。 该图具有每个对象的节点,以及具有共同数据项的每对对象之间的连接。 每个连接具有与公共数据项的数量相关的权重。 该图被划分成一组集群,使得每个集群具有通过具有相对于不同集群中的节点之间的连接的权重的高权重的连接而连接的节点。 然后将对象分发到网络元件,使得与相同集群中的节点相对应的对象位于同一网络元件上。

    Generating Filters Automatically From Data Processing Jobs
    8.
    发明申请
    Generating Filters Automatically From Data Processing Jobs 有权
    从数据处理作业自动生成过滤器

    公开(公告)号:US20130152088A1

    公开(公告)日:2013-06-13

    申请号:US13324202

    申请日:2011-12-13

    IPC分类号: G06F9/46 G06F17/30

    摘要: Methods of generating filters automatically from data processing jobs are described. In an embodiment, these filters are automatically generated from a compiled version of the data processing job using static analysis which is applied to a high-level representation of the job. The executable filter is arranged to suppress rows and/or columns within the data to which the job is applied and which do not affect the output of the job. The filters are generated by a filter generator and then stored and applied dynamically at a filtering proxy that may be co-located with the storage node that holds the data. In another embodiment, the filtered data may be cached close to a compute node which runs the job and data may be provided to the compute node from the local cache rather than from the filtering proxy.

    摘要翻译: 描述从数据处理作业自动生成过滤器的方法。 在一个实施例中,这些过滤器使用静态分析从应用于作业的高级表示的数据处理作业的编译版本自动生成。 可执行过滤器被设置为抑制作业所应用的数据内的行和/或列,并且不影响作业的输出。 过滤器由过滤器生成器生成,然后在过滤代理处动态存储和应用,过滤代理可能与保存数据的存储节点位于同一位置。 在另一个实施例中,经过过滤的数据可以被缓存在运行作业的计算节点附近,并且可以从本地高速缓存而不是从过滤代理将数据提供给计算节点。

    Infrastructure to disseminate queries and provide query results
    9.
    发明授权
    Infrastructure to disseminate queries and provide query results 有权
    基础设施传播查询和提供查询结果

    公开(公告)号:US08375141B2

    公开(公告)日:2013-02-12

    申请号:US11537476

    申请日:2006-09-29

    IPC分类号: G06F15/173

    CPC分类号: G06F17/30442

    摘要: Methods of querying a large number of endsystems are described in which metadata is replicated between endsystems. When a query is injected, an available endsystem receives a message relating to the query which identifies a range of endsystems for which that available endsystem is responsible. The available endsystem then generates completeness data for the range of endsystems based on data stored at the endsystem and this completeness data is transmitted to the sender of the message. The methods may be implemented using device-executable instructions which may be stored on device readable media.

    摘要翻译: 描述了查询大量终端系统的方法,其中元数据在终端系统之间复制。 当注入查询时,可用的末端系统接收与该查询有关的消息,其识别该可用端系统所负责的终端系统的范围。 可用的终端系统然后基于存储在终端系统中的数据生成用于终端系统的范围的完整性数据,并且该完整性数据被发送到消息的发送者。 可以使用可以存储在设备可读介质上的设备可执行指令来实现这些方法。