PREDICTIVE PREFETCHING TO IMPROVE PARALLELIZATION OF DATA RETRIEVAL SUBTASKS
    1.
    发明申请
    PREDICTIVE PREFETCHING TO IMPROVE PARALLELIZATION OF DATA RETRIEVAL SUBTASKS 有权
    预测性改进以提高数据检索率的并行化

    公开(公告)号:US20120151000A1

    公开(公告)日:2012-06-14

    申请号:US13399781

    申请日:2012-02-17

    IPC分类号: G06F15/16

    摘要: In a system in which documents are generated dynamically in response to user requests, historical data is collected regarding data retrieval subtasks, such as service requests, that are performed to generate such documents. This data is used to predict the specific subtasks that will be performed to respond to specific document requests, such that these subtasks may be initiated preemptively at or near the outset of the associated document generation task. A subtask that would ordinarily be postponed pending the outcome of a prior subtask can thereby be performed in parallel with the prior subtask, reducing document generation times. In one embodiment, the historical data is included within, or is used to generate, a mapping table that maps document generation tasks (which may correspond to specific URLs) to the data retrieval subtasks that are frequently performed within such tasks.

    摘要翻译: 在响应于用户请求而动态地生成文档的系统中,收集关于用于生成这样的文档而执行的数据检索子任务(例如服务请求)的历史数据。 该数据用于预测将被执行以响应特定文档请求的特定子任务,使得这些子任务可以在相关联的文档生成任务的开始或接近开始时被预先启动。 因此,可以在先前子任务的结果之前通常推迟等待先前子任务的结果的子任务,从而减少文件生成时间。 在一个实施例中,将历史数据包括在或用于生成将文档生成任务(其可以对应于特定URL)映射到在这些任务内经常执行的数据检索子任务的映射表。

    Predictive prefetching to improve parallelization of data retrieval subtasks
    2.
    发明授权
    Predictive prefetching to improve parallelization of data retrieval subtasks 有权
    预测性预取,以提高数据检索子任务的并行化

    公开(公告)号:US08566788B2

    公开(公告)日:2013-10-22

    申请号:US13399781

    申请日:2012-02-17

    摘要: In a system in which documents are generated dynamically in response to user requests, historical data is collected regarding data retrieval subtasks, such as service requests, that are performed to generate such documents. This data is used to predict the specific subtasks that will be performed to respond to specific document requests, such that these subtasks may be initiated preemptively at or near the outset of the associated document generation task. A subtask that would ordinarily be postponed pending the outcome of a prior subtask can thereby be performed in parallel with the prior subtask, reducing document generation times. In one embodiment, the historical data is included within, or is used to generate, a mapping table that maps document generation tasks (which may correspond to specific URLs) to the data retrieval subtasks that are frequently performed within such tasks.

    摘要翻译: 在响应于用户请求而动态地生成文档的系统中,收集关于用于生成这样的文档而执行的数据检索子任务(例如服务请求)的历史数据。 该数据用于预测将被执行以响应特定文档请求的特定子任务,使得这些子任务可以在相关联的文档生成任务的开始或接近开始时被预先启动。 因此,可以在先前子任务的结果之前通常推迟等待先前子任务的结果的子任务,从而减少文件生成时间。 在一个实施例中,将历史数据包括在或用于生成将文档生成任务(其可以对应于特定URL)映射到在这些任务内经常执行的数据检索子任务的映射表。

    Predictive prefetching to improve parallelization of document generation subtasks
    3.
    发明授权
    Predictive prefetching to improve parallelization of document generation subtasks 有权
    预测性预取来提高文档生成子任务的并行化

    公开(公告)号:US08136089B2

    公开(公告)日:2012-03-13

    申请号:US11954175

    申请日:2007-12-11

    摘要: In a system in which documents are generated dynamically in response to user requests, historical data is collected regarding data retrieval subtasks, such as service requests, that are performed to generate such documents. This data is used to predict the specific subtasks that will be performed to respond to specific document requests, such that these subtasks may be initiated preemptively at or near the outset of the associated document generation task. A subtask that would ordinarily be postponed pending the outcome of a prior subtask can thereby be performed in parallel with the prior subtask, reducing document generation times. In one embodiment, the historical data is included within, or is used to generate, a mapping table that maps document generation tasks (which may correspond to specific URLs) to the data retrieval subtasks that are frequently performed within such tasks.

    摘要翻译: 在响应于用户请求而动态地生成文档的系统中,收集关于用于生成这样的文档而执行的数据检索子任务(例如服务请求)的历史数据。 该数据用于预测将被执行以响应特定文档请求的特定子任务,使得这些子任务可以在相关联的文档生成任务的开始或接近开始时被预先启动。 因此,可以在先前子任务的结果之前通常推迟等待先前子任务的结果的子任务,从而减少文件生成时间。 在一个实施例中,将历史数据包括在或用于生成将文档生成任务(其可以对应于特定URL)映射到在这些任务内经常执行的数据检索子任务的映射表。

    Predictive prefetching to improve parallelization of document generation subtasks
    4.
    发明授权
    Predictive prefetching to improve parallelization of document generation subtasks 有权
    预测性预取来提高文档生成子任务的并行化

    公开(公告)号:US07331038B1

    公开(公告)日:2008-02-12

    申请号:US10612529

    申请日:2003-07-02

    IPC分类号: G06F9/44 G06F3/00

    摘要: In a system in which documents are generated dynamically in response to user requests, historical data is collected regarding data retrieval subtasks, such as service requests, that are performed to generate such documents. This data is used to predict the specific subtasks that will be performed to respond to specific document requests, such that these subtasks may be initiated preemptively at or near the outset of the associated page generation task. A subtask that would ordinarily be postponed pending the outcome of a prior subtask can therefore be performed in parallel with the prior subtask, reducing document generation times. In one embodiment, the historical data is included within, or is used to generate, a mapping table that maps document generation tasks (which may correspond to specific URLs) to the data retrieval subtasks that are frequently performed within such tasks.

    摘要翻译: 在响应于用户请求而动态地生成文档的系统中,收集关于用于生成这样的文档而执行的数据检索子任务(例如服务请求)的历史数据。 该数据用于预测将被执行以响应特定文档请求的特定子任务,使得这些子任务可以在关联的页面生成任务的开始或接近开始时被初始化地发起。 因此,可以与先前的子任务并行地执行通常在先前子任务的结果之前推迟的子任务,从而减少了文档生成时间。 在一个实施例中,将历史数据包括在或用于生成将文档生成任务(其可以对应于特定URL)映射到在这些任务内经常执行的数据检索子任务的映射表。

    Archival data organization and management
    5.
    发明授权
    Archival data organization and management 有权
    存档数据组织与管理

    公开(公告)号:US09092441B1

    公开(公告)日:2015-07-28

    申请号:US13570029

    申请日:2012-08-08

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30073 G06F17/30221

    摘要: Methods and systems are provided herein that facilitate cost-effective, scalable and reliable archival data organization and management. In an embodiment, data are redundantly encoded and stored to provide data reliability. Further, encoded data may be stored in self-describing storage entities that provide information describing data stored therein. Information provided by self-describing storage entities may be used to construct a limited map that is usable to facilitate data placement and data location services during data storage and retrieval. Data reliability and durability is provided because information about data stored in the system is mostly contained in the storage entities themselves. Cost efficiency is provided because only a limited map is provided for efficiency purposes instead of a potentially large global index data structure.

    摘要翻译: 本文提供的方法和系统便于成本有效,可扩展和可靠的归档数据组织和管理。 在一个实施例中,对数据进行冗余编码和存储以提供数据可靠性。 此外,编码数据可以存储在提供描述存储在其中的数据的信息的自描述存储实体中。 由自描述存储实体提供的信息可用于构建有限地图,其可用于在数据存储和检索期间促进数据放置和数据定位服务。 提供数据可靠性和耐用性,因为关于存储在系统中的数据的信息主要包含在存储实体本身中。 提供成本效率,因为仅为有效的目的而不是潜在的大型全局索引数据结构提供有限的映射。

    Hard disk drive assembly with field-separable mechanical module and drive control
    6.
    发明授权
    Hard disk drive assembly with field-separable mechanical module and drive control 有权
    具有现场可分离机械模块和驱动控制的硬盘驱动器组件

    公开(公告)号:US08929024B1

    公开(公告)日:2015-01-06

    申请号:US13430246

    申请日:2012-03-26

    IPC分类号: G11B33/08

    摘要: A system for storing data includes a rack, one or more data storage drive assemblies coupled to the rack, and a data control module coupled to the rack. The data storage drive assemblies include one or more drive mechanical modules configured to store data and one or more drive control modules coupled to the drive mechanical modules. The drive control modules control mechanical operations in the drive mechanical modules. The drive mechanical modules and the associated drive control modules are separable from one another without removing the other module from the at least one data storage drive assembly.

    摘要翻译: 用于存储数据的系统包括机架,耦合到机架的一个或多个数据存储驱动器组件和耦合到机架的数据控制模块。 数据存储驱动器组件包括被配置为存储数据的一个或多个驱动机械模块和耦合到驱动机械模块的一个或多个驱动器控制模块。 驱动控制模块控制驱动机械模块中的机械操作。 驱动机械模块和相关联的驱动器控制模块可彼此分离,而不从至少一个数据存储驱动组件移除另一个模块。

    Providing service quality levels through CPU scheduling
    7.
    发明授权
    Providing service quality levels through CPU scheduling 有权
    通过CPU调度提供服务质量水平

    公开(公告)号:US08918784B1

    公开(公告)日:2014-12-23

    申请号:US12975211

    申请日:2010-12-21

    IPC分类号: G06F9/455 G06F9/46 G06F9/50

    摘要: In this disclosure, a resource scheduler is described that allows virtual machine instances to earn resource credits during the low activity levels. Virtual machine instances that spend a predominant amount of time operating at low activity levels are able to quickly gain resource credits. Once these virtual machine instances acquire enough resource credits to surpass a threshold level, the resource scheduler can assign a high priority level to the virtual machine instances that provide them with priority access to CPU resources. The next time that the virtual machine instances enter a high activity level, they have a high priority level that allows them to preempt other, lower priority virtual machine instances. Thus, these virtual machine instances are able to process operations and/or respond to user requests with low latency.

    摘要翻译: 在本公开中,描述了允许虚拟机实例在低活动级别期间获得资源信用的资源调度器。 花费大量时间运作在低活动水平的虚拟机实例能够快速获得资源信用。 一旦这些虚拟机实例获得足够的资源信用以超过阈值级别,资源调度器就可以为虚拟机实例分配高优先权级别,从而为其提供对CPU资源的优先访问。 虚拟机实例下一次进入高活动级别时,它们具有高优先级,允许它们抢占其他优先级较低的虚拟机实例。 因此,这些虚拟机实例能够以低延迟处理操作和/或响应用户请求。

    Throughput-sensitive redundancy encoding schemes for data storage
    8.
    发明授权
    Throughput-sensitive redundancy encoding schemes for data storage 有权
    用于数据存储的吞吐量敏感冗余编码方案

    公开(公告)号:US08850288B1

    公开(公告)日:2014-09-30

    申请号:US13535306

    申请日:2012-06-27

    IPC分类号: G11C29/00 G06F11/08 G06F11/10

    摘要: Techniques for optimizing data storage are disclosed herein. In particular, methods and systems for implementing redundancy encoding schemes with data storage systems are described. The redundancy encoding schemes may be scheduled according to system and data characteristics. The schemes may span multiple tiers or layers of a storage system. The schemes may be generated, for example, in accordance with a transaction rate requirement, a data durability requirement or in the context of the age of the stored data. The schemes may be designed to rectify entropy-related effects upon data storage. The schemes may include one or more erasure codes or erasure coding schemes. Additionally, methods and systems for improving and/or accounting for failure correlation of various components of the storage system, including that of storage devices such as hard disk drives, are described.

    摘要翻译: 本文公开了用于优化数据存储的技术。 特别地,描述了用于实现具有数据存储系统的冗余编码方案的方法和系统。 可以根据系统和数据特性调度冗余编码方案。 这些方案可以跨越存储系统的多层或多层。 可以例如根据交易速率要求,数据持久性要求或者存储数据的年龄的上下文来生成这些方案。 这些方案可能被设计为纠正数据存储时的熵相关影响。 这些方案可以包括一个或多个擦除代码或擦除编码方案。 此外,描述了用于改进和/或计算存储系统的各种组件(包括诸如硬盘驱动器)的存储设备的故障相关性的方法和系统。

    Immortal instance type
    9.
    发明授权
    Immortal instance type 有权
    不朽实例类型

    公开(公告)号:US08701109B1

    公开(公告)日:2014-04-15

    申请号:US13366495

    申请日:2012-02-06

    申请人: Colin L. Lazier

    发明人: Colin L. Lazier

    IPC分类号: G06F9/455 G06F9/445 G06F12/00

    CPC分类号: G06F8/63 G06F9/45533

    摘要: Technologies are described herein for ensuring data in long-term storage will be accessible at a future date. Upon storing the data in long-term storage, a well-defined instance of data processing resources is created on a host computing platform for the installation and testing of a related application that is capable of accessing the stored data. Once testing of the related application is complete, a machine image is generated from the instance and stored with the data in the long-term storage. If access to the data stored in the long-term storage is required at a future date, the data and associated machine image may be retrieved, and a compatible instance of data processing resources created in which the machine image may be restored. The data in the long-term storage may then be accessed by the related applications executing in the newly created instance.

    摘要翻译: 本文描述了技术,以确保将来可以访问长期存储中的数据。 在将数据存储在长期存储中时,在主计算平台上创建良好定义的数据处理资源的实例,以便安装和测试能够访问所存储的数据的相关应用。 一旦相关应用程序的测试完成,就会从实例生成机器映像,并将其与长期存储器中的数据一起存储。 如果在将来的日期需要存储在长期存储器中的数据,则可以检索数据和相关联的机器图像,以及创建可以恢复机器图像的数据处理资源的兼容实例。 然后,可以通过在新创建的实例中执行的相关应用程序来访问长期存储中的数据。

    Correlated failure zones for data storage
    10.
    发明授权
    Correlated failure zones for data storage 有权
    相关的故障区域用于数据存储

    公开(公告)号:US09110797B1

    公开(公告)日:2015-08-18

    申请号:US13535281

    申请日:2012-06-27

    申请人: Colin L. Lazier

    发明人: Colin L. Lazier

    IPC分类号: G06F11/00 G06F11/07 G11B20/18

    摘要: Techniques for optimizing data storage are disclosed herein. In particular, methods and systems for implementing redundancy encoding schemes with data storage systems are described. The redundancy encoding schemes may be scheduled according to system and data characteristics. The schemes may span multiple tiers or layers of a storage system. The schemes may be generated, for example, in accordance with a transaction rate requirement, a data durability requirement or in the context of the age of the stored data. The schemes may be designed to rectify entropy-related effects upon data storage. The schemes may include one or more erasure codes or erasure coding schemes. Additionally, methods and systems for improving and/or accounting for failure correlation of various components of the storage system, including that of storage devices such as hard disk drives, are described.

    摘要翻译: 本文公开了用于优化数据存储的技术。 特别地,描述了用于实现具有数据存储系统的冗余编码方案的方法和系统。 可以根据系统和数据特性调度冗余编码方案。 这些方案可以跨越存储系统的多层或多层。 可以例如根据交易速率要求,数据持久性要求或者存储数据的年龄的上下文来生成这些方案。 这些方案可能被设计为纠正数据存储时的熵相关影响。 这些方案可以包括一个或多个擦除代码或擦除编码方案。 此外,描述了用于改进和/或计算存储系统的各种组件(包括诸如硬盘驱动器)的存储设备的故障相关性的方法和系统。