Adaptive vicinity prefetching for filesystem metadata

    公开(公告)号:US20060080401A1

    公开(公告)日:2006-04-13

    申请号:US10953243

    申请日:2004-09-29

    申请人: Binny Gill Leo Luan

    发明人: Binny Gill Leo Luan

    IPC分类号: G06F15/16

    摘要: Network latencies are reduced by detecting a metadata access call for filesystem metadata contained in a filesystem node of remotely located filesystem. The metadata corresponding to the metadata access call is fetched when the metadata corresponding to the metadata access call is not contained in a local filesystem cache that has a hierarchical structure corresponding to a hierarchical structure of the remotely located filesystem. Metadata related to the metadata corresponding to the metadata access call from the remotely located filesystem is prefetched when the related metadata is not contained in the local filesystem cache. The metadata related to the metadata corresponding to the access call can be contained in a child node or a parent node of the node containing the metadata corresponding to the access call, or the descendants of such nodes.

    Remote access agent for caching in a SAN file system
    2.
    发明申请
    Remote access agent for caching in a SAN file system 失效
    用于在SAN文件系统中缓存的远程访问代理

    公开(公告)号:US20060004765A1

    公开(公告)日:2006-01-05

    申请号:US10864605

    申请日:2004-06-10

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30067

    摘要: Addressed is a system and method for remote data caching and replication by local copy maintenance of remote data within a SAN file system. Distributed Storage Tank (DST), an extension to a SAN file system, provides for transparent SAN client access of local copies by importing, exporting, and storing data using network file access protocols as well as by providing assurance of metadata and file content validity. A Remote Access Agent (RAA) handles protocol implementation and conversion necessary for communication with remote data sources. Controlled by a consistency policy, consistency is maintained by RAA fetching and updating local copies if modifications have occurred to a file since it was first stored as a local copy in local storage. Additionally, RAA returns metadata pertaining to the requested data. A SAN client obtains metadata corresponding to the requested data and utilizes it to directly access locally stored copies of remote data.

    摘要翻译: 解决的是通过SAN文件系统中的远程数据的本地复制维护来进行远程数据缓存和复制的系统和方法。 分布式存储库(DST)是SAN文件系统的扩展,通过使用网络文件访问协议导入,导出和存储数据以及提供元数据和文件内容有效性的保证,为本地副本的透明SAN客户端访问提供了便利。 远程访问代理(RAA)处理与远程数据源通信所需的协议实现和转换。 由一致性策略控制,由于RAA首次将其作为本地副本存储在本地存储中,因此RAA会在文件发生修改时,获取和更新本地副本,从而保持一致性。 此外,RAA返回与所请求数据有关的元数据。 SAN客户端获取与所请求数据相对应的元数据,并利用它来直接访问本地存储的远程数据副本。

    Decoupling storage controller cache read replacement from write retirement
    3.
    发明申请
    Decoupling storage controller cache read replacement from write retirement 审中-公开
    解除存储控制器缓存从写入退出中读取替换

    公开(公告)号:US20070118695A1

    公开(公告)日:2007-05-24

    申请号:US11282157

    申请日:2005-11-18

    IPC分类号: G06F12/00 G06F13/00

    摘要: In a data storage controller, accessed tracks are temporarily stored in a cache, with write data being stored in a first cache (such as a volatile cache) and a second cache and read data being stored in a second cache (such as a non-volatile cache). Corresponding least recently used (LRU) lists are maintained to hold entries identifying the tracks stored in the caches. When the list holding entries for the first cache (the A list) is full, the list is scanned to identify unmodified (read) data which can be discarded from the cache to make room for new data. Prior to or during the scan, modified (write) data entries are moved to the most recently used (MRU) end of the list, allowing the scans to proceed in an efficient manner and reducing the number of times the scan has to skip over modified entries Optionally, a status bit may be associated with each modified data entry. When the modified entry is moved to the MRU end of the A list without being requested to be read, its status bit is changed from an initial state (such as 0) to a second state (such as 1), indicating that it is a candidate to be discarded. If the status bit is already set to the second state (such as 1), then it is left unchanged. If a modified track is moved to the MRU end of the A list as a result of being requested to be read, the status bit of the corresponding A list entry is changed back to the first state, preventing the track from being discarded. Thus, write tracks are allowed to remain in the first cache only as long as necessary.

    摘要翻译: 在数据存储控制器中,将访问的轨道临时存储在高速缓存中,其中写入数据被存储在第一高速缓存(例如易失性高速缓存)和第二高速缓存中,并且将读取的数据存储在第二高速缓存(例如, 易失性缓存)。 保持相应的最近使用的(LRU)列表来保存标识存储在高速缓存中的轨道的条目。 当保存第一个缓存(A列表)的条目的列表已满时,将扫描列表以识别可以从缓存中丢弃的未修改(读取)数据,为新数据腾出空间。 在扫描之前或期间,修改(写入)数据条目移动到列表的最近使用(MRU)端,允许扫描以有效的方式继续进行,并减少扫描必须跳过修改的次数 条目可选地,状态位可以与每个修改的数据条目相关联。 当修改的条目移动到A列表的MRU结尾而不被请求读取时,其状态位从初始状态(例如0)改变到第二状态(例如1),表示它是 候选人被丢弃。 如果状态位已经设置为第二个状态(如1),那么它将保持不变。 如果作为被请求读取的结果将经修改的轨道移动到A列表的MRU端,则相应的A列表条目的状态位被改回到第一状态,防止轨道被丢弃。 因此,仅在需要时才允许写轨迹保留在第一缓存中。

    Method and system for adaptive back-off and advance for non-volatile storage (NVS) occupancy level management
    4.
    发明申请
    Method and system for adaptive back-off and advance for non-volatile storage (NVS) occupancy level management 有权
    用于非易失性存储(NVS)占用级别管理的自适应退避和提前的方法和系统

    公开(公告)号:US20070250660A1

    公开(公告)日:2007-10-25

    申请号:US11407797

    申请日:2006-04-20

    IPC分类号: G06F12/00

    摘要: A technique for determining when to destage write data from a fast, NVS of a computer system from an upper level to a lower level of storage in the computer system comprises adaptively varying a destage rate of the NVS according to a current storage occupancy of the NVS; maintaining a high threshold level for the NVS; maintaining a low threshold level that is set to be a predetermined fixed amount below the high threshold; setting the destage rate of the NVS to zero when the NVS occupancy is below the low threshold; setting the destage rate of the NVS to be maximum when the NVS occupancy is above the high threshold; linearly increasing the destage rate of the NVS from zero to maximum as the NVS occupancy goes from the low to the high threshold; and adaptively varying the high threshold in response to a dynamic computer storage workload.

    摘要翻译: 一种用于确定何时从计算机系统的快速NVS将计算机系统的写入数据从计算机系统中的较高级别存储到较低级别的存储装置的技术包括根据NVS的当前存储占用自适应地改变NVS的流率 ; 维持NVS的高门槛值; 保持低阈值水平,其被设置为低于高阈值的预定固定量; 当NVS占用率低于低阈值时,将NVS的流出率设置为零; 当NVS占用率高于高阈值时,将NVS的流出率设置为最大值; 随着NVS占用率从低到高的阈值,将NVS的流失率从零线性上升到最大值; 以及响应于动态计算机存储工作负载自适应地改变高阈值。

    Wise ordering for writes - combining spatial and temporal locality in write caches for multi-rank storage
    5.
    发明申请
    Wise ordering for writes - combining spatial and temporal locality in write caches for multi-rank storage 有权
    明智的订单写入 - 将空间和时间局部性组合在多级存储的写缓存中

    公开(公告)号:US20070220200A1

    公开(公告)日:2007-09-20

    申请号:US11384890

    申请日:2006-03-20

    IPC分类号: G06F13/00 G06F12/00

    摘要: A storage system has a storage controller for an array of storage disks, the array being ordered in an sequence of write groups. A write cache is shared by the disks. The storage controller temporarily stores write groups in the write cache responsive to write groups being written to their respective arrays. The write groups are assigned to a global queue ordered by ages. The controller selects a quantity of write groups for attempted destaging to the arrays responsive to a predetermined high threshold for the global queue and to sizes and the ages of the write groups in the global queue, and allocates the selected quantity among the arrays responsive to quantities of certain ones of the write groups in the global queue. Write groups are destaged to respective arrays responsive to the selected allocation quantity for the array and the sequences of the write groups in the arrays.

    摘要翻译: 存储系统具有用于存储盘阵列的存储控制器,阵列以写入组的顺序排序。 写缓存由磁盘共享。 存储控制器响应于被写入它们各自的阵列的写入组而将写入组临时存储在写入高速缓存中。 写组被分配给按年龄排序的全局队列。 控制器响应于全局队列的预定高阈值以及全局队列中的写组的大小和年龄来选择尝试向阵列尝试次数的写组,并且响应于数量来分配阵列中的所选数量 的全局队列中的某些写入组。 响应于阵列的选定分配量和阵列中的写入组的序列,写入组将转移到各个阵列。

    Minimizing resynchronization time after backup system failures in an appliance-based business continuance architecture
    6.
    发明申请
    Minimizing resynchronization time after backup system failures in an appliance-based business continuance architecture 有权
    在基于设备的业务连续性体系结构中,在备份系统故障后最大限度地减少重新同步时间

    公开(公告)号:US20050273654A1

    公开(公告)日:2005-12-08

    申请号:US10834502

    申请日:2004-04-28

    申请人: Ying Chen Binny Gill

    发明人: Ying Chen Binny Gill

    IPC分类号: G06F12/00 G06F11/00 G06F11/07

    摘要: A system for minimizing downtime in an appliance-based business continuance architecture is provided. The system includes at least one primary data storage and least one primary host machine. The system includes an intercept agent to intercept primary host machine data requests, and to collect information associated with the intercepted data requests. Moreover, at least one business continuance appliance in communication with the primary host machine and in communication with a remote backup site is provided. The appliance receives information associated with the intercepted data requests from the intercept agent. In addition, a local cache is included within the business continuance appliance. The local cache maintains copies of primary data storage according to the information received. Furthermore, the remote site is provided with the intercepted data requests via the business continuance appliance, wherein the remote site maintains a backup of the primary data storage.

    摘要翻译: 提供了一种用于最小化基于设备的业务连续性架构中的停机时间的系统。 该系统包括至少一个主数据存储和至少一个主要主机。 该系统包括截取主机数据请求的拦截代理,并收集与拦截的数据请求相关的信息。 此外,提供了与主主机通信并与远程备份站点通信的至少一个业务连续性设备。 设备从拦截代理接收与拦截的数据请求相关联的信息。 此外,本地缓存包含在业务连续设备中。 本地缓存根据收到的信息维护主数据存储的副本。 此外,通过业务连续性设备向远程站点提供截获的数据请求,其中远程站点维护主数据存储的备份。

    Wise ordering for writes-combining spatial and temporal locality in write caches
    7.
    发明申请
    Wise ordering for writes-combining spatial and temporal locality in write caches 失效
    写入高速缓存中的空间和时间局部性的写入 - 明智的排序

    公开(公告)号:US20070220201A1

    公开(公告)日:2007-09-20

    申请号:US11384939

    申请日:2006-03-20

    IPC分类号: G06F13/00 G06F12/00

    摘要: A storage system has a storage controller for an array of storage disks, the array being ordered in an sequence of write groups. A write cache is shared by the disks. The storage controller temporarily stores write groups in the write cache, responsive to write groups being written, and lists the write groups in order of their sequence in the array and in circular fashion, so that a lowest is listed next to a highest one of the write groups. The storage controller selects the listed write groups in rotating sequence. Such a write group is destaged from the write cache to the disk responsive to i) the selecting of the write group and ii) a state of a recency indicator for the write group, wherein the recency indicator shows recency of writing to the write group.

    摘要翻译: 存储系统具有用于存储盘阵列的存储控制器,阵列以写入组的顺序排序。 写缓存由磁盘共享。 存储控制器根据写入的组写入临时存储写入组中的写入组,并且按顺序在数组中以循环方式列出写入组,使得最低级别列在 写组。 存储控制器以旋转顺序选择列出的写入组。 响应于i)写入组的选择和ii)写入组的新近度指示符的状态,这样一个写入组从写入高速缓冲存储器移动到盘中,其中新近度指示符表示对写入组的写入的新近。

    Minimizing system downtime through intelligent data caching in an appliance-based business continuance architecture
    9.
    发明申请
    Minimizing system downtime through intelligent data caching in an appliance-based business continuance architecture 失效
    通过基于设备的业务连续性架构中的智能数据缓存来最大限度地减少系统停机时间

    公开(公告)号:US20050246575A1

    公开(公告)日:2005-11-03

    申请号:US10835299

    申请日:2004-04-28

    IPC分类号: G06F12/16

    摘要: A system for minimizing downtime in an appliance-based business continuance architecture is provided. The system includes at least one primary data storage and least one primary host machine. The system includes an intercept agent to intercept primary host machine data requests, and to collect information associated with the intercepted data requests. Moreover, at least one business continuance appliance in communication with the primary host machine and in communication with a remote backup site is provided. The appliance receives information associated with the intercepted data requests from the intercept agent. In addition, a local cache is included within the business continuance appliance. The local cache maintains copies of primary data storage according to the information received. Furthermore, the remote site is provided with the intercepted data requests via the business continuance appliance, wherein the remote site maintains a backup of the primary data storage.

    摘要翻译: 提供了一种用于最小化基于设备的业务连续性架构中的停机时间的系统。 该系统包括至少一个主数据存储和至少一个主要主机。 该系统包括截取主机数据请求的拦截代理,并收集与拦截的数据请求相关的信息。 此外,提供了与主主机通信并与远程备份站点通信的至少一个业务连续性设备。 设备从拦截代理接收与拦截的数据请求相关联的信息。 此外,本地缓存包含在业务连续设备中。 本地缓存根据收到的信息维护主数据存储的副本。 此外,通过业务连续性设备向远程站点提供截获的数据请求,其中远程站点维护主数据存储的备份。

    Apparatus and method to manage a data cache
    10.
    发明申请
    Apparatus and method to manage a data cache 有权
    用于管理数据高速缓存的设备和方法

    公开(公告)号:US20060080510A1

    公开(公告)日:2006-04-13

    申请号:US10964474

    申请日:2004-10-12

    IPC分类号: G06F12/00

    CPC分类号: G06F12/123 G06F12/0866

    摘要: A method is disclosed to manage a data cache. The method provides a data cache comprising a plurality of tracks, where each track comprises one or more segments. The method further maintains a first LRU list comprising one or more first tracks having a low reuse potential, maintains a second LRU list comprising one or more second tracks having a high reuse potential, and sets a target size for the first LRU list. The method then accesses a track, and determines if that accessed track comprises a first track. If the method determines that the accessed track comprises a first track, then the method increases the target size for said first LRU list. Alternatively, if the method determines that the accessed track comprises a second track, then the method decreases the target size for said first LRU list. The method demotes tracks from the first LRU list if its size exceeds the target size; otherwise, the method evicts tracks from the second LRU list.

    摘要翻译: 公开了一种管理数据高速缓存的方法。 该方法提供包括多个轨道的数据高速缓存,其中每个轨道包括一个或多个段。 该方法还维护包括具有低再利用潜力的一个或多个第一轨道的第一LRU列表,维护包括具有高重用潜力的一个或多个第二轨道的第二LRU列表,并设置第一LRU列表的目标大小。 然后,该方法访问轨道,并且确定所访问的轨道是否包括第一轨道。 如果方法确定所访问的轨道包括第一轨道,则该方法增加所述第一LRU列表的目标大小。 或者,如果该方法确定所访问的轨道包括第二轨道,则该方法减小所述第一LRU列表的目标大小。 该方法如果其大小超过目标大小,则会从第一个LRU列表中降低轨道; 否则,该方法从第二LRU列表中逐出轨道。