-
公开(公告)号:US08463762B2
公开(公告)日:2013-06-11
申请号:US12971322
申请日:2010-12-17
IPC分类号: G06F17/00
CPC分类号: G06F17/30115
摘要: The present invention extends to methods, systems, and computer program products for sharing volumes between clustered nodes. Embodiments of the invention include a Clustered Shared Volume File System (CsvFs) that appears to clients as a local file system. The CsvFs communicates to a node where a disk is mounted to coordinate access to files on the disks. CsvFs uses Opportunistic Locks (oplocks) to decide when direct access to a volume is safe. CsvFs can be extended with oplock upgrade mechanisms that allow a coordinating node to tell CsvFs when it is safe to attempt to upgrade oplock. CsvFs also uses a transitivity property of oplocks to be able to grant (delegate) oplocks to clients that are on top of CsvFs.
摘要翻译: 本发明扩展到用于在集群节点之间共享卷的方法,系统和计算机程序产品。 本发明的实施例包括作为本地文件系统向客户端显示的群集共享卷文件系统(CsvF)。 CsvF与安装磁盘的节点通信,以协调对磁盘上文件的访问。 CsvFs使用机会锁(oplocks)来决定直接访问卷是否安全。 可以使用oplock升级机制来扩展CsvF,允许协调节点在尝试升级oplock时安全地告诉CsvF。 CsvFs还使用oplock的传递性属性能够向位于CsvF之上的客户端授予(委托)oplock。
-
公开(公告)号:US20120158681A1
公开(公告)日:2012-06-21
申请号:US12971322
申请日:2010-12-17
IPC分类号: G06F17/30
CPC分类号: G06F17/30115
摘要: The present invention extends to methods, systems, and computer program products for sharing volumes between clustered nodes. Embodiments of the invention include a Clustered Shared Volume File System (CsvFs) that appears to clients as a local file system. The CsvFs communicates to a node where a disk is mounted to coordinate access to files on the disks. CsvFs uses Opportunistic Locks (oplocks) to decide when direct access to a volume is safe. CsvFs can be extended with oplock upgrade mechanisms that allow a coordinating node to tell CsvFs when it is safe to attempt to upgrade oplock. CsvFs also uses a transitivity property of oplocks to be able to grant (delegate) oplocks to clients that are on top of CsvFs.
摘要翻译: 本发明扩展到用于在集群节点之间共享卷的方法,系统和计算机程序产品。 本发明的实施例包括作为本地文件系统向客户端显示的群集共享卷文件系统(CsvF)。 CsvF与安装磁盘的节点通信,以协调对磁盘上文件的访问。 CsvFs使用机会锁(oplocks)来决定直接访问卷是否安全。 可以使用oplock升级机制来扩展CsvF,允许协调节点在尝试升级oplock时安全地告诉CsvF。 CsvFs还使用oplock的传递性属性能够向位于CsvF之上的客户端授予(委托)oplock。
-
公开(公告)号:US07840730B2
公开(公告)日:2010-11-23
申请号:US12147956
申请日:2008-06-27
申请人: Andrea D'Amato , Rajeev Y. Nagar , Gor Nishanov , Rajsekhar Das , Gregorio Maeso
发明人: Andrea D'Amato , Rajeev Y. Nagar , Gor Nishanov , Rajsekhar Das , Gregorio Maeso
IPC分类号: G06F19/00
CPC分类号: G06F3/0637 , G06F3/062 , G06F3/0635 , G06F3/067 , G06F11/1482 , G06F11/2035 , G06F11/2046 , G06F17/30091 , H04L67/1097 , H04L69/40
摘要: Described is a technology by which a storage volume is shared by cluster nodes of a server cluster. In one implementation, each node includes a redirector that provides shared access to the volume from that node. The redirector routes file system metadata requests from applications and the like through a first (e.g., SMB) communications path to the owning node, and routes file system read and write data to the storage device through a second, high-speed communications path such as direct direct block level I/O. An owning node maintains ownership of the storage device through a persistent reservation mechanism that writes a key to a registration table associated with the storage device. Non-owning nodes write a shared key. The owning node validates the shared keys against cluster membership data, and preempts (e.g., removes) any key deemed not valid. Security mechanisms for controlling access are also described.
摘要翻译: 描述了存储卷由服务器集群的集群节点共享的技术。 在一个实现中,每个节点包括重定向器,其向该节点提供对卷的共享访问。 重定向器通过第一(例如SMB)通信路径将应用程序等的文件系统元数据请求路由到所有者节点,并通过第二高速通信路径将文件系统读写数据路由到存储设备,例如 直接直接块级I / O。 拥有节点通过将密钥写入与存储设备相关联的注册表的持久预留机制来维护存储设备的所有权。 非拥有节点写共享密钥。 拥有节点根据集群成员数据验证共享密钥,并抢占(例如删除)任何被认为无效的密钥。 还描述了用于控制访问的安全机制。
-
公开(公告)号:US20090327798A1
公开(公告)日:2009-12-31
申请号:US12147956
申请日:2008-06-27
申请人: Andrea D'Amato , Rajeev Y. Nagar , Gor Nishanov , Rajsekhar Das , Gregorio Maeso
发明人: Andrea D'Amato , Rajeev Y. Nagar , Gor Nishanov , Rajsekhar Das , Gregorio Maeso
CPC分类号: G06F3/0637 , G06F3/062 , G06F3/0635 , G06F3/067 , G06F11/1482 , G06F11/2035 , G06F11/2046 , G06F17/30091 , H04L67/1097 , H04L69/40
摘要: Described is a technology by which a storage volume is shared by cluster nodes of a server cluster. In one implementation, each node includes a redirector that provides shared access to the volume from that node. The redirector routes file system metadata requests from applications and the like through a first (e.g., SMB) communications path to the owning node, and routes file system read and write data to the storage device through a second, high-speed communications path such as direct direct block level I/O. An owning node maintains ownership of the storage device through a persistent reservation mechanism that writes a key to a registration table associated with the storage device. Non-owning nodes write a shared key. The owning node validates the shared keys against cluster membership data, and preempts (e.g., removes) any key deemed not valid. Security mechanisms for controlling access are also described.
摘要翻译: 描述了存储卷由服务器集群的集群节点共享的技术。 在一个实现中,每个节点包括重定向器,其向该节点提供对卷的共享访问。 重定向器通过第一(例如SMB)通信路径将应用程序等的文件系统元数据请求路由到所有者节点,并通过第二高速通信路径将文件系统读写数据路由到存储设备,例如 直接直接块级I / O。 拥有节点通过将密钥写入与存储设备相关联的注册表的持久预留机制来维护存储设备的所有权。 非拥有节点写共享密钥。 拥有节点根据集群成员数据验证共享密钥,并抢占(例如删除)任何被认为无效的密钥。 还描述了用于控制访问的安全机制。
-
公开(公告)号:US08886609B2
公开(公告)日:2014-11-11
申请号:US12971981
申请日:2010-12-17
CPC分类号: G06F11/1461 , G06F11/1464 , G06F2201/82 , G06F2201/84 , H04L67/1097 , H04L67/28 , H04L69/40
摘要: Implementing point in time back-up. A method that may be practiced in a distributed computing environment. The distributed computing environment includes a number of computing system nodes. The method includes at a first node in the computing environment, taking a snapshot of a cluster volume. The cluster volume stores data items (e.g. files/blocks). Taking a snapshot includes marking a point in time for data items in the volume. The method further includes receiving a notification that a different node in the computing environment than the first node, has determined to change a data item in the cluster volume at a point after the marked point in time from an old version of the data item to a new version of the data item. The method further includes at the first node, saving the old version of the data item prior to modification.
摘要翻译: 实施时间点备份。 可以在分布式计算环境中实践的方法。 分布式计算环境包括许多计算系统节点。 该方法包括在计算环境中的第一个节点处,获取集群卷的快照。 集群卷存储数据项(例如文件/块)。 拍摄快照包括标记卷中数据项的时间点。 所述方法还包括接收与所述第一节点相比所述计算环境中的不同节点已经确定在所述标记时间点之后的点处将所述群集卷中的数据项从所述数据项的旧版本改变为 新版本的数据项。 该方法还包括在第一节点处,在修改之前保存数据项的旧版本。
-
公开(公告)号:US20130219125A1
公开(公告)日:2013-08-22
申请号:US13401104
申请日:2012-02-21
IPC分类号: G06F12/12
CPC分类号: G06F12/123 , G06F12/0871 , G06F12/127 , G06F2212/1016 , G06F2212/1048 , G06F2212/152 , G06F2212/282 , G06F2212/284 , G06F2212/311 , G06F2212/463
摘要: The present invention extends to methods, systems, and computer program products for implementing a cache using multiple page replacement algorithms. An exemplary cache can include two logical portions where the first portion implements the least recently used (LRU) algorithm and the second portion implements the least recently used two (LRU2) algorithm to perform page replacement within the respective portion. By implementing multiple algorithms, a more efficient cache can be implemented where the pages most likely to be accessed again are retained in the cache. Multiple page replacement algorithms can be used in any cache including an operating system cache for caching pages accessed via buffered I/O, as well as a cache for caching pages accessed via unbuffered I/O such as accesses to virtual disks made by virtual machines.
摘要翻译: 本发明扩展到用于使用多页替换算法实现高速缓存的方法,系统和计算机程序产品。 示例性高速缓存可以包括两个逻辑部分,其中第一部分实现最近最少使用(LRU)算法,并且第二部分实现最近最少使用的两个(LRU2)算法来在相应部分内执行页面替换。 通过实施多种算法,可以实现更有效的缓存,其中最有可能被重新访问的页面保留在高速缓存中。 可以在任何缓存中使用多页替换算法,包括用于缓存通过缓冲I / O访问的页面的操作系统缓存,以及用于缓存通过无缓冲I / O访问的页面的缓存,例如对由虚拟机进行的虚拟磁盘的访问。
-
公开(公告)号:US20120159005A1
公开(公告)日:2012-06-21
申请号:US12971997
申请日:2010-12-17
申请人: Andrea D'Amato , Vinod R. Shankar , Alan Warwick
发明人: Andrea D'Amato , Vinod R. Shankar , Alan Warwick
IPC分类号: G06F3/00
CPC分类号: G06F3/0659 , G06F3/0619 , G06F3/067
摘要: Coordinating methods of I/O access to a shared data store. A method includes at a node, in a distributed system, performing one or more I/O operations on the shared data store using direct I/O access on a virtual data container. Direct I/O access includes performing I/O operations directly from the node to the shared data store including not having an owner node perform the I/O operation on the shared data store on the node's behalf. The owner node is a different node than the node doing the direct I/O operation. The owner node accesses the shared data store through a local data container. The method further includes determining that one or more subsequent I/O operations should be performed using redirected I/O access, where I/O operations are directed through a centralized node. The method further includes indicating to nodes in the distributed system to switch to redirected I/O access method.
摘要翻译: 对共享数据存储的I / O访问的协调方法。 一种方法包括在分布式系统的节点处,在虚拟数据容器上使用直接I / O访问在共享数据存储器上执行一个或多个I / O操作。 直接I / O访问包括直接从节点到共享数据存储器的I / O操作,包括没有所有者节点代表节点在共享数据存储上执行I / O操作。 所有者节点与执行直接I / O操作的节点不同。 所有者节点通过本地数据容器访问共享数据存储。 该方法还包括确定使用重定向I / O访问来执行一个或多个后续I / O操作,其中I / O操作通过集中式节点引导。 该方法还包括向分布式系统中的节点指示切换到重定向的I / O访问方法。
-
公开(公告)号:US20130346532A1
公开(公告)日:2013-12-26
申请号:US13529872
申请日:2012-06-21
申请人: Andrea D'Amato , Vinod R. Shankar
发明人: Andrea D'Amato , Vinod R. Shankar
IPC分类号: G06F15/167
CPC分类号: G06F3/0617 , G06F3/0626 , G06F3/0635 , G06F3/0658 , G06F3/067 , G06F11/2079 , H04L67/1002
摘要: The present invention minimizes the cost of establishing a cluster that utilizes shared storage by creating a storage namespace within the cluster that makes each storage device, which is physically connected to any of the nodes in the cluster, appear to be physically connected to all nodes in the cluster. A virtual host bus adapter (VHBA) is executed on each node, and is used to create the storage namespace. Each VHBA determines which storage devices are physically connected to the node on which the VHBA executes, as well as each storage device that is physically connected to each of the other nodes. All storage devices determined in this manner are aggregated into the storage namespace which is then presented to the operating system on each node so as to provide the illusion that all storage devices in the storage namespace are physically connected to each node.
摘要翻译: 本发明通过在集群内创建存储命名空间来最小化建立利用共享存储的集群的成本,其使物理上连接到集群中的任何节点的每个存储设备看起来物理地连接到所有节点 集群。 虚拟主机总线适配器(VHBA)在每个节点上执行,用于创建存储命名空间。 每个VHBA确定哪些存储设备物理连接到VHBA执行的节点,以及物理连接到每个其他节点的每个存储设备。 以这种方式确定的所有存储设备都被聚合到存储命名空间中,然后将其呈现给每个节点上的操作系统,以提供存储命名空间中的所有存储设备物理连接到每个节点的错觉。
-
公开(公告)号:US20130166863A1
公开(公告)日:2013-06-27
申请号:US13333722
申请日:2011-12-21
IPC分类号: G06F12/16
CPC分类号: G06F11/1456 , G06F11/1446 , G06F11/1482 , G06F2201/82 , G06F2201/84
摘要: The present invention extends to methods, systems, and computer program products for creating a snapshot of a shared volume that is application consistent across various nodes of a cluster. The invention enables a snapshot of a volume to be initiated on one node which causes all applications in the cluster that use the volume to persist their data to the volume prior to the snapshot being created. Accordingly, the snapshot is application consistent to all applications in the cluster that use the volume. The invention also enables applications on various nodes to perform post snapshot processing on the created snapshot. The invention can be used in an existing backup system that is not cluster aware to enable the existing backup system to create application consistent snapshots of a volume shared by applications across multiple nodes of a cluster.
摘要翻译: 本发明扩展到用于创建共享卷的快照的方法,系统和计算机程序产品,该共享卷的应用在集群的各个节点之间一致。 本发明使得能够在一个节点上启动卷的快照,这使得使用该卷的集群中的所有应用在创建快照之前将其数据保存到卷。 因此,快照与使用该卷的集群中的所有应用程序的应用程序一致。 本发明还使得各种节点上的应用对所创建的快照执行后快照处理。 本发明可以用于不集群意识的现有备份系统,以使现有备份系统能够创建应用程序跨群集多个节点共享的卷的应用程序一致快照。
-
公开(公告)号:US08732346B2
公开(公告)日:2014-05-20
申请号:US12971997
申请日:2010-12-17
申请人: Andrea D'Amato , Vinod R. Shankar , Alan Warwick
发明人: Andrea D'Amato , Vinod R. Shankar , Alan Warwick
CPC分类号: G06F3/0659 , G06F3/0619 , G06F3/067
摘要: Coordinating methods of I/O access to a shared data store. A method includes at a node, in a distributed system, performing one or more I/O operations on the shared data store using direct I/O access on a virtual data container. Direct I/O access includes performing I/O operations directly from the node to the shared data store including not having an owner node perform the I/O operation on the shared data store on the node's behalf. The owner node is a different node than the node doing the direct I/O operation. The owner node accesses the shared data store through a local data container. The method further includes determining that one or more subsequent I/O operations should be performed using redirected I/O access, where I/O operations are directed through a centralized node. The method further includes indicating to nodes in the distributed system to switch to redirected I/O access method.
摘要翻译: 对共享数据存储的I / O访问的协调方法。 一种方法包括在分布式系统的节点处,在虚拟数据容器上使用直接I / O访问在共享数据存储器上执行一个或多个I / O操作。 直接I / O访问包括直接从节点到共享数据存储器的I / O操作,包括没有所有者节点代表节点在共享数据存储上执行I / O操作。 所有者节点与执行直接I / O操作的节点不同。 所有者节点通过本地数据容器访问共享数据存储。 该方法还包括确定使用重定向的I / O访问来执行一个或多个后续I / O操作,其中I / O操作通过集中式节点引导。 该方法还包括向分布式系统中的节点指示切换到重定向的I / O访问方法。
-
-
-
-
-
-
-
-
-