-
公开(公告)号:US20210073092A1
公开(公告)日:2021-03-11
申请号:US16567026
申请日:2019-09-11
Applicant: EMC IP Holding Company LLC
Inventor: Peng Yin , Ningdong Li , Jiahui Wang , Shao Kuang Hu
Abstract: Distributed failover of a failed BE, and failback of the failed BE when it recovers, are described. A separate pool of memory may be reserved for each BE on a storage system. Each such pool may be further divided into sub-pools. When a BE fails, the one or more other active BEs on the storage system may be notified, and each such active BE may attempt to take ownership of one or more sub-pools of the BE and execute the one or more pending write operations of the failed BE. Upon completing execution of the pending write operations in a failed-over sub-pool, the ownership of the sub-pool may be returned to the failed BE. When the failed BE recovers, the recovered BE may attempt to take back control of one or more of its sub-pools.
-
公开(公告)号:US20210081319A1
公开(公告)日:2021-03-18
申请号:US16569704
申请日:2019-09-13
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Jiahui Wang , Peng Yin
IPC: G06F12/0804 , G06F13/40
Abstract: A SAN storage node has multiple engines, each including at least one computing node with at least one drive adapter for accessing arrays of drives. All drives of the SAN node are accessible by all drive adapters. The drives are organized into hypers. Each hyper is accessible to all drive adapters. Each drive is temporarily associated with a preferred owner for spin-up. A drive adapter that initializes spin-down of one of the drives takes a spindle lock on the drive being spun-down and prompts all other drive adapters to complete outstanding IOs on the drive being spun-down.
-
3.
公开(公告)号:US20200034059A1
公开(公告)日:2020-01-30
申请号:US16048637
申请日:2018-07-30
Applicant: EMC IP Holding Company LLC
Inventor: Peng Yin , Kunxiu Gao , Jiahui Wang , Seema G. Pai , Ningdong Li , Daohong Wang , Stephen Richard Ives , Li Lang
IPC: G06F3/06
Abstract: Techniques for performing configuration may include: determining an initial distribution of logical devices among disk adapters, wherein the initial distribution assigns each logical device a designated disk adapter as a primary active disk adapter servicing I/Os directed to the logical device, wherein the logical devices have storage provisioned from a RAID group including physical storage devices, wherein the disk adapters concurrently issue I/Os to each of the physical storage devices; detecting a configuration change; and responsive to detecting a configuration change, performing first processing. The first processing may include determining, in accordance with the configuration change, a redistribution of the logical devices among the disk adapters; and draining pending I/Os. The configuration change may include adding or removing a logical device provisioned from the RAID group. Performing a first operation on a storage device of the RAID group may include synchronous draining of I/Os of the storage device.
-
公开(公告)号:US11467930B2
公开(公告)日:2022-10-11
申请号:US16567026
申请日:2019-09-11
Applicant: EMC IP Holding Company LLC
Inventor: Peng Yin , Ningdong Li , Jiahui Wang , Shao Kuang Hu
Abstract: Distributed failover of a failed BE, and failback of the failed BE when it recovers, are described. A separate pool of memory may be reserved for each BE on a storage system. Each such pool may be further divided into sub-pools. When a BE fails, the one or more other active BEs on the storage system may be notified, and each such active BE may attempt to take ownership of one or more sub-pools of the BE and execute the one or more pending write operations of the failed BE. Upon completing execution of the pending write operations in a failed-over sub-pool, the ownership of the sub-pool may be returned to the failed BE. When the failed BE recovers, the recovered BE may attempt to take back control of one or more of its sub-pools.
-
公开(公告)号:US10983879B1
公开(公告)日:2021-04-20
申请号:US16176188
申请日:2018-10-31
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Akash Agrawal , Timothy Johnson , Jiahui Wang , Peng Yin , Stephen Richard Ives , Michael Garvey , Christopher Monti
Abstract: A method of distributed management of recovery of multi-controller NVMe drives includes detecting a path failure of a PCIe path from a first storage node to a first controller on the multi-controller NVMe drive, and initially attempting to correct the path failure using a controller level reset. If the controller level reset is unsuccessful, an alternative path to the controller is sought, and if that is unsuccessful a drive level reset operation is coordinated by all storage nodes with controllers executing on the NVMe drive. To coordinate reset of the NVMe drive, one storage node is elected master. Each node (both slave and master) initiates quiescing of IO operations on its respective controller, and after quiescing has completed, initiates shutdown of its respective controller. Once all controllers are shut down, the master initiates reset of the NVMe drive. Timeouts are used to constrain completion of the quiescing and shutdown operations.
-
公开(公告)号:US11086536B2
公开(公告)日:2021-08-10
申请号:US16048637
申请日:2018-07-30
Applicant: EMC IP Holding Company LLC
Inventor: Peng Yin , Kunxiu Gao , Jiahui Wang , Seema G. Pai , Ningdong Li , Daohong Wang , Stephen Richard Ives , Li Lang
IPC: G06F3/06
Abstract: Techniques for performing configuration may include: determining an initial distribution of logical devices among disk adapters, wherein the initial distribution assigns each logical device a designated disk adapter as a primary active disk adapter servicing I/Os directed to the logical device, wherein the logical devices have storage provisioned from a RAID group including physical storage devices, wherein the disk adapters concurrently issue I/Os to each of the physical storage devices; detecting a configuration change; and responsive to detecting a configuration change, performing first processing. The first processing may include determining, in accordance with the configuration change, a redistribution of the logical devices among the disk adapters; and draining pending I/Os. The configuration change may include adding or removing a logical device provisioned from the RAID group. Performing a first operation on a storage device of the RAID group may include synchronous draining of I/Os of the storage device.
-
公开(公告)号:US10990527B2
公开(公告)日:2021-04-27
申请号:US16569704
申请日:2019-09-13
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Jiahui Wang , Peng Yin
IPC: G06F12/00 , G06F12/0804 , G06F13/40 , G06F13/00 , G06F3/06
Abstract: A SAN storage node has multiple engines, each including at least one computing node with at least one drive adapter for accessing arrays of drives. All drives of the SAN node are accessible by all drive adapters. The drives are organized into hypers. Each hyper is accessible to all drive adapters. Each drive is temporarily associated with a preferred owner for spin-up. A drive adapter that initializes spin-down of one of the drives takes a spindle lock on the drive being spun-down and prompts all other drive adapters to complete outstanding IOs on the drive being spun-down.
-
公开(公告)号:US09830266B1
公开(公告)日:2017-11-28
申请号:US14156678
申请日:2014-01-16
Applicant: EMC IP Holding Company LLC
Inventor: Rong Yu , Orit Levin-Michael , John W. Lefferts , Pei-Ching Hwang , Peng Yin , Yechiel Yochai , Dan Aharoni , Qun Fan , Stephen Richard Ives
IPC: G06F12/00 , G06F12/0862
CPC classification number: G06F12/0862 , G06F2212/6024
Abstract: Described are techniques for processing a data operation in a data storage system. A front-end component of the data storage system receives the data operation. In response to receiving the data operation, the front-end component performs first processing. The first processing includes determining whether the data operation is a read operation requesting to read a data portion which results in a cache miss; and if said determining determines that the data operation is a read operation resulting in a cache miss, performing read miss processing. Read miss processing includes sequential stream recognition processing performed by the front-end component to determine whether the data portion is included in a sequential stream.
-
-
-
-
-
-
-