-
公开(公告)号:US11467930B2
公开(公告)日:2022-10-11
申请号:US16567026
申请日:2019-09-11
Applicant: EMC IP Holding Company LLC
Inventor: Peng Yin , Ningdong Li , Jiahui Wang , Shao Kuang Hu
Abstract: Distributed failover of a failed BE, and failback of the failed BE when it recovers, are described. A separate pool of memory may be reserved for each BE on a storage system. Each such pool may be further divided into sub-pools. When a BE fails, the one or more other active BEs on the storage system may be notified, and each such active BE may attempt to take ownership of one or more sub-pools of the BE and execute the one or more pending write operations of the failed BE. Upon completing execution of the pending write operations in a failed-over sub-pool, the ownership of the sub-pool may be returned to the failed BE. When the failed BE recovers, the recovered BE may attempt to take back control of one or more of its sub-pools.
-
公开(公告)号:US10983879B1
公开(公告)日:2021-04-20
申请号:US16176188
申请日:2018-10-31
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Akash Agrawal , Timothy Johnson , Jiahui Wang , Peng Yin , Stephen Richard Ives , Michael Garvey , Christopher Monti
Abstract: A method of distributed management of recovery of multi-controller NVMe drives includes detecting a path failure of a PCIe path from a first storage node to a first controller on the multi-controller NVMe drive, and initially attempting to correct the path failure using a controller level reset. If the controller level reset is unsuccessful, an alternative path to the controller is sought, and if that is unsuccessful a drive level reset operation is coordinated by all storage nodes with controllers executing on the NVMe drive. To coordinate reset of the NVMe drive, one storage node is elected master. Each node (both slave and master) initiates quiescing of IO operations on its respective controller, and after quiescing has completed, initiates shutdown of its respective controller. Once all controllers are shut down, the master initiates reset of the NVMe drive. Timeouts are used to constrain completion of the quiescing and shutdown operations.
-
公开(公告)号:US20210073092A1
公开(公告)日:2021-03-11
申请号:US16567026
申请日:2019-09-11
Applicant: EMC IP Holding Company LLC
Inventor: Peng Yin , Ningdong Li , Jiahui Wang , Shao Kuang Hu
Abstract: Distributed failover of a failed BE, and failback of the failed BE when it recovers, are described. A separate pool of memory may be reserved for each BE on a storage system. Each such pool may be further divided into sub-pools. When a BE fails, the one or more other active BEs on the storage system may be notified, and each such active BE may attempt to take ownership of one or more sub-pools of the BE and execute the one or more pending write operations of the failed BE. Upon completing execution of the pending write operations in a failed-over sub-pool, the ownership of the sub-pool may be returned to the failed BE. When the failed BE recovers, the recovered BE may attempt to take back control of one or more of its sub-pools.
-
公开(公告)号:US11526447B1
公开(公告)日:2022-12-13
申请号:US17363167
申请日:2021-06-30
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Peng Wu , Rong Yu , Jiahui Wang , Lixin Pang
IPC: G06F12/0844 , G06F3/06
Abstract: A data service layer running on a storage director node generates a request to destage host data from a plurality of cache slots in a single back-end track. The destage request includes pointers to addresses of the cache slots and indicates an order in which the host application data in the cache slots is to be included in the back-end track. A back-end redundant array of independent drives (RAID) subsystem running on a drive adapter is responsive to the request to calculate parity information using the host application data in the cache slots. The back-end RAID subsystem assembles the single back-end track comprising the host application data from the plurality of cache slots of the request, and destages the single back-end track to a non-volatile drive in a single back-end input-output (IO) operation.
-
公开(公告)号:US11372562B1
公开(公告)日:2022-06-28
申请号:US17225170
申请日:2021-04-08
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Peng Wu , Rong Yu , Jiahui Wang , Lixin Pang
IPC: G06F3/06
Abstract: A storage system that supports multiple RAID levels presents storage objects with front-end tracks corresponding to back-end tracks on non-volatile drives and accesses the drives using a single type of back-end allocation unit that is larger than a back-end track. When the number of members of a protection group of a RAID level does not align with the back-end allocation unit, multiple back-end tracks are grouped and accessed using a single IO. The number of back-end tracks in a group is selected to align with the back-end allocation unit size. If the front-end tracks are variable size, then front-end tracks may be destaged into a smaller number of grouped back-end tracks in a single IO.
-
公开(公告)号:US11256447B1
公开(公告)日:2022-02-22
申请号:US17065558
申请日:2020-10-08
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Lixin Pang , Jiahui Wang , Peng Wu , Rong Yu
IPC: G06F11/10 , G06F11/20 , G06F3/06 , H04L67/1097
Abstract: A storage array that presents a logical storage device to hosts that is accessed using front-end tracks and access tangible managed drive using back-end tracks locates multiple front-end tracks in individual back-end tracks. Error-correcting codes are used to identify different front-end tracks in a back-end track when the back-end track is copied from the managed drives into storage array memory. CKD front-end tracks can be split into multiple partial CKD front-end tracks that are located at contiguous address space in different back-end tracks. The front-end tracks that are located in a particular back-end track may be selected to reduce or minimize unused space. The front-end tracks in a back-end track may be logically stored on different production volumes.
-
公开(公告)号:US20210081319A1
公开(公告)日:2021-03-18
申请号:US16569704
申请日:2019-09-13
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Jiahui Wang , Peng Yin
IPC: G06F12/0804 , G06F13/40
Abstract: A SAN storage node has multiple engines, each including at least one computing node with at least one drive adapter for accessing arrays of drives. All drives of the SAN node are accessible by all drive adapters. The drives are organized into hypers. Each hyper is accessible to all drive adapters. Each drive is temporarily associated with a preferred owner for spin-up. A drive adapter that initializes spin-down of one of the drives takes a spindle lock on the drive being spun-down and prompts all other drive adapters to complete outstanding IOs on the drive being spun-down.
-
8.
公开(公告)号:US20200034059A1
公开(公告)日:2020-01-30
申请号:US16048637
申请日:2018-07-30
Applicant: EMC IP Holding Company LLC
Inventor: Peng Yin , Kunxiu Gao , Jiahui Wang , Seema G. Pai , Ningdong Li , Daohong Wang , Stephen Richard Ives , Li Lang
IPC: G06F3/06
Abstract: Techniques for performing configuration may include: determining an initial distribution of logical devices among disk adapters, wherein the initial distribution assigns each logical device a designated disk adapter as a primary active disk adapter servicing I/Os directed to the logical device, wherein the logical devices have storage provisioned from a RAID group including physical storage devices, wherein the disk adapters concurrently issue I/Os to each of the physical storage devices; detecting a configuration change; and responsive to detecting a configuration change, performing first processing. The first processing may include determining, in accordance with the configuration change, a redistribution of the logical devices among the disk adapters; and draining pending I/Os. The configuration change may include adding or removing a logical device provisioned from the RAID group. Performing a first operation on a storage device of the RAID group may include synchronous draining of I/Os of the storage device.
-
公开(公告)号:US11086536B2
公开(公告)日:2021-08-10
申请号:US16048637
申请日:2018-07-30
Applicant: EMC IP Holding Company LLC
Inventor: Peng Yin , Kunxiu Gao , Jiahui Wang , Seema G. Pai , Ningdong Li , Daohong Wang , Stephen Richard Ives , Li Lang
IPC: G06F3/06
Abstract: Techniques for performing configuration may include: determining an initial distribution of logical devices among disk adapters, wherein the initial distribution assigns each logical device a designated disk adapter as a primary active disk adapter servicing I/Os directed to the logical device, wherein the logical devices have storage provisioned from a RAID group including physical storage devices, wherein the disk adapters concurrently issue I/Os to each of the physical storage devices; detecting a configuration change; and responsive to detecting a configuration change, performing first processing. The first processing may include determining, in accordance with the configuration change, a redistribution of the logical devices among the disk adapters; and draining pending I/Os. The configuration change may include adding or removing a logical device provisioned from the RAID group. Performing a first operation on a storage device of the RAID group may include synchronous draining of I/Os of the storage device.
-
公开(公告)号:US10990527B2
公开(公告)日:2021-04-27
申请号:US16569704
申请日:2019-09-13
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Jiahui Wang , Peng Yin
IPC: G06F12/00 , G06F12/0804 , G06F13/40 , G06F13/00 , G06F3/06
Abstract: A SAN storage node has multiple engines, each including at least one computing node with at least one drive adapter for accessing arrays of drives. All drives of the SAN node are accessible by all drive adapters. The drives are organized into hypers. Each hyper is accessible to all drive adapters. Each drive is temporarily associated with a preferred owner for spin-up. A drive adapter that initializes spin-down of one of the drives takes a spindle lock on the drive being spun-down and prompts all other drive adapters to complete outstanding IOs on the drive being spun-down.
-
-
-
-
-
-
-
-
-