-
公开(公告)号:US11720457B2
公开(公告)日:2023-08-08
申请号:US17876395
申请日:2022-07-28
Applicant: VMware, Inc.
Inventor: Keerthi Kumar , Halesh Sadashiv , Sairam Veeraswamy , Rajesh Venkatasubramanian , Kiran Dikshit , Kiran Tati
IPC: G06F11/00 , G06F11/20 , G06F15/173
CPC classification number: G06F11/2046 , G06F11/2023 , G06F11/2038 , G06F11/2094 , G06F15/17331 , G06F2201/85
Abstract: Techniques for implementing RDMA-based recovery of dirty data in remote memory are provided. In one set of embodiments, upon occurrence of a failure at a first (i.e., source) host system, a second (i.e., failover) host system can allocate a new memory region corresponding to a memory region of the source host system and retrieve a baseline copy of the memory region from a storage backend shared by the source and failover host systems. The failover host system can further populate the new memory region with the baseline copy and retrieve one or more dirty page lists for the memory region from the source host system via RDMA, where the one or more dirty page lists identify memory pages in the memory region that include data updates not present in the baseline copy. For each memory page identified in the one or more dirty page lists, the failover host system can then copy the content of that memory page from the memory region of the source host system to the new memory region via RDMA.
-
公开(公告)号:US11698760B2
公开(公告)日:2023-07-11
申请号:US17481418
申请日:2021-09-22
Applicant: VMWARE, INC.
Inventor: Marcos K. Aguilera , Keerthi Kumar , Pramod Kumar , Pratap Subrahmanyam , Sairam Veeraswamy , Rajesh Venkatasubramanian
IPC: G06F3/06
CPC classification number: G06F3/067 , G06F3/065 , G06F3/0611 , G06F3/0613 , G06F3/0619
Abstract: Disclosed are various embodiments for improving the resiliency and performance of cluster memory. First, a computing device can submit a write request to a byte-addressable chunk of memory stored by a memory host, wherein the byte-addressable chunk of memory is read-only. Then, the computing device can determine that a page-fault occurred in response to the write request. Next, the computing device can copy a page associated with the write request from the byte-addressable chunk of memory to the memory of the computing device. Subsequently, the computing device can free the page from the memory host. Then, the computing device can update a page table entry for the page to refer to a location of the page in the memory of the computing device.
-
公开(公告)号:US20230168965A1
公开(公告)日:2023-06-01
申请号:US18101536
申请日:2023-01-25
Applicant: VMware, Inc.
Inventor: Marcos K. Aguilera , Keerthi Kumar , Pramod Kumar , Pratap Subrahmanyam , Sairam Veeraswamy , Rajesh Venkatasubramanian
CPC classification number: G06F11/1068 , G06F11/0772 , G06F3/0673 , G06F3/0659 , G06F3/0619
Abstract: Disclosed are various embodiments for improving the resiliency and performance of clustered memory. A computing device can generate at least one parity page from at least a first local page and a second local page. The computing device can then submit a first write request for the first local page to a first one of a plurality of memory hosts. The computing device can also submit a second write request for the second local page to a second one of the plurality of memory hosts. Additionally, the computing device can submit a third write request for the parity page to a third one of the plurality of memory hosts.
-
公开(公告)号:US20230017804A1
公开(公告)日:2023-01-19
申请号:US17481418
申请日:2021-09-22
Applicant: VMWARE, INC.
Inventor: MARCOS K. AGUILERA , Keerthi Kumar , Pramod Kumar , Pratap Subrahmanyam , Sairam Veeraswamy , Rajesh Venkatasubramanian
IPC: G06F12/1009 , G06F11/10
Abstract: Disclosed are various embodiments for improving the resiliency and performance of cluster memory. First, a computing device can submit a write request to a byte-addressable chunk of memory stored by a memory host, wherein the byte-addressable chunk of memory is read-only. Then, the computing device can determine that a page-fault occurred in response to the write request. Next, the computing device can copy a page associated with the write request from the byte-addressable chunk of memory to the memory of the computing device. Subsequently, the computing device can free the page from the memory host. Then, the computing device can update a page table entry for the page to refer to a location of the page in the memory of the computing device.
-
公开(公告)号:US20220365855A1
公开(公告)日:2022-11-17
申请号:US17876395
申请日:2022-07-28
Applicant: VMware, Inc.
Inventor: Keerthi Kumar , Halesh Sadashiv , Sairam Veeraswamy , Rajesh Venkatasubramanian , Kiran Dikshit , Kiran Tati
IPC: G06F11/20 , G06F15/173
Abstract: Techniques for implementing RDMA-based recovery of dirty data in remote memory are provided. In one set of embodiments, upon occurrence of a failure at a first (i.e., source) host system, a second (i.e., failover) host system can allocate a new memory region corresponding to a memory region of the source host system and retrieve a baseline copy of the memory region from a storage backend shared by the source and failover host systems. The failover host system can further populate the new memory region with the baseline copy and retrieve one or more dirty page lists for the memory region from the source host system via RDMA, where the one or more dirty page lists identify memory pages in the memory region that include data updates not present in the baseline copy. For each memory page identified in the one or more dirty page lists, the failover host system can then copy the content of that memory page from the memory region of the source host system to the new memory region via RDMA.
-
公开(公告)号:US20210019168A1
公开(公告)日:2021-01-21
申请号:US16698994
申请日:2019-11-28
Applicant: VMWARE, INC.
Inventor: Marcos Aguilera , Keerthi Kumar , Pramod Kumar , Arun Ramanathan , Pratap Subrahmanyam , Sairam Veeraswamy , Rajesh Venkatasubramanian , Manish Mishra
IPC: G06F9/455 , G06F3/06 , G06F12/1009 , G06F16/907
Abstract: The disclosure provides an approach for creating a pool of memory out of local memories of host machines, and providing that pool for the hosts to use. The pool is managed by a controller that keeps track of memory usage and allocated memory among hosts. The controller allocates or reclaims memory between hosts, as needed by the hosts. Memory allocated from a second host to a first host may then be divided into smaller portions by the first host, and further allocated to virtual machines executing within the first host.
-
公开(公告)号:US11436112B1
公开(公告)日:2022-09-06
申请号:US17321673
申请日:2021-05-17
Applicant: VMware, Inc.
Inventor: Keerthi Kumar , Halesh Sadashiv , Sairam Veeraswamy , Rajesh Venkatasubramanian , Kiran Dikshit , Kiran Tati
IPC: G06F11/00 , G06F11/20 , G06F15/173
Abstract: Techniques for implementing RDMA-based recovery of dirty data in remote memory are provided. In one set of embodiments, upon occurrence of a failure at a first (i.e., source) host system, a second (i.e., failover) host system can allocate a new memory region corresponding to a memory region of the source host system and retrieve a baseline copy of the memory region from a storage backend shared by the source and failover host systems. The failover host system can further populate the new memory region with the baseline copy and retrieve one or more dirty page lists for the memory region from the source host system via RDMA, where the one or more dirty page lists identify memory pages in the memory region that include data updates not present in the baseline copy. For each memory page identified in the one or more dirty page lists, the failover host system can then copy the content of that memory page from the memory region of the source host system to the new memory region via RDMA.
-
公开(公告)号:US20230315593A1
公开(公告)日:2023-10-05
申请号:US18331019
申请日:2023-06-07
Applicant: VMware, Inc.
Inventor: Keerthi Kumar , Halesh Sadashiv , Sairam Veeraswamy , Rajesh Venkatasubramanian , Kiran Dikshit , Kiran Tati
IPC: G06F11/20 , G06F15/173
CPC classification number: G06F11/2046 , G06F11/2094 , G06F15/17331 , G06F11/2023 , G06F11/2038 , G06F2201/85
Abstract: Techniques for implementing RDMA-based recovery of dirty data in remote memory are provided. In one set of embodiments, upon occurrence of a failure at a first (i.e., source) host system, a second (i.e., failover) host system can allocate a new memory region corresponding to a memory region of the source host system and retrieve a baseline copy of the memory region from a storage backend shared by the source and failover host systems. The failover host system can further populate the new memory region with the baseline copy and retrieve one or more dirty page lists for the memory region from the source host system via RDMA, where the one or more dirty page lists identify memory pages in the memory region that include data updates not present in the baseline copy. For each memory page identified in the one or more dirty page lists, the failover host system can then copy the content of that memory page from the memory region of the source host system to the new memory region via RDMA.
-
公开(公告)号:US11704030B2
公开(公告)日:2023-07-18
申请号:US17481352
申请日:2021-09-22
Applicant: VMWARE, INC.
Inventor: Marcos K. Aguilera , Keerthi Kumar , Pramod Kumar , Pratap Subrahmanyam , Sairam Veeraswamy , Rajesh Venkatasubramanian
IPC: G06F3/06
CPC classification number: G06F3/0631 , G06F3/0604 , G06F3/067 , G06F3/0659
Abstract: Disclosed are various embodiments for improving resiliency and performance of clustered memory. A computing device can acquire a chunk of byte-addressable memory from a cluster memory host. The computing device can then identify an active set of allocated memory pages and an inactive set of allocated memory pages for a process executing on the computing device. Next, the computing device can store the active set of allocated memory pages for the process in the memory of the computing device. Finally, the computing device can store the inactive set of allocated memory pages for the process in the chunk of byte-addressable memory of the cluster memory host.
-
公开(公告)号:US11687286B2
公开(公告)日:2023-06-27
申请号:US17481335
申请日:2021-09-22
Applicant: VMWARE, INC.
Inventor: Marcos K. Aguilera , Keerthi Kumar , Pramod Kumar , Pratap Subrahmanyam , Sairam Veeraswamy , Rajesh Venkatasubramanian
CPC classification number: G06F3/0659 , G06F3/061 , G06F3/0604 , G06F3/067 , G06F3/0631
Abstract: Disclosed are various embodiments for improving the resiliency and performance for clustered memory. A computing device can mark a page of the memory as being reclaimed. The computing device can then set the page of the memory as read-only. Next, the computing device can submit a write request for the contents of the page to individual ones of a plurality of memory hosts. Subsequently, the computing device can receive individual confirmations of a successful write of the page from the individual ones of the plurality of memory hosts. Then, the computing device can mark the page as free in response to receipt of the individual confirmations of the successful write from the individual ones of the plurality of memory hosts.
-
-
-
-
-
-
-
-
-