-
21.
公开(公告)号:US11221886B2
公开(公告)日:2022-01-11
申请号:US15982618
申请日:2018-05-17
发明人: John A. Bivens , Ruchi Mahindru , Eugen Schenfeld , Min Li , Valentina Salapura
IPC分类号: G06F9/50 , G06F12/0862 , G06F11/34
摘要: Embodiments for optimizing dynamic resource allocations in a disaggregated computing environment. A new workload is assigned to a subset of a plurality of processors, the subset of processors assigned a subset of a plurality of cache devices. A determination is made that the new workload is categorized as a cache-friendly workload having a memory need which can be met primarily by the subset of cache devices by identifying that underlying data necessitated by the new workload resides primarily within the subset of cache devices. Pursuant to determining the new workload is the cache-friendly workload, a cache related action is performed to increase performance of the new workload executed by the subset of processors and commensurately executes additional workloads performed by other ones of the plurality of processors within the disaggregated computing environment.
-
22.
公开(公告)号:US11188429B2
公开(公告)日:2021-11-30
申请号:US16204602
申请日:2018-11-29
发明人: HariGovind V. Ramasamy , Eugen Schenfeld , Valentina Salapura , John A. Bivens , Min Li , Ruchi Mahindru , Yaoping Ruan
IPC分类号: G06F11/00 , G06F11/20 , H04L29/08 , H04L12/927 , H04L12/24
摘要: A new approach to resiliency management is provided in a data center wherein servers are constructed dynamically, on-demand and based on workload requirements and a tenant's resiliency requirements by allocating resources from these pools. In this approach, a set of functionally-equivalent “interchangeable compute units” (ICUs) are composed of resources from resource pools that have been extended to include not only different resource types (CPU, memory, accelerators), but also resources of different specifications (specs) and flavors. As a workload is being processed, the health or status of the resources are monitored. Upon a performance issue or failure event, a resiliency manager can swap out a current ICU and replace it with a functionally-equivalent ICU. Preferably, individual ICUs are hosted on one of: resources of a same type each with different specifications, and resources of a same type and specification and different flavors. The approach enables failure independence in a disaggregated environment.
-
23.
公开(公告)号:US11169592B2
公开(公告)日:2021-11-09
申请号:US15289280
申请日:2016-10-10
发明人: Ruchi Mahindru , John A. Bivens , Koushik K. Das , Min Li , HariGovind V. Ramasamy , Yaoping Ruan , Valentina Salapura , Eugen Schenfeld
IPC分类号: G06F1/329 , G06F1/30 , G06F1/3206 , G06F1/3296 , G06F1/324 , G06F1/3287 , G06F1/3228
摘要: For power management in a disaggregated computing system, upon detecting an electrical power reduction to the disaggregated computing system, respective workloads being performed by respective processors within a set of processors are prioritized according to a service level agreement (SLA) of the respective workloads. Backup electrical power is dynamically allocated to the respective processors performing the respective workloads based upon a priority of the SLA of the respective workloads.
-
公开(公告)号:US11153223B2
公开(公告)日:2021-10-19
申请号:US15093082
申请日:2016-04-07
发明人: Valentina Salapura , John Alan Bivens , Koushik K. Das , Min Li , Ruchi Mahindru , Harigovind V. Ramasamy , Yaoping Ruan , Eugen Schenfeld
IPC分类号: H04L12/911 , H04L12/24 , G06F9/50 , H04L12/26
摘要: Server resources in a data center are disaggregated into shared server resource pools. Servers are constructed dynamically, on-demand and based on workload requirements, by allocating from these resource pools. A disaggregated compute system of this type keeps track of resources that are available in the shared server resource pools, and it manages those resources based on that information. Each server entity built is assigned with a unique server ID, and each resource that comprises a component thereof is tagged with the identifier. As a workload is processed by the server entity, its composition may change, e.g. by allocating more resources to the server entity, or by de-allocating resources from the server entity. Workload requests are associated with the unique server ID for the server entity. When a workload request is received at a resource, it matches its unique server ID to that of the request before servicing the request.
-
公开(公告)号:US10977085B2
公开(公告)日:2021-04-13
申请号:US15982572
申请日:2018-05-17
发明人: John A. Bivens , Min Li , Ruchi Mahindru , Valentina Salapura , Eugen Schenfeld
IPC分类号: G06F9/50 , G06F9/48 , G06F12/0862 , G06F11/34 , H04L12/24
摘要: Embodiments for optimizing dynamic resource allocations in a disaggregated computing environment. Data access patterns of data elements associated with a set of workloads are tracked across predetermined windows of time. The set of workloads is categorized into at least one of a plurality of classes, each class characterized by a method of data access of the data elements during the tracking of the data access patterns. Resource allocations are optimized in the disaggregated computing environment for a current iteration of the set of workloads based on the method of data access by allocating specific resources within the disaggregated data center to the current iteration of the set of workloads according to the data access patterns in real-time.
-
公开(公告)号:US10895682B1
公开(公告)日:2021-01-19
申请号:US16518131
申请日:2019-07-22
发明人: Tymon Barwicz , Douglas M. Gill , William M. Green , Jason S. Orcutt , Jessie C. Rosenberg , Eugen Schenfeld , Chi Xiong
摘要: Photonic circuits are disclosed having an efficient optical power distribution network. Laser chips (InP) having different wavelengths are flip-chip assembled near the center of a silicon photonic chip. Each InP die has multiple optical lanes, but a given die has only one wavelength. Waveguides formed in the photonic chip are optically connected to the lanes, and fan out to form multiple waveguide sets, where each waveguide set has one of the waveguides from each of the different wavelengths, i.e., one waveguide from each InP die. The waveguide network is optimized to minimize the number of crossings that any given waveguide may have, and no waveguide having a particular wavelength crosses another waveguide of the same wavelength. The unique arrangements of light sources and waveguides allows the use of a smaller number of more intense laser sources, particularly in applications such as performance-optimized datacenters where liquid cooling systems may be leveraged.
-
27.
公开(公告)号:US10761915B2
公开(公告)日:2020-09-01
申请号:US16143418
申请日:2018-09-26
发明人: Ruchi Mahindru , John A. Bivens , Min Li , Valentina Salapura , Eugen Schenfeld
摘要: Embodiments for preemptive deep diagnostics of resources in a disaggregated computing environment. Responsive to detecting a threshold breach of a recurrent event associated with a first resource of a first resource type executing a workload, an alert is generated; and responsive to receiving the alert, the execution of the workload on the first resource is ceased. Health check diagnostics are identified and invoked on the first resource based on the alert and a server telemetry. Results of the health check diagnostics are mapped to a set of learned failure patterns; and a potential failure of the first resource is predicted based on the mapping.
-
28.
公开(公告)号:US10754720B2
公开(公告)日:2020-08-25
申请号:US16143430
申请日:2018-09-26
发明人: Ruchi Mahindru , John A. Bivens , Min Li , Valentina Salapura , Eugen Schenfeld
摘要: Embodiments for preemptive deep diagnostics of resources in a disaggregated computing environment. Respective resources from respective pools of resources of different types are assigned to compose a disaggregated server. A workload is executed by the respective resources within the disaggregated server while the respective resources of the disaggregated server are monitored by a monitoring task. Responsive to a first resource of the respective resources generating an alert from the monitoring task, the workload is instantiated to be concurrently performed by the first resource and a second resource of the respective resources while initiating a health check diagnostic operation on the first resource.
-
公开(公告)号:US10534598B2
公开(公告)日:2020-01-14
申请号:US15398350
申请日:2017-01-04
发明人: Valentina Salapura , John A. Bivens , Min Li , Ruchi Mahindru , HariGovind V. Ramasamy , Yaoping Ruan , Eugen Schenfeld
IPC分类号: G06F8/65
摘要: Embodiments for performing rolling software upgrades in a disaggregated computing environment. A rolling upgrade manager is provided for upgrading one or more disaggregated servers. A designated memory area is used for storing an updated software component, and a disaggregated server is switched to the designated memory area from a currently assigned memory area when performing the software upgrade.
-
公开(公告)号:US10401940B2
公开(公告)日:2019-09-03
申请号:US15289272
申请日:2016-10-10
发明人: Ruchi Mahindru , John A. Bivens , Koushik K. Das , Min Li , HariGovind V. Ramasamy , Yaoping Ruan , Valentina Salapura , Eugen Schenfeld
IPC分类号: G06F9/50 , G06F1/3287 , G06F1/324 , G06F1/3296 , G06F1/30
摘要: For power management in a disaggregated computing system, a set of initial electrical power levels are allocated to a set of processor cores according to a predicted desired workload, where the set of initial power levels aggregate to an initial collective contracted power level. Electrical power is dynamically allocated to respective processor cores within the set of processor cores to produce a capacity to execute a collective demanded workload while maintaining the electrical power to the set of processor cores to an approximately constant electrical power level within a threshold of the initial collective contracted electrical power level.
-
-
-
-
-
-
-
-
-