-
Publication No.: US11068312B2
Publication Date: 2021-07-20
Application No.: US16368072
Filing Date: 2019-03-28
Applicant: Amazon Technologies, Inc.
Inventor: Malcolm Featonby, Leslie Johann Lamprecht, John Merrill Phillips, Umesh Chandani, Roberto Pentz De Faria, Hou Liu, Ladan Mahabadi, Letian Feng
Abstract: Techniques for an optimization service of a service provider network to help optimize the selection, configuration, and utilization, of virtual machine (VM) instance types to support workloads on behalf of users. The optimization service may implement the techniques described herein at various stages in a life cycle of a workload to help optimize the performance of the workload, and reduce underutilization of computing resources. For example, the optimization service may perform techniques to help new users select an optimized VM instance type on which to initially launch their workload. Further, the optimization service may monitor a workload for the life of the workload, and determine new VM instance types, and/or configuration modifications, that optimize the performance of the workload. The optimization service may provide recommendations to users that help improve performance of their workloads, and that also increase the aggregate utilization of computing resources of the service provider network.
-
Publication No.: US10726518B2
Publication Date: 2020-07-28
Application No.: US16530888
Filing Date: 2019-08-02
Applicant: Amazon Technologies, Inc.
Inventor: Douglas Cotton Kurtz, Malcolm Featonby, Umesh Chandani, Adithya Bhat, Yuxuan Liu, Mihir Sadruddin Surani
Abstract: Methods, systems, and computer-readable media for capacity reservation for virtualized graphics processing are disclosed. A request is received to attach a virtual GPU to a virtual compute instance. The request comprises one or more constraints. Availability information is retrieved from a data store that indicates virtual GPUs available in a provider network and matching the one or more constraints. A virtual GPU is selected from among the available virtual GPUs in the availability information. The selected virtual GPU is reserved for attachment to the virtual compute instance. The virtual compute instance is implemented using CPU resources and memory resources of a physical compute instance, the virtual GPU is implemented using a physical GPU in the provider network, and the physical GPU is accessible to the physical compute instance over a network.
-
Publication No.: US20200082495A1
Publication Date: 2020-03-12
Application No.: US16684985
Filing Date: 2019-11-15
Applicant: Amazon Technologies, Inc.
Inventor: Malcolm Featonby, Yuxuan Liu, Umesh Chandani, John Merrill Phillips, Jr., Adithya Bhat, Douglas Cotton Kurtz, Mihir Sadruddin Surani
Abstract: Methods, systems, and computer-readable media for interaction monitoring for virtualized graphics processing are disclosed. Execution of an application is initiated on a virtual compute instance that is implemented using CPU and memory resources of a server. Instruction calls are produced by the execution of the application and sent from the server to a graphics server over a network. The graphics server comprises a physical GPU, and a virtual GPU is implemented using the physical GPU and attached to the virtual compute instance. GPU output is generated at the graphics server based at least in part on execution of the instruction calls using the virtual GPU. A log of interactions between the application and the virtual GPU is stored. The interactions comprise the instruction calls sent to the graphics server and responses to the instruction calls sent to the virtual compute instance.
-
Publication No.: US10373284B2
Publication Date: 2019-08-06
Application No.: US15376399
Filing Date: 2016-12-12
Applicant: Amazon Technologies, Inc.
Inventor: Douglas Cotton Kurtz, Malcolm Featonby, Umesh Chandani, Adithya Bhat, Yuxuan Liu, Mihir Sadruddin Surani
Abstract: Methods, systems, and computer-readable media for capacity reservation for virtualized graphics processing are disclosed. A request is received to attach a virtual GPU to a virtual compute instance. The request comprises one or more constraints. Availability information is retrieved from a data store that indicates virtual GPUs available in a provider network and matching the one or more constraints. A virtual GPU is selected from among the available virtual GPUs in the availability information. The selected virtual GPU is reserved for attachment to the virtual compute instance. The virtual compute instance is implemented using CPU resources and memory resources of a physical compute instance, the virtual GPU is implemented using a physical GPU in the provider network, and the physical GPU is accessible to the physical compute instance over a network.
-
Publication No.: US10169841B1
Publication Date: 2019-01-01
Application No.: US15470821
Filing Date: 2017-03-27
Applicant: Amazon Technologies, Inc.
Inventor: Malcolm Featonby, Douglas Cotton Kurtz, Paolo Maggi, Umesh Chandani, John Merrill Phillips, Jr., Yuxuan Liu, Adithya Bhat, Mihir Sadruddin Surani, Andrea Curtoni, Nicholas Patrick Wilt
Abstract: Methods, systems, and computer-readable media for dynamic interface synchronization for virtualized graphics processing are disclosed. A GPU interface synchronization request is sent from a compute instance to a graphics processing unit (GPU) server via a network. The GPU server comprises a virtual GPU attached to the compute instance and implemented using at least one physical GPU. Based at least in part on the GPU interface synchronization request, a shared version of a GPU interface is determined for use with the compute instance and the GPU server. Program code of the shared version of the GPU interface is installed on the compute instance and on the GPU server. Using the shared version of the GPU interface, the compute instance sends instructions to the virtual GPU over the network, and the virtual GPU generates GPU output associated with the instructions.
-
Publication No.: US12135980B2
Publication Date: 2024-11-05
Application No.: US17861795
Filing Date: 2022-07-11
Applicant: Amazon Technologies, Inc.
Inventor: Malcolm Featonby, Leslie Johann Lamprecht, John Merrill Phillips, Umesh Chandani, Roberto Pentz De Faria, Hou Liu, Ladan Mahabadi, Letian Feng
Abstract: Techniques for an optimization service of a service provider network to help optimize the selection, configuration, and utilization, of virtual machine (VM) instance types to support workloads on behalf of users. The optimization service may implement the techniques described herein at various stages in a life cycle of a workload to help optimize the performance of the workload, and reduce underutilization of computing resources. For example, the optimization service may perform techniques to help new users select an optimized VM instance type on which to initially launch their workload. Further, the optimization service may monitor a workload for the life of the workload, and determine new VM instance types, and/or configuration modifications, that optimize the performance of the workload. The optimization service may provide recommendations to users that help improve performance of their workloads, and that also increase the aggregate utilization of computing resources of the service provider network.
-
Publication No.: US10963984B2
Publication Date: 2021-03-30
Application No.: US16684985
Filing Date: 2019-11-15
Applicant: Amazon Technologies, Inc.
Inventor: Malcolm Featonby, Yuxuan Liu, Umesh Chandani, John Merrill Phillips, Jr., Adithya Bhat, Douglas Cotton Kurtz, Mihir Sadruddin Surani
Abstract: Methods, systems, and computer-readable media for interaction monitoring for virtualized graphics processing are disclosed. Execution of an application is initiated on a virtual compute instance that is implemented using CPU and memory resources of a server. Instruction calls are produced by the execution of the application and sent from the server to a graphics server over a network. The graphics server comprises a physical GPU, and a virtual GPU is implemented using the physical GPU and attached to the virtual compute instance. GPU output is generated at the graphics server based at least in part on execution of the instruction calls using the virtual GPU. A log of interactions between the application and the virtual GPU is stored. The interactions comprise the instruction calls sent to the graphics server and responses to the instruction calls sent to the virtual compute instance.
-
Publication No.: US20200310876A1
Publication Date: 2020-10-01
Application No.: US16368072
Filing Date: 2019-03-28
Applicant: Amazon Technologies, Inc.
Inventor: Malcolm Featonby, Leslie Johann Lamprecht, John Merrill Phillips, Umesh Chandani, Roberto Pentz De Faria, Hou Liu, Ladan Mahabadi, Letian Feng
Abstract: Techniques for an optimization service of a service provider network to help optimize the selection, configuration, and utilization, of virtual machine (VM) instance types to support workloads on behalf of users. The optimization service may implement the techniques described herein at various stages in a life cycle of a workload to help optimize the performance of the workload, and reduce underutilization of computing resources. For example, the optimization service may perform techniques to help new users select an optimized VM instance type on which to initially launch their workload. Further, the optimization service may monitor a workload for the life of the workload, and determine new VM instance types, and/or configuration modifications, that optimize the performance of the workload. The optimization service may provide recommendations to users that help improve performance of their workloads, and that also increase the aggregate utilization of computing resources of the service provider network.
-
Publication No.: US20200310851A1
Publication Date: 2020-10-01
Application No.: US16367768
Filing Date: 2019-03-28
Applicant: Amazon Technologies, Inc.
Inventor: Malcolm Featonby, Leslie Johann Lamprecht, John Merrill Phillips, Umesh Chandani, Roberto Pentz De Faria, Hou Liu, Ladan Mahabadi, Letian Feng
Abstract: Techniques for an optimization service of a service provider network to help optimize the selection, configuration, and utilization, of virtual machine (VM) instance types to support workloads on behalf of users. The optimization service may implement the techniques described herein at various stages in a life cycle of a workload to help optimize the performance of the workload, and reduce underutilization of computing resources. For example, the optimization service may perform techniques to help new users select an optimized VM instance type on which to initially launch their workload. Further, the optimization service may monitor a workload for the life of the workload, and determine new VM instance types, and/or configuration modifications, that optimize the performance of the workload. The optimization service may provide recommendations to users that help improve performance of their workloads, and that also increase the aggregate utilization of computing resources of the service provider network.
-
Publication No.: US20190355088A1
Publication Date: 2019-11-21
Application No.: US16530888
Filing Date: 2019-08-02
Applicant: Amazon Technologies, Inc.
Inventor: Douglas Cotton Kurtz, Malcolm Featonby, Umesh Chandani, Adithya Bhat, Yuxuan Liu, Mihir Sadruddin Surani
Abstract: Methods, systems, and computer-readable media for capacity reservation for virtualized graphics processing are disclosed. A request is received to attach a virtual GPU to a virtual compute instance. The request comprises one or more constraints. Availability information is retrieved from a data store that indicates virtual GPUs available in a provider network and matching the one or more constraints. A virtual GPU is selected from among the available virtual GPUs in the availability information. The selected virtual GPU is reserved for attachment to the virtual compute instance. The virtual compute instance is implemented using CPU resources and memory resources of a physical compute instance, the virtual GPU is implemented using a physical GPU in the provider network, and the physical GPU is accessible to the physical compute instance over a network.