-
公开(公告)号:US11423283B1
公开(公告)日:2022-08-23
申请号:US15933114
申请日:2018-03-22
Applicant: Amazon Technologies, Inc.
Inventor: Hagay Lupesko , Dominic Rajeev Divakaruni , Jonathan Esterhazy , Sandeep Krishnamurthy , Vikram Madan , Roshani Nagmote , Naveen Mysore Nagendra Swamy , Yao Wang
Abstract: Techniques for model adaptation are described. For example, a method of receiving a call to provide either a model variant or a model variant profile of a deep learning model, the call including desired performance of the deep learning model, a deep learning model identifier, and current edge device characteristics; comparing the received current edge device characteristics to available model variants and profiles based on the desired performance of the deep learning model to generate or select a model variant or profile, the available model variants and profiles determined by the model identifier; and sending the generated or selected model variant or profile to the edge device to use in inference is detailed.
-
公开(公告)号:US09892755B1
公开(公告)日:2018-02-13
申请号:US14671962
申请日:2015-03-27
Applicant: Amazon Technologies, Inc.
Inventor: Evan Walton Layman , Hagay Lupesko , Johnson Cheng , Haydn Lee Gilbert , Travis Ronald Langner , Rickesh Pal , Ivo Pletikosic
CPC classification number: G11B20/10527 , G06F3/165 , G11B2020/10546
Abstract: Technology is described for directing media content to a target device. A media content directing request may be received from a source device indicating that the source device intends to direct media content to the target device that is available for media content playback. A list of available target devices for media content playback may be provided to the source device. A playback message may be received from the source device that includes a selection of the target device from the list of available target devices. Communication of the playback message to the target device may be facilitated to initiate playback of media content from a media content playback server as directed by the source device.
-
公开(公告)号:US11763154B1
公开(公告)日:2023-09-19
申请号:US16262677
申请日:2019-01-30
Applicant: Amazon Technologies, Inc.
Inventor: Hagay Lupesko , Anirudh Acharya , Lee Cheng-Che , Lai Wei , Kalyanee Chendke , Ankit Khedia , Vandana Kannan , Sandeep Krishnamurthy , Roshani Nagmote
Abstract: Features related to systems and methods for automated generation of a machine learning model based in part on a pretrained model are described. The pretrained model is used as a starting point to augment and retrain according to client specifications. The identification of an appropriate pretrained model is based on the client specifications such as model inputs, model outputs, and similarities between the data used to train the models.
-
公开(公告)号:US11769035B1
公开(公告)日:2023-09-26
申请号:US16219751
申请日:2018-12-13
Applicant: Amazon Technologies, Inc.
Inventor: Lai Wei , Hagay Lupesko , Anirudh Acharya , Ankit Khedia , Sandeep Krishnamurthy , Cheng-Che Lee , Kalyanee Shriram Chendke , Vandana Kannan , Roshani Nagmote
Abstract: Techniques are described automatically determining runtime configurations used to execute recurrent neural networks (RNNs) for training or inference. One such configuration involves determining whether to execute an RNN in a looped, or “rolled,” execution pattern or in a non-looped, or “unrolled,” execution pattern. Execution of an RNN using a rolled execution pattern generally consumes less memory resources than execution using an unrolled execution pattern, whereas execution of an RNN using an unrolled execution pattern typically executes faster. The configuration choice thus involves a time-memory tradeoff that can significantly affect the performance of the RNN execution. This determination is made automatically by a machine learning (ML) runtime by analyzing various factors such as, for example, a type of RNN being executed, the network structure of the RNN, characteristics of the input data to the RNN, an amount of computing resources available, and so forth.
-
公开(公告)号:US10949252B1
公开(公告)日:2021-03-16
申请号:US15895747
申请日:2018-02-13
Applicant: Amazon Technologies, Inc.
Inventor: Sandeep Krishnamurthy , Jiajie Chen , Jonathan Esterhazy , Naveen Mysore Nagendra Swamy , Ruofei Yu , Yao Wang , Roshani Nagmote , Hagay Lupesko , Vikram Madan
Abstract: Techniques for benchmarking a machine learning model/algorithm are described. For example, in some instances a method includes generating an execution plan for benchmarking of at least one task corresponding to a machine learning model based on an identified machine learning model, identified training data, and at least one objective for the benchmarking job; receiving execution statistics about the execution of the task as a part of the benchmarking job according to the execution plan; and updating the execution plan based at least in part on the received execution statistics of the task.
-
公开(公告)号:US20230368028A1
公开(公告)日:2023-11-16
申请号:US18217929
申请日:2023-07-03
Applicant: Amazon Technologies, Inc.
Inventor: Hagay Lupesko , Anirudh Acharya , Cheng-Che Lee , Lai Wei , Kalyanee Chendke , Ankit Khedia , Vandana Kannan , Sandeep Krishnamurthy , Roshani Nagmote
Abstract: Features related to systems and methods for automated generation of a machine learning model based in part on a pretrained model are described. The pretrained model is used as a starting point to augment and retrain according to client specifications. The identification of an appropriate pretrained model is based on the client specifications such as model inputs, model outputs, and similarities between the data used to train the models.
-
-
-
-
-