- 专利标题: High perforamance machine learning inference framework for edge devices
-
申请号: US16179217申请日: 2018-11-02
-
公开(公告)号: US11301762B1公开(公告)日: 2022-04-12
- 发明人: Gang Chen , Long Gao , Eduardo Manuel Calleja
- 申请人: Amazon Technologies, Inc.
- 申请人地址: US WA Seattle
- 专利权人: Amazon Technologies, Inc.
- 当前专利权人: Amazon Technologies, Inc.
- 当前专利权人地址: US WA Seattle
- 代理机构: Nicholson De Vos Webster & Elliott LLP
- 主分类号: G06N5/02
- IPC分类号: G06N5/02 ; G06N20/00 ; G06F16/11
摘要:
Techniques for high-performance machine learning (ML) inference in heterogenous edge devices are described. A ML model trained using a variety of different frameworks is translated into a common format that is runnable by inferences engines of edge devices. The translated model is optimized in hardware-agnostic and/or hardware-specific ways to improve inference performance, and the optimized model is sent to the edge devices. The inference engine for any edge device can be accessed by a customer application using a same defined API, regardless of the hardware characteristics of the edge device or the original format of the ML model.
公开/授权文献
- US1266764A Projectile-rotator. 公开/授权日:1918-05-21
信息查询