Automatically determining configurations for executing recurrent neural networks

    公开(公告)号:US11769035B1

    公开(公告)日:2023-09-26

    申请号:US16219751

    申请日:2018-12-13

    CPC classification number: G06N3/04 G06N3/08

    Abstract: Techniques are described automatically determining runtime configurations used to execute recurrent neural networks (RNNs) for training or inference. One such configuration involves determining whether to execute an RNN in a looped, or “rolled,” execution pattern or in a non-looped, or “unrolled,” execution pattern. Execution of an RNN using a rolled execution pattern generally consumes less memory resources than execution using an unrolled execution pattern, whereas execution of an RNN using an unrolled execution pattern typically executes faster. The configuration choice thus involves a time-memory tradeoff that can significantly affect the performance of the RNN execution. This determination is made automatically by a machine learning (ML) runtime by analyzing various factors such as, for example, a type of RNN being executed, the network structure of the RNN, characteristics of the input data to the RNN, an amount of computing resources available, and so forth.

    Provisioning information technology (IT) infrastructures based on images of system architecture diagrams

    公开(公告)号:US10997409B1

    公开(公告)日:2021-05-04

    申请号:US16001618

    申请日:2018-06-06

    Abstract: Techniques are described for using machine learning (ML) models to create information technology (IT) infrastructures at a service provider network based on image of IT system architecture diagrams. To create IT system architecture diagrams, system architects often use tools ranging from pen and paper and whiteboards to various types of software-based drawing programs. Based on a user-provided image of an IT system architecture diagram (for example, a digital scan of a hand drawn system diagram, an image file created by a software-based drawing program, or the like), a service provider network uses one or more ML models to analyze the image to identify the constituent elements of the depicted IT system architecture and to create an infrastructure template that can be used to automatically provision corresponding computing resources at the service provider network.

Patent Agency Ranking