Concurrent training of functional subnetworks of a neural network

    公开(公告)号:US11836610B2

    公开(公告)日:2023-12-05

    申请号:US15841030

    申请日:2017-12-13

    CPC classification number: G06N3/08 G06N3/045

    Abstract: An artificial neural network that includes first subnetworks to implement known functions and second subnetworks to implement unknown functions is trained. The first subnetworks are trained separately and in parallel on corresponding known training datasets to determine first parameter values that define the first subnetworks. The first subnetworks are executing on a plurality of processing elements in a processing system. Input values from a network training data set are provided to the artificial neural network including the trained first subnetworks. Error values are generated by comparing output values produced by the artificial neural network to labeled output values of the network training data set. The second subnetworks are trained by back propagating the error values to modify second parameter values that define the second subnetworks without modifying the first parameter values. The first and second parameter values are stored in a storage component.

    AUTOMATED USE OF COMPUTATIONAL MOTIFS VIA DEEP LEARNING DETECTION

    公开(公告)号:US20230205517A1

    公开(公告)日:2023-06-29

    申请号:US17562921

    申请日:2021-12-27

    CPC classification number: G06F8/71 G06F8/4434 G06F8/4432 G06F9/45516

    Abstract: A system and method are described for efficiently utilizing optimized implementations of computational patterns in an application. In various implementations, a computing system includes at least one or more processors, and these one or more processors and other hardware resources of the computing system process a variety of applications. Sampled, dynamic values of hardware performance counters are sent to a trained data model. The data model provides characterization of the computational patterns being used and the types of workloads being processed. The data model also indicates whether the identified computational patterns already use an optimized version. Later, a selected processor determines a given unoptimized computational pattern is no longer running and replaces this computational pattern with an optimized version. Although the application is still running, the processor performs a static replacement. On a next iteration of the computational pattern, the optimized version is run.

Patent Agency Ranking