-
Publication No.: US20230064057A1
Publication date: 2023-03-02
Application No.: US18048203
Filing date: 2022-10-20
Abstract: Techniques that facilitate model support in deep learning are provided. In one example, a system includes a graphics processing unit and a central processing unit memory. The graphics processing unit processes data to train a deep neural network. The central processing unit memory stores a portion of the data to train the deep neural network. During a forward pass of the deep neural network, which traverses the network's set of layers from a first layer to a last layer that provides the set of outputs, the graphics processing unit provides input data for a layer from the set of layers to the central processing unit memory.
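The abstract above describes copying each layer's input to CPU memory during the forward pass. The following is a minimal sketch of that idea, not the patented implementation: a plain dictionary (`host_store`) stands in for CPU memory, the layers are ordinary Python functions on scalars, and the backward step that reads the saved inputs back is an assumption added for illustration, since the abstract only describes the forward pass. All function and variable names are hypothetical.

```python
def forward_with_offload(layers, x, host_store):
    """Run the layers first to last, saving each layer's input in host_store.

    host_store is a dict standing in for CPU memory; on real hardware the
    assignment below would be a device-to-host copy.
    """
    for i, layer in enumerate(layers):
        host_store[i] = x  # offload this layer's input
        x = layer(x)
    return x


def backward_with_fetch(layer_grads, grad_out, host_store):
    """Assumed counterpart: walk the layers last to first, fetching saved inputs.

    layer_grads[i](x, g) returns the gradient with respect to layer i's input,
    given that input x and the upstream gradient g.
    """
    for i in reversed(range(len(layer_grads))):
        x = host_store.pop(i)  # fetch the input back, freeing the host copy
        grad_out = layer_grads[i](x, grad_out)
    return grad_out


# Toy network: y = (2 * x) ** 2, so dy/dx = 8 * x.
layers = [lambda x: 2 * x, lambda x: x * x]
layer_grads = [lambda x, g: 2 * g, lambda x, g: 2 * x * g]

store = {}
y = forward_with_offload(layers, 3, store)
dy_dx = backward_with_fetch(layer_grads, 1, store)
```

In this toy setup the saved inputs are the only state the backward step needs, which is why moving them out of accelerator memory between the two passes can reduce peak memory use there.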
-
Publication No.: US10209913B2
Publication date: 2019-02-19
Application No.: US15420629
Filing date: 2017-01-31
IPC classification: G06F3/06, G06F12/0802
Abstract: An iterative graph algorithm accelerating method, system, and computer program product include recording an order of access nodes in a memory layout, reordering the access nodes in the memory layout in accordance with the recorded order, and updating edge information of the reordered access nodes.
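The three steps named in the abstract above (record the access order, reorder the nodes, update the edges) can be sketched as follows. This is an illustrative sketch under stated assumptions, not the patented method: the graph is an edge list of integer node IDs, the recorded order is a list of node IDs in the order they were touched, and a node's new ID is simply its rank by first access. The function name `reorder_by_access` is hypothetical.

```python
def reorder_by_access(edges, access_trace):
    """Relabel nodes by first-access order and rewrite the edge list.

    edges:        list of (u, v) pairs using the original node IDs
    access_trace: node IDs in the order an earlier iteration touched them
    Returns (new_id, new_edges), where new_id maps old ID -> new ID.
    """
    new_id = {}
    # Steps 1 and 2: nodes get consecutive new IDs in first-access order,
    # so nodes visited together end up adjacent in the memory layout.
    for node in access_trace:
        if node not in new_id:
            new_id[node] = len(new_id)
    # Nodes that never appeared in the trace are appended afterwards.
    for u, v in edges:
        for node in (u, v):
            if node not in new_id:
                new_id[node] = len(new_id)
    # Step 3: update the edge information to refer to the new IDs.
    new_edges = [(new_id[u], new_id[v]) for u, v in edges]
    return new_id, new_edges


mapping, relabeled = reorder_by_access(
    edges=[(0, 3), (3, 2), (2, 1)],
    access_trace=[3, 2, 3, 0],
)
# mapping   == {3: 0, 2: 1, 0: 2, 1: 3}
# relabeled == [(2, 0), (0, 1), (1, 3)]
```

Because later iterations of an iterative graph algorithm tend to revisit nodes in a similar order, placing nodes in that order is a plausible way to improve cache locality.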
-
Publication No.: US11526759B2
Publication date: 2022-12-13
Application No.: US16180864
Filing date: 2018-11-05
Abstract: Techniques that facilitate model support in deep learning are provided. In one example, a system includes a graphics processing unit and a central processing unit memory. The graphics processing unit processes data to train a deep neural network. The central processing unit memory stores a portion of the data to train the deep neural network. During a forward pass of the deep neural network, which traverses the network's set of layers from a first layer to a last layer that provides the set of outputs, the graphics processing unit provides input data for a layer from the set of layers to the central processing unit memory.
-
Publication No.: US11915147B2
Publication date: 2024-02-27
Application No.: US18048203
Filing date: 2022-10-20
CPC classification: G06N3/084, G06F13/4282, G06N3/04
Abstract: Techniques that facilitate model support in deep learning are provided. In one example, a system includes a graphics processing unit and a central processing unit memory. The graphics processing unit processes data to train a deep neural network. The central processing unit memory stores a portion of the data to train the deep neural network. During a forward pass of the deep neural network, which traverses the network's set of layers from a first layer to a last layer that provides the set of outputs, the graphics processing unit provides input data for a layer from the set of layers to the central processing unit memory.
-
Publication No.: US20220121924A1
Publication date: 2022-04-21
Application No.: US17075963
Filing date: 2020-10-21
Inventors: Ulrich Alfons Finkler, Michele Merler, Mayoore Selvarasa Jaiswal, Hui Wu, Rameswar Panda, Wei Zhang
Abstract: An embodiment includes identifying an initial plurality of sets of hyperparameter values at which to evaluate an objective function that relates hyperparameter values to performance values of a neural network. The embodiment also executes training processes on the neural network with the hyperparameters set to each of the initial sets of hyperparameter values such that the training processes provide an initial set of the performance values for the objective function. The embodiment also generates an approximation of the objective function using splines at selected performance values. The embodiment approximates a point at which the approximation of the objective function reaches a maximum value, then determines an updated set of hyperparameter values associated with the maximum value. The embodiment then executes a runtime process using the neural network with the hyperparameters set to the updated set of hyperparameter values.
-
Publication No.: US20180217775A1
Publication date: 2018-08-02
Application No.: US15420629
Filing date: 2017-01-31
IPC classification: G06F3/06, G06F12/0802
CPC classification: G06F12/0802, G06F12/0893, G06F2212/1008
Abstract: An iterative graph algorithm accelerating method, system, and computer program product include recording an order of access nodes in a memory layout, reordering the access nodes in the memory layout in accordance with the recorded order, and updating edge information of the reordered access nodes.
-
Publication No.: US11557053B2
Publication date: 2023-01-17
Application No.: US16785469
Filing date: 2020-02-07
Inventors: Rui Zhang, Conrad M. Albrecht, Siyuan Lu, Wei Zhang, Ulrich Alfons Finkler, David S. Kung, Xiaodong Cui, Marcus Freitag
Abstract: Techniques for image processing and transformation are provided. A plurality of images and a plurality of maps are received, and a system of neural networks is trained based on the plurality of images and the plurality of maps. A first image is received, and a first map is generated by processing the first image using the system of neural networks.
-
Publication No.: US10740232B2
Publication date: 2020-08-11
Application No.: US16217163
Filing date: 2018-12-12
IPC classification: G06F3/06, G06F12/0802, G06F12/0893
Abstract: An iterative graph algorithm accelerating method, system, and computer program product include recording an order of access nodes in a memory layout, reordering the access nodes in the memory layout in accordance with the recorded order, and updating edge information of the reordered access nodes.
-
Publication No.: US10353821B2
Publication date: 2019-07-16
Application No.: US15189132
Filing date: 2016-06-22
IPC classification: G06F12/08, G06F12/0888, G06F12/0804, G06F12/1009, G06F12/109
Abstract: A parallel execution method, system, and non-transitory computer readable medium, not maintaining cache coherence, include creating a continuum, the continuum being a construct that holds data structures; giving a view to the continuum, the view being a descriptor that provides access rights and properties for the continuum; and performing a task associated with an execution sequence, the task holding the view to the continuum that the execution sequence is accessing.
-
Publication No.: US20190213134A1
Publication date: 2019-07-11
Application No.: US16357751
Filing date: 2019-03-19
IPC classification: G06F12/0888, G06F12/1009, G06F12/0804, G06F12/109
CPC classification: G06F12/0888, G06F12/0804, G06F12/1009, G06F12/109, G06F2212/6042, G06F2212/657
Abstract: A parallel execution method, system, and non-transitory computer readable medium include creating a continuum, where the continuum includes a construct that holds data structures and where the continuum enables redirection of memory allocation and deallocation within a marked code section of a virtual address range.