Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Samuel Jacob"

1.

发明授权
Dynamic code loading for multiple executions on a sequential processor 有权

公开(公告)号：US11809953B1

公开(公告)日：2023-11-07

申请号：US17902702

申请日：2022-09-02

Applicant: Amazon Technologies, Inc.

Inventor： Samuel Jacob , Ilya Minkin , Mohammad El-Shabani

IPC: G06F1/00 , G06N3/063 , G06N5/04

CPC classification number: G06N3/063 , G06N5/04

Abstract: Embodiments include techniques for enabling execution of N inferences on an execution engine of a neural network device. Instruction code for a single inference is stored in a memory that is accessible by a DMA engine, the instruction code forming a regular code block. A NOP code block and a reset code block for resetting an instruction DMA queue are stored in the memory. The instruction DMA queue is generated such that, when it is executed by the DMA engine, it causes the DMA engine to copy, for each of N inferences, both the regular code block and an additional code block to an instruction buffer. The additional code block is the NOP code block for the first N−1 inferences and is the reset code block for the Nth inference. When the reset code block is executed by the execution engine, the instruction DMA queue is reset.

2.

发明授权
Breakpoints in neural network accelerator 有权

公开(公告)号：US11467946B1

公开(公告)日：2022-10-11

申请号：US16368351

申请日：2019-03-28

Applicant: Amazon Technologies, Inc.

Inventor： Samuel Jacob , Drazen Borkovic , Yu Zhou , Mohammad El-Shabani

IPC: G06F11/36 , G06F8/41 , G06N3/10

Abstract: Techniques are disclosed for setting a breakpoint for debugging a neural network. User input is received by a debugger program executable by a host processor indicating a target layer of a neural network at which to halt execution of the neural network. The neural network includes a first set of instructions to be executed by a first execution engine and a second set of instructions to be executed by a second execution engine. A first halt point is set within the first set of instructions and a second halt point is set within the second set of instructions. It is then determined that operation of the first execution engine and the second execution engine has halted. It is then determined that the first execution engine has reached the first halt point. The second execution engine is then caused to move through instructions until reaching the second halt point.

3.

发明授权
Non-intrusive hardware profiling 有权

公开(公告)号：US11119787B1

公开(公告)日：2021-09-14

申请号：US16368263

申请日：2019-03-28

Applicant: Amazon Technologies, Inc.

Inventor： Mohammad El-Shabani , Ron Diamant , Samuel Jacob , Ilya Minkin , Richard John Heaton

IPC: G06F9/44 , G06F8/41 , G06F11/30 , G06F9/38 , G06F11/22 , G06F9/455 , G06F11/36 , G06F9/445 , G06F11/34 , G06F9/30

Abstract: Systems and methods for non-intrusive hardware profiling are provided. In some cases integrated circuit devices can be manufactured without native support for performance measurement and/or debugging capabilities, thereby limiting visibility into the integrated circuit device. Understanding the timing of operations can help to determine whether the hardware of the device is operating correctly and, when the device is not operating correctly, provide information that can be used to debug the device. In order to measure execution time of various tasks performed by the integrated circuit device, program instructions may be inserted to generate notifications that provide tracing information, including timestamps, for operations executed by the integrated circuit device.

4.

发明授权
Breakpoints in neural network accelerator 有权

公开(公告)号：US12210438B1

公开(公告)日：2025-01-28

申请号：US17947949

申请日：2022-09-19

Applicant: Amazon Technologies, Inc.

Inventor： Samuel Jacob , Drazen Borkovic , Yu Zhou , Mohammad El-Shabani

IPC: G06F11/36 , G06F8/41 , G06N3/10

Abstract: Techniques are disclosed for setting a breakpoint for debugging a neural network. User input is received by a debugger program executable by a host processor indicating a target layer of a neural network at which to halt execution of the neural network. The neural network includes a first set of instructions to be executed by a first execution engine and a second set of instructions to be executed by a second execution engine. A first halt point is set within the first set of instructions and a second halt point is set within the second set of instructions. It is then determined that operation of the first execution engine and the second execution engine has halted. It is then determined that the first execution engine has reached the first halt point. The second execution engine is then caused to move through instructions until reaching the second halt point.

5.

发明授权
Dynamic code loading for multiple executions on a sequential processor 有权

公开(公告)号：US11461622B2

公开(公告)日：2022-10-04

申请号：US16457268

申请日：2019-06-28

Applicant: Amazon Technologies, Inc.

Inventor： Samuel Jacob , Ilya Minkin , Mohammad El-Shabani

IPC: G06F1/00 , G06N3/063 , G06N5/04

Abstract: Embodiments include techniques for enabling execution of N inferences on an execution engine of a neural network device. Instruction code for a single inference is stored in a memory that is accessible by a DMA engine, the instruction code forming a regular code block. A NOP code block and a reset code block for resetting an instruction DMA queue are stored in the memory. The instruction DMA queue is generated such that, when it is executed by the DMA engine, it causes the DMA engine to copy, for each of N inferences, both the regular code block and an additional code block to an instruction buffer. The additional code block is the NOP code block for the first N−1 inferences and is the reset code block for the Nth inference. When the reset code block is executed by the execution engine, the instruction DMA queue is reset.

Patent Agency Ranking