Neural processing device and method for job scheduling thereof

    公开(公告)号:US12086633B2

    公开(公告)日:2024-09-10

    申请号:US18495682

    申请日:2023-10-26

    Inventor: Seokju Yoon

    CPC classification number: G06F9/4881 G06F9/544

    Abstract: A neural processing device and a method for job scheduling are provided. The neural processing device configured to receive, by an address space ID (ASID) manager, first and second requests from at least one context, respectively, and determine whether ASIDs are allocated, store jobs of contexts to which the ASIDs have not been allocated from the ASID manager in entities, schedule, by a job scheduler, an execution order of the jobs stored in the entities and cause the ASID manager to allocate the ASIDs to the contexts to which the ASIDs have not been allocated among the at least one context, and sequentially receive, by a command queue, jobs of contexts to which the ASIDs have been allocated, store the jobs as standby jobs, and sequentially execute the standby jobs.

    NEURAL PROCESSING DEVICE AND METHOD FOR USING SHARED PAGE TABLE THEREOF

    公开(公告)号:US20230418695A1

    公开(公告)日:2023-12-28

    申请号:US18463238

    申请日:2023-09-07

    Inventor: Seokju Yoon

    CPC classification number: G06F9/544 G06F12/1027 G06F12/1009 G06F2212/68

    Abstract: A neural processing device and a method for using shared page table thereof are provided. The neural processing device including at least one neural processor, a shared memory shared by the at least one neural processor, and a global interconnection configured to exchange data between the at least one neural processor and the shared memory, comprises at least one processing unit each of which included in each of the at least one neural processor and configured to provide logical addresses, a memory management unit configured to receive and translate the logical addresses into physical addresses, and a physical memory accessible by the physical addresses, wherein the memory management unit comprises a shared page table that has translation information between the logical addresses and the physical addresses and is shared by at least one process with each other.

    Method and system for recovering data associated with artificial intelligence calculation

    公开(公告)号:US12111732B1

    公开(公告)日:2024-10-08

    申请号:US18638482

    申请日:2024-04-17

    Inventor: Seokju Yoon

    CPC classification number: G06F11/1415 G06F11/0736 G06F11/076

    Abstract: Provided is a method for recovering data for artificial intelligence calculation, which is performed by one or more processors and which includes extracting, from a command queue, a descriptor associated with a target job, that is a job to be executed, of a plurality of jobs, executing at least one command associated with the extracted descriptor to execute artificial intelligence calculation associated with the target job, resetting the command queue if an error occurs while executing the artificial intelligence calculation, determining at least one descriptor to be recovered, based on tracking data associated with at least one job, and recovering the determined at least one descriptor to the reset command queue.

    Neural processing device and method for using shared page table thereof

    公开(公告)号:US11789791B1

    公开(公告)日:2023-10-17

    申请号:US18312500

    申请日:2023-05-04

    Inventor: Seokju Yoon

    CPC classification number: G06F9/544 G06F12/1009 G06F12/1027 G06F2212/68

    Abstract: A neural processing device and a method for using shared page table thereof are provided. The neural processing device including at least one neural processor, a shared memory shared by the at least one neural processor, and a global interconnection configured to exchange data between the at least one neural processor and the shared memory, comprises at least one processing unit each of which included in each of the at least one neural processor and configured to provide logical addresses, a memory management unit configured to receive and translate the logical addresses into physical addresses, and a physical memory accessible by the physical addresses, wherein the memory management unit comprises a shared page table that has translation information between the logical addresses and the physical addresses and is shared by at least one process with each other.

    METHOD AND SYSTEM FOR RECOVERING DATA ASSOCIATED WITH ARTIFICIAL INTELLIGENCE CALCULATION

    公开(公告)号:US20250053480A1

    公开(公告)日:2025-02-13

    申请号:US18824787

    申请日:2024-09-04

    Inventor: Seokju Yoon

    Abstract: Provided is a method for recovering data for artificial intelligence calculation, which is performed by one or more processors and which includes extracting, from a command queue, a descriptor associated with a target job, that is a job to be executed, of a plurality of jobs, executing at least one command associated with the extracted descriptor to execute artificial intelligence calculation associated with the target job, resetting the command queue if an error occurs while executing the artificial intelligence calculation, determining at least one descriptor to be recovered, based on tracking data associated with at least one job, and recovering the determined at least one descriptor to the reset command queue.

    Neural processing device and method for job scheduling thereof

    公开(公告)号:US11861401B2

    公开(公告)日:2024-01-02

    申请号:US18312490

    申请日:2023-05-04

    Inventor: Seokju Yoon

    CPC classification number: G06F9/4881 G06F9/544

    Abstract: A neural processing device and a method for job scheduling are provided. The neural processing device configured to receive, by an address space ID (ASID) manager, first and second requests from at least one context, respectively, and determine whether ASIDs are allocated, store jobs of contexts to which the ASIDs have not been allocated from the ASID manager in entities, schedule, by a job scheduler, an execution order of the jobs stored in the entities and cause the ASID manager to allocate the ASIDs to the contexts to which the ASIDs have not been allocated among the at least one context, and sequentially receive, by a command queue, jobs of contexts to which the ASIDs have been allocated, store the jobs as standby jobs, and sequentially execute the standby jobs.

    NEURAL PROCESSING DEVICE AND METHOD FOR JOB SCHEDULING THEREOF

    公开(公告)号:US20240061707A1

    公开(公告)日:2024-02-22

    申请号:US18495682

    申请日:2023-10-26

    Inventor: Seokju Yoon

    CPC classification number: G06F9/4881 G06F9/544

    Abstract: A neural processing device and a method for job scheduling are provided. The neural processing device configured to receive, by an address space ID (ASID) manager, first and second requests from at least one context, respectively, and determine whether ASIDs are allocated, store jobs of contexts to which the ASIDs have not been allocated from the ASID manager in entities, schedule, by a job scheduler, an execution order of the jobs stored in the entities and cause the ASID manager to allocate the ASIDs to the contexts to which the ASIDs have not been allocated among the at least one context, and sequentially receive, by a command queue, jobs of contexts to which the ASIDs have been allocated, store the jobs as standby jobs, and sequentially execute the standby jobs.

    NEURAL PROCESSING DEVICE AND METHOD FOR JOB SCHEDULING THEREOF

    公开(公告)号:US20230385105A1

    公开(公告)日:2023-11-30

    申请号:US18312490

    申请日:2023-05-04

    Inventor: Seokju Yoon

    CPC classification number: G06F9/4881 G06F9/544

    Abstract: A neural processing device and a method for job scheduling are provided. The neural processing device configured to receive, by an address space ID (ASID) manager, first and second requests from at least one context, respectively, and determine whether ASIDs are allocated, store jobs of contexts to which the ASIDs have not been allocated from the ASID manager in entities, schedule, by a job scheduler, an execution order of the jobs stored in the entities and cause the ASID manager to allocate the ASIDs to the contexts to which the ASIDs have not been allocated among the at least one context, and sequentially receive, by a command queue, jobs of contexts to which the ASIDs have been allocated, store the jobs as standby jobs, and sequentially execute the standby jobs.

Patent Agency Ranking