METHOD OF CONSTRUCTING NETWORK MODEL FOR DEEP LEARNING, DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20220058490A1

    公开(公告)日:2022-02-24

    申请号:US17519815

    申请日:2021-11-05

    Abstract: A method and apparatus of constructing a network model for deep learning, a device, and a storage medium, which relate to artificial intelligence, and in particular to a field of deep learning. The method of constructing the network model for deep learning includes: determining an execution mode for executing codes, based on a mode parameter; executing the codes by using a first component, which is executable in a first execution mode, through a syntax element in the codes, in response to determining that the execution mode is the first execution mode; and executing the codes by using a second component, which is executable in a second execution mode, through the syntax element, in response to determining that the execution mode is the second execution mode; wherein the first component and the second component have the same component interface, and the syntax element corresponds to the component interface.

    DEEP LEARNING FRAMEWORK SCHEDULING

    公开(公告)号:US20220222111A1

    公开(公告)日:2022-07-14

    申请号:US17707895

    申请日:2022-03-29

    Abstract: A scheduling method for a deep learning framework, a scheduling apparatus, an electronic device, a storage medium, and a program product is provided, and can be used in the field of artificial intelligence, especially in the fields of machine learning, deep learning, etc. The method includes: receiving a processing request for processing a plurality of tasks by using a dedicated processing unit, the processing request including scheduling requirements for the plurality of tasks, and each of the plurality of tasks being associated with execution of multi-batch data processing; and scheduling, based on the scheduling requirements for the plurality of tasks in batches of data, the dedicated processing unit to process the plurality of tasks.

    METHOD AND APPARATUS OF TRAINING MODEL, DEVICE, MEDIUM, AND PROGRAM PRODUCT

    公开(公告)号:US20220004811A1

    公开(公告)日:2022-01-06

    申请号:US17479061

    申请日:2021-09-20

    Abstract: There is provided a method and apparatus of training a model, a device, and a medium, which relate to artificial intelligence, and in particular to a deep learning and image processing technology. The method may include: determining a plurality of augmented sample sets associated with a plurality of original samples; determining a first constraint according to a first model based on the plurality of augmented sample sets; determining a second constraint according to the first model and a second model based on the plurality of augmented sample sets, wherein the second constraint is associated with a difference between outputs of the first model and the second model for one augmented sample, and the first model has a complexity lower than that of the second model; training the first model based on at least the first constraint and the second constraint, so as to obtain a trained first model.

    OPERATOR PROCESSING METHOD OF DEEP LEARNING FRAMEWORK, DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20250005446A1

    公开(公告)日:2025-01-02

    申请号:US18547090

    申请日:2022-11-02

    Abstract: An operator processing method of a deep learning framework an electronic device, and a storage medium are provided, which relate to a field of computer technology, especially in a field of artificial intelligence technology such as deep learning. The specific implementation scheme includes: acquiring an operator to be processed, where the operator to be processed includes a template parameter independent of the deep learning framework and an operator kernel function; parsing, in response to receiving an input information for the operator to be processed, the template parameter by using the input information to obtain a plurality of complete template parameters related to the deep learning framework; and processing the operator kernel function according to the plurality of complete template parameters, to obtain an available operator for the deep learning framework.

Patent Agency Ranking