DEEP LEARNING FRAMEWORK SCHEDULING

    公开(公告)号:US20220222111A1

    公开(公告)日:2022-07-14

    申请号:US17707895

    申请日:2022-03-29

    Abstract: A scheduling method for a deep learning framework, a scheduling apparatus, an electronic device, a storage medium, and a program product is provided, and can be used in the field of artificial intelligence, especially in the fields of machine learning, deep learning, etc. The method includes: receiving a processing request for processing a plurality of tasks by using a dedicated processing unit, the processing request including scheduling requirements for the plurality of tasks, and each of the plurality of tasks being associated with execution of multi-batch data processing; and scheduling, based on the scheduling requirements for the plurality of tasks in batches of data, the dedicated processing unit to process the plurality of tasks.

    METHOD AND APPARATUS OF TRAINING MODEL, DEVICE, MEDIUM, AND PROGRAM PRODUCT

    公开(公告)号:US20220004811A1

    公开(公告)日:2022-01-06

    申请号:US17479061

    申请日:2021-09-20

    Abstract: There is provided a method and apparatus of training a model, a device, and a medium, which relate to artificial intelligence, and in particular to a deep learning and image processing technology. The method may include: determining a plurality of augmented sample sets associated with a plurality of original samples; determining a first constraint according to a first model based on the plurality of augmented sample sets; determining a second constraint according to the first model and a second model based on the plurality of augmented sample sets, wherein the second constraint is associated with a difference between outputs of the first model and the second model for one augmented sample, and the first model has a complexity lower than that of the second model; training the first model based on at least the first constraint and the second constraint, so as to obtain a trained first model.

    METHOD AND APPARATUS FOR DISTRIBUTING NETWORK LAYERS IN NEURAL NETWORK MODEL

    公开(公告)号:US20230206075A1

    公开(公告)日:2023-06-29

    申请号:US17991077

    申请日:2022-11-21

    CPC classification number: G06N3/082 G06N3/04

    Abstract: A method for distributing network layers in a neural network model includes: acquiring a to-be-processed neural network model and a computing device set; generating a target number of distribution schemes according to network layers in the to-be-processed neural network model and computing devices in the computing device set, the distribution schemes including corresponding relationships between the network layers and the computing devices; according to device types of the computing devices, combining the network layers corresponding to the same device type in each distribution scheme into one stage, to obtain a combination result of each distribution scheme; obtaining an adaptive value of each distribution scheme according to the combination result of each distribution scheme; and determining a target distribution scheme from the distribution schemes according to respective adaptive value, and taking the target distribution scheme as a distribution result of the network layers in the to-be-processed neural network model.

    METHOD AND SYSTEM OF TRAINING DEEP LEARNING MODEL, DEVICE, AND MEDIUM

    公开(公告)号:US20240394190A1

    公开(公告)日:2024-11-28

    申请号:US18696757

    申请日:2022-09-27

    Abstract: The present application provides a method of training a deep learning model. A specific implementation solution of the method of training the deep learning model includes: determining, according to first training data for a current training round, a first target parameter required to be written into a target memory in a first network parameter required by an embedding of the first training data, wherein the target memory is a memory contained in a target processor; determining a remaining storage slot in the target memory according to a first mapping relationship between a storage slot of the target memory and a network parameter; and writing, in response to the remaining storage slot meeting a storage requirement of the first target parameter, the first target parameter into the target memory so that a computing core contained in the target processor adjusts the first network parameter according to the first training data.

    IMAGE PROCESSING
    10.
    发明申请

    公开(公告)号:US20230085732A1

    公开(公告)日:2023-03-23

    申请号:US18058543

    申请日:2022-11-23

    Abstract: The present disclosure provides an image processing method and apparatus, and relates to the field of image processing, and in particular to the field of image annotation. An implementation is: obtaining an image to be processed including a target region to be annotated; in response to a first click on the target region, performing a first operation to expand a predicted region for the target region based on a click position of the first click; in response to a second click in a position where the predicted region exceeds the target region, performing a second operation to reduce the predicted region based on a click position of the second click; and in response to determining that a difference between the predicted region and the target region meets a preset condition, obtaining an outline of the predicted region to annotate the target region.

Patent Agency Ranking