Online distillation using frame cache

    Publication No.: US11620812B2

    Publication date: 2023-04-04

    Application No.: US17131045

    Filing date: 2020-12-22

    Abstract: An image analysis apparatus includes: a first analysis unit that analyzes image frames by using a first image analysis model; a second analysis unit that can analyze the image frames by using a second image analysis model whose analysis accuracy is lower than that of the first; a storage unit that stores analyzed frames, which have already been analyzed by using the first and second image analysis models, in association with an evaluation value for evaluating the result of the analysis performed with the second image analysis model; an extraction unit that extracts an analyzed frame that satisfies an extraction condition based on the evaluation value; and an update unit that updates the second image analysis model by using the result of the analysis performed with the first image analysis model on the extracted analyzed frame, and the result of the analysis performed with the first image analysis model on a new frame.
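The update loop described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the one-parameter `Student`, the stand-in `teacher` function, the use of absolute error as the evaluation value, and the threshold-based extraction condition are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class CachedFrame:
    frame: float        # stand-in for an image frame (a single feature value)
    teacher_out: float  # result from the first, more accurate model
    eval_value: float   # assumed evaluation value: student's absolute error


def teacher(x: float) -> float:
    """Hypothetical first (accurate) model."""
    return 2.0 * x


@dataclass
class Student:
    """Hypothetical second (lighter) model: y = w * x."""
    w: float = 0.0

    def predict(self, x: float) -> float:
        return self.w * x

    def update(self, batch, lr: float = 0.05) -> None:
        # One SGD step per (frame, teacher result) pair on squared error.
        for x, target in batch:
            grad = 2.0 * (self.w * x - target) * x
            self.w -= lr * grad


def online_step(student, cache, new_frame, threshold=0.1):
    """Update the student from extracted cached frames plus the new frame."""
    t_new = teacher(new_frame)
    # Assumed extraction condition: frames the student still gets wrong.
    extracted = [c for c in cache if c.eval_value >= threshold]
    batch = [(c.frame, c.teacher_out) for c in extracted]
    batch.append((new_frame, t_new))
    student.update(batch)
    # Refresh evaluation values, then store the new frame in the cache.
    for c in cache:
        c.eval_value = abs(student.predict(c.frame) - c.teacher_out)
    cache.append(
        CachedFrame(new_frame, t_new, abs(student.predict(new_frame) - t_new))
    )
```

Calling `online_step` once per incoming frame keeps the teacher's results for hard frames in play while the student adapts to new ones; a real system would also bound the cache size.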

    Visual object tracking method, visual object tracking system, machine learning method, and learning system

    Publication No.: US12211218B2

    Publication date: 2025-01-28

    Application No.: US17765946

    Filing date: 2019-10-07

    Abstract: An estimation unit of a visual object tracking apparatus estimates a plurality of estimated bounding boxes and estimated object IDs respectively corresponding to the estimated bounding boxes based on a plurality of predicted bounding boxes and a plurality of detected bounding boxes. For example, the “detected bounding boxes” are “bounding boxes (bounded areas)” detected by a detector in each of a plurality of frames in a time series such as moving images. The “bounding box” is a frame surrounding an image of an object detected in a frame. For example, the “predicted bounding box” is a “bounding box” predicted by a predictor based on an estimated bounding box(es) estimated for one or a plurality of frames in the past.
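To make the relationship between predicted and detected bounding boxes concrete, here is a small sketch that assigns object IDs to detections by greedy IoU matching against predicted boxes. The abstract does not say how the estimation unit combines the two sets, so the matching rule, the `min_iou` threshold, and the function names are assumptions chosen for illustration.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def estimate(predicted, detected, next_id, min_iou=0.3):
    """Assign an object ID to each detected box.

    predicted: dict mapping object ID -> predicted box
    detected:  list of detected boxes for the current frame
    Returns (list of (box, object_id), updated next_id).
    """
    # Score every predicted/detected pair and visit the best overlaps first.
    pairs = sorted(
        ((iou(p, d), pid, di)
         for pid, p in predicted.items()
         for di, d in enumerate(detected)),
        reverse=True,
    )
    used_ids, matched = set(), {}
    for score, pid, di in pairs:
        if score < min_iou:
            break
        if pid in used_ids or di in matched:
            continue
        used_ids.add(pid)
        matched[di] = pid

    estimated = []
    for di, box in enumerate(detected):
        if di in matched:
            estimated.append((box, matched[di]))
        else:
            # Unmatched detection: treat it as a new object.
            estimated.append((box, next_id))
            next_id += 1
    return estimated, next_id
```

In this sketch the estimated box is simply the detected box; a tracker could instead blend the predicted and detected coordinates.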

    Data compression system and method of using

    Publication No.: US11764806B2

    Publication date: 2023-09-19

    Application No.: US17467282

    Filing date: 2021-09-06

    Abstract: A system includes a non-transitory computer readable medium configured to store instructions thereon; and a processor connected to the non-transitory computer readable medium. The processor is configured to execute the instructions for generating a mask based on received data from a sensor, wherein the mask includes a plurality of importance values, and each region of the received data is designated a corresponding importance value of the plurality of importance values. The processor is configured to execute the instructions for encoding the received data based on the mask; and transmitting the encoded data to a decoder for defining reconstructed data. The processor is configured to execute the instructions for computing a loss based on the reconstructed data, the received data and the mask. The processor is configured to execute the instructions for providing training to an encoder for encoding the received data based on the computed loss.
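One plausible form for a loss that depends on the received data, the reconstructed data, and the importance mask is an importance-weighted mean squared error. The abstract does not specify the loss, so the weighting scheme and the name `masked_mse` below are assumptions.

```python
def masked_mse(received, reconstructed, mask):
    """Importance-weighted mean squared error over flat sequences.

    received, reconstructed: sequences of sample values per region
    mask: non-negative importance value for each region
    """
    if not (len(received) == len(reconstructed) == len(mask)):
        raise ValueError("inputs must have the same length")
    total_weight = sum(mask)
    if total_weight == 0:
        return 0.0
    weighted = sum(
        m * (x - r) ** 2 for x, r, m in zip(received, reconstructed, mask)
    )
    return weighted / total_weight
```

With this weighting, the same reconstruction error costs more in a high-importance region than in a low-importance one, which pushes the encoder to spend its bits where the mask says they matter.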

Information processing system, information processing device, information processing method, and storage medium

    Publication No.: US20240153260A1

    Publication date: 2024-05-09

    Application No.: US18280381

    Filing date: 2021-03-09

    CPC classification number: G06V10/82 G06V10/776

    Abstract: A model learning device determines a first machine learning model so as to further increase a combined loss function obtained by combining: a first loss function indicating the level of change in the reliability of a second image feature in a feature region of a reconstructed image, from the reliability of a first image feature in a feature region of the original image; and a second loss function indicating the level of recognition error. In addition, the model learning device determines, so as to further reduce the combined loss function, respective parameter sets for a second machine learning model used in the generation of compressed data and a third machine learning model used in the generation of the reconstructed image from the compressed data, and determines a parameter set for a fourth machine learning model in common with that for the first machine learning model.
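Read as an optimization problem, the abstract describes an adversarial (min-max) objective. The sketch below writes it out under assumptions: the combination is taken to be a weighted sum with a hypothetical weight $\lambda$, and $\theta_1,\dots,\theta_4$ are labels introduced here for the parameter sets of the four models.

```latex
% L_1: change in feature reliability between original and reconstructed image
% L_2: recognition error
% lambda: assumed combination weight (the abstract only says "combining")
\begin{aligned}
L(\theta_1, \theta_2, \theta_3) &= L_1(\theta_1, \theta_2, \theta_3)
    + \lambda \, L_2(\theta_1, \theta_2, \theta_3), \\
\theta_1 &\leftarrow \arg\max_{\theta_1} \; L(\theta_1, \theta_2, \theta_3), \\
(\theta_2, \theta_3) &\leftarrow \arg\min_{\theta_2, \theta_3} \; L(\theta_1, \theta_2, \theta_3), \\
\theta_4 &= \theta_1 .
\end{aligned}
```

The first model is pushed to increase the combined loss while the compression and reconstruction models are pushed to decrease it, and the fourth model reuses the first model's parameters.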
