Method for Improving Temporal Consistency of Deep Neural Networks

    公开(公告)号:US20210073589A1

    公开(公告)日:2021-03-11

    申请号:US16821315

    申请日:2020-03-17

    Applicant: Apple Inc.

    Abstract: Training a network for image processing with temporal consistency includes obtaining un-annotated frames from a video feed. A pretrained network is applied to the first frame of first frame set comprising a plurality of frames to obtain a first prediction, wherein the pretrained network is pretrained for a first image processing task. A current version of the pretrained network is applied to each frame of the first frame set to obtain a first prediction. A content loss term is determined, based on the first prediction and a current prediction for the frame, based on the current network. A temporal consistency loss term is also determined based on a determined consistency of pixels within each frame of the first frame set. The pretrained network may be refined based on the content loss term and the temporal term to obtain a refined network.

    Saliency of an object for image processing operations

    公开(公告)号:US11663806B2

    公开(公告)日:2023-05-30

    申请号:US17659377

    申请日:2022-04-15

    Applicant: Apple Inc.

    CPC classification number: G06V10/255 G06F18/214 G06V10/462 G06V20/64

    Abstract: Various methods for utilizing a saliency heatmaps are described. The methods include obtaining image data corresponding to an image of a scene, obtaining a saliency heatmap for the image of the scene based on a saliency network, wherein the saliency heatmap indicates a likelihood of saliency for a corresponding portion of the scene, and manipulating the image data based on the saliency heatmap. In embodiments, the saliency heatmap may be produced using a trained machine learning model. The saliency heatmap may be used for various image processing tasks, such as determining which portion(s) of a scene to base an image capture device's autofocus, auto exposure, and/or white balance operations upon. According to some embodiments, one or more bounding boxes may be generated based on the saliency heatmap, e.g., using an optimization operation, which bounding box(es) may be used to assist or enhance the performance of various image processing tasks.

    Expanding indexed terms for searching files

    公开(公告)号:US11250041B2

    公开(公告)日:2022-02-15

    申请号:US16147444

    申请日:2018-09-28

    Applicant: Apple Inc.

    Abstract: A device implementing a system for expanded search includes a processor configured to identify plural words, and generate, for each word of the plural words, a word vector based on a proximity of the word relative to other words of the plural words, the word vector comprising plural dimensions. The processor is further configured to create a compressed word vector structure comprising clusters of subsets of the plural dimensions across the word vectors, each cluster including similar values of the respective dimensions, convert the word vectors to points on at least one plane, and partition the at least one plane into nested groupings of the points based on a threshold number of points per nested grouping. The processor is further configured to create a tree look-up structure of the nested groupings, and provide the compressed word vector structure and the tree look-up structure to a client device.

Patent Agency Ranking