IDENTIFYING REPRESENTATIVE FRAMES IN VIDEO CONTENT

    公开(公告)号:US20210390315A1

    公开(公告)日:2021-12-16

    申请号:US17344752

    申请日:2021-06-10

    Applicant: NETFLIX, INC.

    Abstract: One embodiment of the present invention sets forth a technique for selecting a frame of video content that is representative of a media title. The technique includes applying an embedding model to a plurality of faces included in a set of frames of the video content to generate a plurality of face embeddings. The technique also includes aggregating the plurality of face embeddings into a plurality of clusters representing a plurality of characters included in the media title. The technique further includes computing a plurality of prominence scores for the plurality of characters based on one or more attributes of the plurality of clusters, and selecting, from the set of frames, a frame of video content as representative of the media title based on one or more prominence scores for one or more characters included in the frame.

    TECHNIQUES FOR TRAINING IDENTITY-ROBUST MACHINE LEARNING MODELS

    公开(公告)号:US20250045633A1

    公开(公告)日:2025-02-06

    申请号:US18749378

    申请日:2024-06-20

    Applicant: NETFLIX, INC.

    Abstract: In various embodiments, a model trainer application trains a machine learning model with improved identity robustness. The model trainer application first processes images of faces using a trained face recognition model to generate a proxy representation of an identity of the individual in each image. Representations of individuals with similar faces lie in the same neighborhoods within a proxy identity space. The model trainer application trains a machine learning model to perform a task relating to faces while considering the accuracy of each identity proxy neighborhood. The model trainer assigns different weights to each image sample in a neighborhood based on the number of samples with the same output class in that neighborhood. The assigned weights can then be used to compute a relatively unbiased identity loss function that is used to train the machine learning model to perform the task relating to faces while being robust to identity features.

    IDENTIFYING REPRESENTATIVE FRAMES IN VIDEO CONTENT

    公开(公告)号:US20240242501A1

    公开(公告)日:2024-07-18

    申请号:US18620764

    申请日:2024-03-28

    Applicant: NETFLIX, INC.

    CPC classification number: G06V20/47 G06V20/49 G06V40/172

    Abstract: One embodiment of the present invention sets forth a technique for selecting a frame of video content that is representative of a media title. The technique includes applying an embedding model to a plurality of faces included in a set of frames of the video content to generate a plurality of face embeddings. The technique also includes aggregating the plurality of face embeddings into a plurality of clusters representing a plurality of characters included in the media title. The technique further includes computing a plurality of prominence scores for the plurality of characters based on one or more attributes of the plurality of clusters, and selecting, from the set of frames, a frame of video content as representative of the media title based on one or more prominence scores for one or more characters included in the frame.

Patent Agency Ranking