TECHNIQUES FOR TRAINING A MULTITASK LEARNING MODEL TO ASSESS PERCEIVED AUDIO QUALITY

    公开(公告)号:US20210350819A1

    公开(公告)日:2021-11-11

    申请号:US16905793

    申请日:2020-06-18

    Applicant: NETFLIX, INC.

    Abstract: In various embodiments, a training application trains a multitask learning model to assess perceived audio quality. The training application computes a set of pseudo labels based on a first audio clip and multiple models. The set of pseudo labels specifies metric values for a set of metrics that are relevant to audio quality. The training application also computes a set of feature values for a set of audio features based on the first audio clip. The training application trains a multitask learning model based on the set of feature values and the set of pseudo labels to generate a trained multitask learning model. In operation, the trained multitask learning model maps different sets of feature values for the set of audio features to different sets of predicted labels. Each set of predicted labels specifies estimated metric values for the set of metrics.

Patent Agency Ranking