Explainable video performance prediction

    公开(公告)号:US11430219B2

    公开(公告)日:2022-08-30

    申请号:US16952675

    申请日:2020-11-19

    Applicant: Adobe Inc.

    Abstract: Systems and methods predict a performance metric for a video and identify key portions of the video that contribute to the performance metric, which can be used to edit the video to improve the ultimate viewer response to the video. An initial performance metric is computed for an initial video (e.g., using a neural network). A perturbed video is generated by perturbing a video portion of the initial video. A modified performance metric is computed for the perturbed video. Based on a difference between the initial and modified performance metrics, the system determines that the video portion contributed to a predicted user viewer response to the initial video. An indication of the video portion that contributed to the predicted user viewer response is provided as output, which can be used to edit the video to improve the predicted viewer response.

    EXPLAINABLE VIDEO PERFORMANCE PREDICTION

    公开(公告)号:US20220156499A1

    公开(公告)日:2022-05-19

    申请号:US16952675

    申请日:2020-11-19

    Applicant: Adobe Inc.

    Abstract: Systems and methods predict a performance metric for a video and identify key portions of the video that contribute to the performance metric, which can be used to edit the video to improve the ultimate viewer response to the video. An initial performance metric is computed for an initial video (e.g., using a neural network). A perturbed video is generated by perturbing a video portion of the initial video. A modified performance metric is computed for the perturbed video. Based on a difference between the initial and modified performance metrics, the system determines that the video portion contributed to a predicted user viewer response to the initial video. An indication of the video portion that contributed to the predicted user viewer response is provided as output, which can be used to edit the video to improve the predicted viewer response.

    Validating a target audience using a combination of classification algorithms

    公开(公告)号:US11308523B2

    公开(公告)日:2022-04-19

    申请号:US15457882

    申请日:2017-03-13

    Applicant: Adobe Inc.

    Abstract: This disclosure generally covers systems and methods that determine demographic labels for a user or a group of users by using digital inputs within a predictive model for demographic classification. In particular, the disclosed systems and methods use a unique combination of classification algorithms to determine demographic labels for users as a potential audience of digital content items. When applying the combination of classification algorithms, the disclosed systems and methods use a first classification algorithm to determine user-level-latent features for each user within a group of users based on demographic-label statistics associated with particular digital content items. The disclosed systems and methods then use the user-level-latent features and session-level features (from sessions of each user consuming the digital content items) as inputs in a second classification algorithm to determine a demographic label for each user within the group of users.

    Trajectory-based viewport prediction for 360-degree videos

    公开(公告)号:US11252393B2

    公开(公告)日:2022-02-15

    申请号:US17074189

    申请日:2020-10-19

    Applicant: Adobe Inc.

    Abstract: In implementations of trajectory-based viewport prediction for 360-degree videos, a video system obtains trajectories of angles of users who have previously viewed a 360-degree video. The angles are used to determine viewports of the 360-degree video, and may include trajectories for a yaw angle, a pitch angle, and a roll angle of a user recorded as the user views the 360-degree video. The video system clusters the trajectories of angles into trajectory clusters, and for each trajectory cluster determines a trend trajectory. When a new user views the 360-degree video, the video system compares trajectories of angles of the new user to the trend trajectories, and selects trend trajectories for a yaw angle, a pitch angle, and a roll angle for the user. Using the selected trend trajectories, the video system predicts viewports of the 360-degree video for the user for future times.

    Deep Relational Factorization Machine Techniques for Content Usage Prediction via Multiple Interaction Types

    公开(公告)号:US20220027722A1

    公开(公告)日:2022-01-27

    申请号:US16939661

    申请日:2020-07-27

    Applicant: Adobe Inc.

    Abstract: A deep relational factorization machine (“DRFM”) system is configured to provide a high-order prediction based on high-order feature interaction data for a dataset of sample nodes. The DRFM system can be configured with improved factorization machine (“FM”) techniques for determining high-order feature interaction data describing interactions among three or more features. The DRFM system can be configured with improved graph convolutional neural network (“GCN”) techniques for determining sample interaction data describing sample interactions among sample nodes, including sample interaction data that is based on the high-order feature interaction data. The DRFM system generates a high-order prediction based on the high-order feature interaction embedding vector and the sample interaction embedding vector. The high-order prediction can be provided to a prediction computing system configured to perform operations based on the high-order prediction.

    Low-latency adaptive streaming for augmented reality scenes

    公开(公告)号:US11217208B2

    公开(公告)日:2022-01-04

    申请号:US16834776

    申请日:2020-03-30

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods that iteratively select versions of augmented reality objects at augmented reality levels of detail to provide for download to a client device to reduce start-up latency associated with providing a requested augmented reality scene. In particular, in one or more embodiments, the disclosed systems determine utility and priority metrics associated with versions of augmented reality objects associated with a requested augmented reality scene. The disclosed systems utilize the determined metrics to select versions of augmented reality objects that are likely to be viewed by the client device and improve the quality of the augmented reality scene as the client device moves through the augmented reality scene. In at least one embodiment, the disclosed systems iteratively select versions of augmented reality objects at various levels of detail until the augmented reality scene is fully downloaded.

    GENERATING MULTI-PASS-COMPRESSED-TEXTURE IMAGES FOR FAST DELIVERY

    公开(公告)号:US20210337222A1

    公开(公告)日:2021-10-28

    申请号:US16860758

    申请日:2020-04-28

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media to enhance texture image delivery and processing at a client device. For example, the disclosed systems can utilize a server-side compression combination that includes, in sequential order, a first compression pass, a decompression pass, and a second compression pass. By applying this compression combination to a texture image at the server-side, the disclosed systems can leverage both GPU-friendly and network-friendly image formats. For example, at a client device, the disclosed system can instruct the client device to execute a combination of decompression-compression passes on a GPU-network-friendly image delivered over a network connection to the client device. In so doing, client device can generate a tri-pass-compressed-texture from a decompressed image comprising texels with color palettes based on previously reduced color palettes from the first compression pass at the server-side, which reduces computational overhead and increases performance speed.

    Embedding codebooks for resource optimization

    公开(公告)号:US11086843B2

    公开(公告)日:2021-08-10

    申请号:US15788481

    申请日:2017-10-19

    Applicant: ADOBE INC.

    Abstract: Embodiments of the present disclosure provide systems, methods, and computer storage media for optimizing computing resources generally associated with cloud-based media services. Instead of decoding digital assets on-premises to stream to a remote client device, an encoded asset can be streamed to the remote client device. A codebook employable for decoding the encoded asset can be embedded into the stream transmitted to the remote client device, so that the remote client device can extract the embedded codebook, and employ the extracted codebook to decode the encoded asset locally. In this way, not only are processing resources associated with on-premises decoding eliminated, but on-premises storage of codebooks can be significantly reduced, while expensive bandwidth is freed up by virtue of transmitting a smaller quantity of data from the cloud to the remote client device.

    SELECTING LOGO IMAGES USING MACHINE-LEARNING-LOGO CLASSIFIERS

    公开(公告)号:US20210064934A1

    公开(公告)日:2021-03-04

    申请号:US16557330

    申请日:2019-08-30

    Applicant: Adobe Inc.

    Abstract: This disclosure relates to methods, non-transitory computer readable media, and systems that can initially train a machine-learning-logo classifier using synthetic training images and incrementally apply the machine-learning-logo classifier to identify logo images to replace the synthetic training images as training data. By incrementally applying the machine-learning-logo classifier to determine one or both of logo scores and positions for logos within candidate logo images, the disclosed systems can select logo images and corresponding annotations indicating positions for ground-truth logos. In some embodiments, the disclosed systems can further augment the iterative training of a machine-learning-logo classifier to include user curation and removal of incorrectly detected logos from candidate images, thereby avoiding the risk of model drift across training iterations.

Patent Agency Ranking