Patent search ap:("GOOGLE LLC") AND inv:"Balineedu Adsumilli" Page 1

1.

Patent application
Systems and Techniques for Retraining Models for Video Quality Assessment and for Transcoding Using the Retrained Models [In force]

Publication (announcement) No.: US20220415039A1

Publication (announcement) date: 2022-12-29

Application No.: US17762289

Application date: 2019-11-26

Applicant: Google LLC

Inventor: Yilin Wang, Hossein Talebi, Peyman Milanfar, Feng Yang, Balineedu Adsumilli

IPC: G06V10/98, G06V10/82, G06V20/40, G06N3/04

Abstract: A trained model is retrained for video quality assessment and used to identify sets of adaptive compression parameters for transcoding user generated video content. Using transfer learning, the model, which is initially trained for image object detection, is retrained for technical content assessment and then again retrained for video quality assessment. The model is then deployed into a transcoding pipeline and used for transcoding an input video stream of user generated content. The transcoding pipeline may be structured in one of several ways. In one example, a secondary pathway for video content analysis using the model is introduced into the pipeline, which does not interfere with the ultimate output of the transcoding should there be a network or other issue. In another example, the model is introduced as a library within the existing pipeline, which would maintain a single pathway, but ultimately is not expected to introduce significant latency.
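
As a rough illustration of the first pipeline structure described in this abstract, the sketch below keeps model-based analysis on a secondary pathway so that a failure there leaves the transcode unaffected. All names (`choose_parameters`, `analyze`, `default_params`, `flaky_analyzer`) are hypothetical and do not come from the patent. Falling back to default parameters is an assumption about what "does not interfere" could mean in practice.

```python
def choose_parameters(analyze, video, default_params):
    """Return compression parameters for `video`.

    `analyze` stands in for the retrained quality model (hypothetical).
    If it fails for any reason, fall back to `default_params` so the
    main transcoding pathway can still produce output.
    """
    try:
        return analyze(video)
    except Exception:
        # Assumption: any analysis failure (network or otherwise) is non-fatal.
        return default_params


def flaky_analyzer(video):
    # Simulates the secondary pathway being unavailable.
    raise ConnectionError("model service unreachable")
```

Calling `choose_parameters(flaky_analyzer, "clip.mp4", {"crf": 30})` returns the defaults, while a working analyzer's result is passed through unchanged.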

2.

Patent application
Noise Reduction Method for High Dynamic Range Videos [In force]

Publication (announcement) No.: US20220237749A1

Publication (announcement) date: 2022-07-28

Application No.: US17722720

Application date: 2022-04-18

Applicant: Google LLC

Inventor: Neil Birkbeck, Balineedu Adsumilli, Mohammad Izadi

IPC: G06T5/00

Abstract: Denoising video content includes identifying a three-dimensional flat frame block of multiple frames of the video content, wherein the three-dimensional flat frame block comprises flat frame blocks, each flat frame block is located within a respective frame of the multiple frames, and the flat frame blocks have a spatial and temporal intensity variance that is less than a threshold. Denoising video content also includes determining an average intensity value of the three-dimensional flat frame block, determining a noise model that represents noise characteristics of the three-dimensional flat frame block, generating a denoising function using the average intensity value and the noise model, and denoising the multiple frames using the denoising function.
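
The first two steps of this abstract (finding a flat three-dimensional block and taking its average intensity) can be sketched as follows. This is a minimal sketch under stated assumptions: a block is a list of co-located per-frame pixel lists, the "spatial and temporal intensity variance" is approximated by one population variance over all pixels in the block, and that variance is used as a stand-in noise estimate. The function names are hypothetical, and the patent's actual noise model and denoising function are not reproduced here.

```python
from statistics import mean, pvariance


def flatten(block_3d):
    """Collect every pixel intensity from the co-located blocks of all frames."""
    return [p for frame_block in block_3d for p in frame_block]


def is_flat_block(block_3d, threshold):
    """True when the joint spatial-temporal variance is below `threshold`."""
    return pvariance(flatten(block_3d)) < threshold


def noise_sample(block_3d):
    """Return (average intensity, variance) for a flat block.

    Assumption: in a flat block the remaining variance is mostly noise,
    so the pair can serve as one point of an intensity-dependent noise model.
    """
    pixels = flatten(block_3d)
    return mean(pixels), pvariance(pixels)
```

For example, two co-located 2x2 blocks `[[10, 10, 11, 10], [10, 11, 10, 10]]` pass a threshold of 1.0, whereas a checkerboard of 0 and 255 does not.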

3.

Patent application
Bitrate Optimizations For Immersive Multimedia Streaming [In force]

Publication (announcement) No.: US20210392392A1

Publication (announcement) date: 2021-12-16

Application No.: US17462286

Application date: 2021-08-31

Applicant: Google LLC

Inventor: Neil Birkbeck, Balineedu Adsumilli, Damien Kelly

IPC: H04N21/2662, H04N21/233, H04N21/234, H04N21/4728, H04N21/81

Abstract: Signals of an immersive multimedia item are jointly considered for optimizing the quality of experience for the immersive multimedia item. During encoding, portions of available bitrate are allocated to the signals (e.g., a video signal and an audio signal) according to the overall contribution of those signals to the immersive experience for the immersive multimedia item. For example, in the spatial dimension, multimedia signals are processed to determine spatial regions of the immersive multimedia item to render using greater bitrate allocations, such as based on locations of audio content of interest, video content of interest, or both. In another example, in the temporal dimension, multimedia signals are processed in time intervals to adjust allocations of bitrate between the signals based on the relative importance of such signals during those time intervals. Other techniques for bitrate optimizations for immersive multimedia streaming are also described herein.
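
One simple way to picture the allocation step is a proportional split of the available bitrate by each signal's contribution. The sketch below assumes the contributions are already available as non-negative weights and that the split is linear. Both are illustrative assumptions, and `allocate_bitrate` is a hypothetical name rather than anything defined in the patent.

```python
def allocate_bitrate(total_kbps, contributions):
    """Split `total_kbps` across signals in proportion to their weights.

    `contributions` maps a signal name (e.g. "video", "audio") to a
    non-negative weight describing its contribution to the experience.
    """
    total_weight = sum(contributions.values())
    if total_weight <= 0:
        raise ValueError("at least one signal must have a positive weight")
    return {
        name: total_kbps * weight / total_weight
        for name, weight in contributions.items()
    }
```

In the temporal case the same function could be called once per time interval with that interval's weights, for example `allocate_bitrate(1000, {"video": 3, "audio": 1})` for a visually dominant interval.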

4.

Granted patent
Bitrate optimizations for immersive multimedia streaming [In force]

Publication (announcement) No.: US11122314B2

Publication (announcement) date: 2021-09-14

Application No.: US16613961

Application date: 2017-12-12

Applicant: Google LLC

Inventor: Neil Birkbeck, Balineedu Adsumilli, Damien Kelly

IPC: H04N21/2662, H04N21/233, H04N21/234, H04N21/4728, H04N21/81

Abstract: Signals of an immersive multimedia item are jointly considered for optimizing the quality of experience for the immersive multimedia item. During encoding, portions of available bitrate are allocated to the signals (e.g., a video signal and an audio signal) according to the overall contribution of those signals to the immersive experience for the immersive multimedia item. For example, in the spatial dimension, multimedia signals are processed to determine spatial regions of the immersive multimedia item to render using greater bitrate allocations, such as based on locations of audio content of interest, video content of interest, or both. In another example, in the temporal dimension, multimedia signals are processed in time intervals to adjust allocations of bitrate between the signals based on the relative importance of such signals during those time intervals. Other techniques for bitrate optimizations for immersive multimedia streaming are also described herein.

5.

Granted patent
Systems and techniques for retraining models for video quality assessment and for transcoding using the retrained models [In force]

Publication (announcement) No.: US12230024B2

Publication (announcement) date: 2025-02-18

Application No.: US17762289

Application date: 2019-11-26

Applicant: Google LLC

Inventor: Yilin Wang, Hossein Talebi, Peyman Milanfar, Feng Yang, Balineedu Adsumilli

IPC: G06V10/98, G06N3/045, G06V10/82, G06V20/40

Abstract: A trained model is retrained for video quality assessment and used to identify sets of adaptive compression parameters for transcoding user generated video content. Using transfer learning, the model, which is initially trained for image object detection, is retrained for technical content assessment and then again retrained for video quality assessment. The model is then deployed into a transcoding pipeline and used for transcoding an input video stream of user generated content. The transcoding pipeline may be structured in one of several ways. In one example, a secondary pathway for video content analysis using the model is introduced into the pipeline, which does not interfere with the ultimate output of the transcoding should there be a network or other issue. In another example, the model is introduced as a library within the existing pipeline, which would maintain a single pathway, but ultimately is not expected to introduce significant latency.

6.

Patent application
Debanding Using A Novel Banding Metric [In force]

Publication (announcement) No.: US20230131228A1

Publication (announcement) date: 2023-04-27

Application No.: US17922531

Application date: 2020-05-19

Applicant: Google LLC

Inventor: Yilin Wang, Balineedu Adsumilli, Feng Yang

IPC: G06T5/00, G06T5/20, G06T7/13, G06V10/56, G06V10/74

Abstract: A method includes training a first model to measure the banding artefacts, training a second model to deband the image, and generating a debanded image for the image using the second model. Training the first model can include selecting a first set of first training images, generating a banding edge map for a first training image, where the map includes weights that emphasize banding edges and de-emphasize true edges in the first training image, and using the map and a luminance plane of the first training image as input to the first model. Training the second model can include selecting a second set of second training images, generating a debanded training image for a second training image, generating a banding score for the debanded training image using the first model, and using the banding score in a loss function used in training the second model.
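
The abstract states only that the banding score from the first model is used in the loss function for the second model. One plausible shape for such a loss is sketched below, assuming an additive combination of a mean-squared reconstruction error and a weighted banding score. The additive form, the MSE term, the `weight` parameter, and the name `debanding_loss` are all assumptions made for illustration.

```python
def debanding_loss(debanded, reference, banding_score, weight=0.1):
    """Combine reconstruction error with a banding penalty.

    `debanded` and `reference` are equal-length sequences of pixel values.
    `banding_score` is the (hypothetical) output of the first model for
    the debanded image; higher means more visible banding.
    """
    if len(debanded) != len(reference) or not reference:
        raise ValueError("inputs must be non-empty and the same length")
    mse = sum((d - r) ** 2 for d, r in zip(debanded, reference)) / len(reference)
    return mse + weight * banding_score
```

With this form, a larger `weight` pushes the second model toward removing banding at the cost of fidelity to the reference image.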

7.

Granted patent
Noise reduction method for high dynamic range videos [In force]

Publication (announcement) No.: US11308585B2

Publication (announcement) date: 2022-04-19

Application No.: US16613945

Application date: 2017-12-05

Applicant: Google LLC

Inventor: Neil Birkbeck, Balineedu Adsumilli, Mohammad Izadi

IPC: G06T5/00

Abstract: A method for denoising video content includes identifying a first frame block of a plurality of frame blocks associated with a first frame of the video content. The method also includes determining an average intensity value for the first frame block. The method also includes determining a first noise model that represents characteristics of the first frame block. The method also includes generating a denoising function using the average intensity value and the first noise model for the first frame block. The method further includes denoising the plurality of frame blocks using the denoising function.

8.

Granted patent
Dynamic parameter selection for quality-normalized video transcoding [In force]

Publication (announcement) No.: US12250383B2

Publication (announcement) date: 2025-03-11

Application No.: US17911245

Application date: 2020-05-19

Applicant: Google LLC

Inventor: Yilin Wang, Balineedu Adsumilli

IPC: H04N19/00, H04N19/149, H04N19/154, H04N21/2743

Abstract: Video streams uploaded to a video hosting platform are transcoded using quality-normalized transcoding parameters dynamically selected using a learning model. Video frames of a video stream are processed using the learning model to determine bitrate and quality score pairs for some or all possible transcoding resolutions. The listing of bitrate and quality score pairs determined for each resolution is processed to determine a set of transcoding parameters for transcoding the video stream into each resolution. The bitrate and quality score pairs of a given listing may be processed using one or more predefined thresholds, which may, in some cases, refer to a weighted distribution of resolutions according to watch times of videos of the video hosting platform. The video stream is then transcoded into the various resolutions using the set of transcoding parameters selected for each resolution.
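
The selection step over bitrate and quality score pairs might look like the sketch below. It assumes a single quality threshold shared by all resolutions and picks the lowest bitrate that meets it. The fallback to the highest-quality pair when none qualifies, and the name `select_parameters`, are assumptions; the abstract's watch-time-weighted thresholds are not modeled.

```python
def select_parameters(pairs_by_resolution, min_quality):
    """Pick one (bitrate, quality) pair per resolution.

    `pairs_by_resolution` maps a resolution label to a list of
    (bitrate_kbps, quality_score) pairs predicted by the learning model.
    """
    selected = {}
    for resolution, pairs in pairs_by_resolution.items():
        good_enough = [pair for pair in pairs if pair[1] >= min_quality]
        if good_enough:
            # Tuples compare by bitrate first, so min() is the cheapest option.
            selected[resolution] = min(good_enough)
        else:
            # Assumption: with no qualifying pair, take the best quality available.
            selected[resolution] = max(pairs, key=lambda pair: pair[1])
    return selected
```

A usage example: with pairs `[(1000, 3.5), (2000, 4.2), (3000, 4.5)]` for "720p" and a threshold of 4.0, the function selects `(2000, 4.2)`.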

9.

Patent publication
METHODS, SYSTEMS, AND MEDIA FOR DETERMINING PERCEPTUAL QUALITY INDICATORS OF VIDEO CONTENT ITEMS [Under examination, published]

Publication (announcement) No.: US20230319327A1

Publication (announcement) date: 2023-10-05

Application No.: US18021636

Application date: 2022-06-08

Applicant: Google LLC

Inventor: Yilin Wang, Balineedu Adsumilli, Junjie Ke, Hossein Talebi, Joong Yim, Neil Birkbeck, Peyman Milanfar, Feng Yang

IPC: H04N21/234, H04N19/154, H04N21/466

CPC classification number: H04N21/23418, H04N19/154, H04N21/4668

Abstract: Methods, systems, and media for determining perceptual quality indicators of video content items are provided. In some embodiments, the method comprises: receiving a video content item; extracting a plurality of frames from the video content item; determining, using a first subnetwork of a deep neural network, a content quality indicator for each frame of the plurality of frames of the video content item; determining, using a second subnetwork of the deep neural network, a video distortion indicator for each frame of the plurality of frames of the video content item; determining, using a third subnetwork of the deep neural network, a compression sensitivity indicator for each frame of the plurality of frames of the video content item; generating a quality level for each frame of the plurality of frames of the video content item that concatenates the content quality indicator, the video distortion indicator, and the compression sensitivity indicator for that frame of the video content item; generating an overall quality level for video content item by aggregating the quality level of each frame of the plurality of frames; and causing a video recommendation to be presented based on the overall quality level of the video content item.
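
The per-frame concatenation and the final aggregation can be sketched as below. The three subnetworks are not modeled, so their outputs are passed in as plain lists of numbers. Averaging element-wise across frames is an assumed aggregation (the abstract says only "aggregating"), and both function names are hypothetical.

```python
from statistics import mean


def frame_quality_level(content, distortion, sensitivity):
    """Concatenate the three per-frame indicators into one feature list."""
    return list(content) + list(distortion) + list(sensitivity)


def overall_quality(frame_levels):
    """Aggregate per-frame quality levels by element-wise mean (assumption)."""
    if not frame_levels:
        raise ValueError("at least one frame is required")
    return [mean(column) for column in zip(*frame_levels)]
```

For two frames with indicator values `(0.5, 0.25, 1.0)` and `(1.0, 0.75, 0.0)`, the aggregated quality level is `[0.75, 0.5, 0.5]`.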

10.

Granted patent
Video-informed spatial audio expansion [In force]

Publication (announcement) No.: US11704087B2

Publication (announcement) date: 2023-07-18

Application No.: US16779921

Application date: 2020-02-03

Applicant: Google LLC

Inventor: Marcin Gorzel, Balineedu Adsumilli

IPC: G06F3/16, G10L25/51, G06V20/40, H04S5/00

CPC classification number: G06F3/165, G06V20/41, G10L25/51, H04S5/005

Abstract: Assigning spatial information to audio segments is disclosed. A method includes receiving a first audio segment that is non-spatialized and is associated with first video frames; identifying visual objects in the first video frames; identifying auditory events in the first audio segment; identifying a match between a visual object of the visual objects and an auditory event of the auditory events; and assigning a spatial location to the auditory event based on a location of the visual object.
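
The matching and assignment steps can be illustrated with a deliberately simple rule: an auditory event matches a visual object when the two carry the same label. That rule is an assumption, as are the data shapes and the name `assign_spatial_locations`; the patent's actual detection and matching methods are not described in the abstract.

```python
def assign_spatial_locations(visual_objects, auditory_events):
    """Give each matched auditory event the location of its visual object.

    `visual_objects` is a list of (label, (x, y)) pairs found in the frames.
    `auditory_events` is a list of labels found in the audio segment.
    Events without a matching visual object are left unassigned.
    """
    locations = {label: position for label, position in visual_objects}
    return {
        event: locations[event]
        for event in auditory_events
        if event in locations
    }
```

Given a visible "dog" at `(0.2, 0.7)` and auditory events "dog" and "speech", only the dog sound receives a spatial location.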
