-
Publication No.: US20220254330A1
Publication date: 2022-08-11
Application No.: US17610934
Application date: 2019-05-20
Applicant: DeepMind Technologies Limited
Inventor: Luis Carlos Cobo Rus, Nal Kalchbrenner, Erich Elsen, Chenjie Gu
IPC: G10L13/047, G10L13/08, G10L25/30
Abstract: Techniques are disclosed that enable generation of an audio waveform representing synthesized speech based on a difference signal determined using an autoregressive model. Various implementations include using a distribution of the difference signal values to represent sounds found in human speech with a higher level of granularity than sounds not frequently found in human speech. Additional or alternative implementations include using one or more speakers of a client device to render the generated audio waveform.
-
Publication No.: US20240267532A1
Publication date: 2024-08-08
Application No.: US18565008
Application date: 2022-05-30
Applicant: DeepMind Technologies Limited
Inventor: Anton Zhernov, Chenjie Gu, Daniel J. Mankowitz, Julian Schrittwieser, Amol Balkishan Mandhane, Mary Elizabeth Rauh, Miaosen Wang, Thomas Keisuke Hubert
IPC: H04N19/149, H04N19/172
CPC classification number: H04N19/149, H04N19/172
Abstract: Systems and methods for training rate control neural networks through reinforcement learning. During training, the reward value for each training example is generated from two quantities: the current performance of the rate control neural network in encoding the video in that training example, and the network's historical performance in encoding the same video.
-
Publication No.: US20240161729A1
Publication date: 2024-05-16
Application No.: US18418025
Application date: 2024-01-19
Applicant: DeepMind Technologies Limited
Inventor: Luis Carlos Cobo Rus, Nal Kalchbrenner, Erich Elsen, Chenjie Gu
IPC: G10L13/047, G10L13/08, G10L25/30
CPC classification number: G10L13/047, G10L13/08, G10L25/30
Abstract: Techniques are disclosed that enable generation of an audio waveform representing synthesized speech based on a difference signal determined using an autoregressive model. Various implementations include using a distribution of the difference signal values to represent sounds found in human speech with a higher level of granularity than sounds not frequently found in human speech. Additional or alternative implementations include using one or more speakers of a client device to render the generated audio waveform.
-
Publication No.: US11915682B2
Publication date: 2024-02-27
Application No.: US17610934
Application date: 2019-05-20
Applicant: DeepMind Technologies Limited
Inventor: Luis Carlos Cobo Rus, Nal Kalchbrenner, Erich Elsen, Chenjie Gu
IPC: G10L13/00, G10L19/00, G10L13/047, G10L13/08, G10L25/30
CPC classification number: G10L13/047, G10L13/08, G10L25/30
Abstract: Techniques are disclosed that enable generation of an audio waveform representing synthesized speech based on a difference signal determined using an autoregressive model. Various implementations include using a distribution of the difference signal values to represent sounds found in human speech with a higher level of granularity than sounds not frequently found in human speech. Additional or alternative implementations include using one or more speakers of a client device to render the generated audio waveform.