-
公开(公告)号:US12190896B2
公开(公告)日:2025-01-07
申请号:US17856292
申请日:2022-07-01
Applicant: Google LLC
Inventor: Yunpeng Li , Marco Tagliasacchi , Dominik Roblek , Félix de Chaumont Quitry , Beat Gfeller , Hannah Raphaelle Muckenhirn , Victor Ungureanu , Oleg Rybakov , Karolis Misiunas , Zalán Borsos
IPC: G10L19/022 , G06N3/045
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing an input audio waveform using a generator neural network to generate an output audio waveform. In one aspect, a method comprises: receiving an input audio waveform; processing the input audio waveform using an encoder neural network to generate a set of feature vectors representing the input audio waveform; and processing the set of feature vectors representing the input audio waveform using a decoder neural network to generate an output audio waveform that comprises a respective output audio sample for each of a plurality of output time steps.
-
公开(公告)号:US20230395087A1
公开(公告)日:2023-12-07
申请号:US18249126
申请日:2021-10-15
Applicant: Google LLC
Inventor: Marco Tagliasacchi , Beat Gfeller , Yunpeng Li , Zalán Borsos
IPC: G10L21/007 , G10L15/06 , G10L15/08 , G10L25/18 , G10L21/0208 , G10L25/21
CPC classification number: G10L21/007 , G10L15/063 , G10L15/08 , G10L25/18 , G10L21/0208 , G10L25/21 , G10L2015/088
Abstract: Example implementations of the present disclosure relate to machine learning for microphone style transfer, for example, to facilitate augmentation of audio data such as speech data to improve robustness of machine learning models trained on the audio data. Systems and methods for microphone style transfer can include one or more machine-learned microphone models trained to obtain and augment signal data to mimic characteristics of signal data obtained from a target microphone. The systems and methods can include a speech enhancement network for enhancing a sample before the style transfer. The augmentation output can then be utilized for a variety of downstream tasks.
-
13.
公开(公告)号:US11256472B2
公开(公告)日:2022-02-22
申请号:US17010694
申请日:2020-09-02
Applicant: Google LLC
Inventor: Dominik Roblek , Blaise Hilary Aguera-Arcas , Thomas W. Hume , Marvin Karl Ritter , Brandon Charles Barbello , Kevin I. Kilgour , Mihajlo Velimirović , Christopher Thornton , Gabriel Oak Taubman , James David Lyon , Jan Heinrich Althaus , Katsiaryna Naliuka , Julian James Odell , Matthew Sharifi , Beat Gfeller
IPC: G06F3/16 , G06F16/635 , G06F16/683 , G06N3/08 , G06N20/00
Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products. A computing device stores reference song characterization data and receives digital audio data. The computing device determines whether the digital audio data represents music and then performs a different process to recognize that the digital audio data represents a particular reference song. The computing device then outputs an indication of the particular reference song.
-
14.
公开(公告)号:US10809968B2
公开(公告)日:2020-10-20
申请号:US16148338
申请日:2018-10-01
Applicant: Google LLC
Inventor: Dominik Roblek , Blaise Hilary Aguera-Arcas , Thomas W. Hume , Marvin Karl Ritter , Brandon Charles Barbello , Kevin I. Kilgour , Mihajlo Velimirovic , Christopher Thornton , Gabriel Oak Taubman , James David Lyon , Jan Heinrich Althaus , Katsiaryna Naliuka , Julian James Odell , Matthew Sharifi , Beat Gfeller
IPC: G06F17/00 , G06F3/16 , G06F16/635 , G06F16/683 , G06N3/08 , G06N20/00
Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products. A computing device stores reference song characterization data and receives digital audio data. The computing device determines whether the digital audio data represents music and then performs a different process to recognize that the digital audio data represents a particular reference song. The computing device then outputs an indication of the particular reference song.
-
公开(公告)号:US10761802B2
公开(公告)日:2020-09-01
申请号:US16148401
申请日:2018-10-01
Applicant: Google LLC
Inventor: Dominik Roblek , Blaise Hilary Aguera-Arcas , Thomas W. Hume , Marvin Karl Ritter , Brandon Charles Barbello , Kevin I. Kilgour , Mihajlo Velimirović , Christopher Thornton , Gabriel Oak Taubman , James David Lyon , Jan Heinrich Althaus , Katsiaryna Naliuka , Julian James Odell , Matthew Sharifi , Beat Gfeller
IPC: G06F17/00 , G06F3/16 , G06F16/635 , G06F16/683 , G06N3/08 , G06N20/00
Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products for indicating a reference song. A computing device stores reference song characterization data that identifies a plurality of audio characteristics for each reference song in a plurality of reference songs. The computing device receives digital audio data that represents audio recorded by a microphone, converts the digital audio data from time-domain format into frequency-domain format, and uses the digital audio data in the frequency-domain format in a music-characterization process. In response to determining that characterization values for the digital audio data are most relevant to characterization values for a particular reference song, the computing device outputs an indication of the particular reference song.
-
-
-
-