Patent search ap:("Google LLC") AND inv:"Mauro Verzetti" Page 1

1.

发明公开
GENERATING AUDIO USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS 审中-公开

公开(公告)号：US20240079001A1

公开(公告)日：2024-03-07

申请号：US18463196

申请日：2023-09-07

Applicant: Google LLC

Inventor： Andrea Agostinelli , Timo Immanuel Denk , Antoine Caillon , Neil Zeghidour , Jesse Engel , Mauro Verzetti , Christian Frank , Zalán Borsos , Matthew Sharifi , Adam Joseph Roberts

IPC: G10L15/16 , G10H1/00 , G10L15/06 , G10L15/18

CPC classification number: G10L15/16 , G10H1/0008 , G10L15/063 , G10L15/1815 , G10H2210/056 , G10H2250/311

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction of an audio signal. One of the methods includes receiving a request to generate an audio signal conditioned on an input; processing the input using an embedding neural network to map the input to one or more embedding tokens; generating a semantic representation of the audio signal; generating, using one or more generative neural networks and conditioned on at least the semantic representation and the embedding tokens, an acoustic representation of the audio signal; and processing at least the acoustic representation using a decoder neural network to generate the prediction of the audio signal.

2.

发明授权
Generating audio using auto-regressive generative neural networks 有权

公开(公告)号：US11915689B1

公开(公告)日：2024-02-27

申请号：US18463196

申请日：2023-09-07

Applicant: Google LLC

Inventor： Andrea Agostinelli , Timo Immanuel Denk , Antoine Caillon , Neil Zeghidour , Jesse Engel , Mauro Verzetti , Christian Frank , Zalán Borsos , Matthew Sharifi , Adam Joseph Roberts , Marco Tagliasacchi

IPC: G06F40/30 , G10L15/16 , G10L15/18 , G10H1/00 , G10L15/06

CPC classification number: G10L15/16 , G10H1/0008 , G10L15/063 , G10L15/1815 , G10H2210/056 , G10H2250/311

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction of an audio signal. One of the methods includes receiving a request to generate an audio signal conditioned on an input; processing the input using an embedding neural network to map the input to one or more embedding tokens; generating a semantic representation of the audio signal; generating, using one or more generative neural networks and conditioned on at least the semantic representation and the embedding tokens, an acoustic representation of the audio signal; and processing at least the acoustic representation using a decoder neural network to generate the prediction of the audio signal.

3.

发明公开
GENERATING AUDIO USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS 审中-公开

公开(公告)号：US20240233713A1

公开(公告)日：2024-07-11

申请号：US18412394

申请日：2024-01-12

Applicant: Google LLC

Inventor： Andrea Agostinelli , Timo Immanuel Denk , Antoine Caillon , Neil Zeghidour , Jesse Engel , Mauro Verzetti , Christian Frank , Zalán Borsos , Matthew Sharifi , Adam Joseph Roberts , Marco Tagliasacchi

IPC: G10L15/16 , G06N3/0455 , G06N3/0475 , G10H1/00 , G10L15/06 , G10L15/18

CPC classification number: G10L15/16 , G06N3/0455 , G06N3/0475 , G10H1/0008 , G10L15/063 , G10L15/1815 , G10H2210/056 , G10H2250/311

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction of an audio signal. One of the methods includes receiving a request to generate an audio signal conditioned on an input; processing the input using an embedding neural network to map the input to one or more embedding tokens; generating a semantic representation of the audio signal; generating, using one or more generative neural networks and conditioned on at least the semantic representation and the embedding tokens, an acoustic representation of the audio signal; and processing at least the acoustic representation using a decoder neural network to generate the prediction of the audio signal.

Patent Agency Ranking