Speech recognition using convolutional neural networks

Invention Grant

US10586531B2 Speech recognition using convolutional neural networks (Active)

Patent Title: Speech recognition using convolutional neural networks
Application No.: US16209661

Application Date: 2018-12-04
Publication No.: US10586531B2

Publication Date: 2020-03-10
Inventor: Aaron Gerard Antonius van den Oord, Sander Etienne Lea Dieleman, Nal Emmerich Kalchbrenner, Karen Simonyan, Oriol Vinyals, Lasse Espeholt
Applicant: DeepMind Technologies Limited
Applicant Address: GB London
Assignee: DeepMind Technologies Limited
Current Assignee: DeepMind Technologies Limited
Current Assignee Address: GB London
Agency: Fish & Richardson P.C.
Main IPC: G10L15/16
IPC: G10L15/16; G06N3/04; G10L15/02; G10L15/22; G06N3/08

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech recognition by generating a neural network output from an audio data input sequence, where the neural network output characterizes words spoken in the audio data input sequence. One of the methods includes, for each of the audio data inputs, providing a current audio data input sequence that comprises the audio data input and the audio data inputs preceding the audio data input in the audio data input sequence to a convolutional subnetwork comprising a plurality of dilated convolutional neural network layers, wherein the convolutional subnetwork is configured to, for each of the plurality of audio data inputs: receive the current audio data input sequence for the audio data input, and process the current audio data input sequence to generate an alternative representation for the audio data input.
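
The abstract's central idea, that the representation for each audio input is computed only from that input and the inputs preceding it, through a stack of dilated convolutional layers, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function names, the filter size of 2, the all-ones weights, the dilations 1, 2, 4, zero padding on the left, and the omission of nonlinearities and of the output stage.

```python
from typing import List, Sequence


def causal_dilated_conv(xs: Sequence[float],
                        weights: Sequence[float],
                        dilation: int) -> List[float]:
    """1-D causal dilated convolution (hypothetical helper).

    out[t] depends only on xs[t], xs[t - d], xs[t - 2d], ...
    Positions before the start of the sequence are treated as zero.
    """
    k = len(weights)
    out = []
    for t in range(len(xs)):
        acc = 0.0
        for j, w in enumerate(weights):
            # The last weight lines up with the current time step t.
            idx = t - (k - 1 - j) * dilation
            if idx >= 0:
                acc += w * xs[idx]
        out.append(acc)
    return out


def dilated_stack(xs: Sequence[float],
                  layer_weights: Sequence[Sequence[float]],
                  dilations: Sequence[int]) -> List[float]:
    """Apply several causal dilated layers in sequence (no nonlinearity)."""
    h = list(xs)
    for w, d in zip(layer_weights, dilations):
        h = causal_dilated_conv(h, w, d)
    return h


def receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Number of input steps that can influence one output step."""
    return 1 + (kernel_size - 1) * sum(dilations)


# Assumed toy configuration: three layers, filter size 2, dilations 1, 2, 4.
dilations = [1, 2, 4]
layer_weights = [[1.0, 1.0]] * len(dilations)

audio = [1.0] * 10
representation = dilated_stack(audio, layer_weights, dilations)
print(receptive_field(2, dilations))
print(representation)
```

With all-ones weights, each output is simply the sum of the inputs inside its receptive field, which makes the effect of doubling the dilation easy to see: three layers of filter size 2 already cover 8 time steps. Changing a later input leaves every earlier output unchanged, which is the causal property the abstract describes.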

Public/Granted literature

US20190108833A1 SPEECH RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS Public/Granted day: 2019-04-11

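IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/08	. Speech classification or search
G10L15/16	.. using artificial neural networks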