FREQUENCY BASED AUDIO ANALYSIS USING NEURAL NETWORKS

Patent application

US20170330586A1 FREQUENCY BASED AUDIO ANALYSIS USING NEURAL NETWORKS Pending (published)

Patent title: FREQUENCY BASED AUDIO ANALYSIS USING NEURAL NETWORKS
Application number: US15151362
Filing date: 2016-05-10
Publication number: US20170330586A1
Publication date: 2017-11-16
Inventors: Dominik Roblek, Matthew Sharifi
Applicant: Google Inc.
Main classification: G10L25/30
IPC classifications: G10L25/30; G06F11/07; G06N3/08

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for frequency based audio analysis using neural networks. One of the methods includes training a neural network that includes a plurality of neural network layers on training data, wherein the neural network is configured to receive frequency domain features of an audio sample and to process the frequency domain features to generate a neural network output for the audio sample, wherein the neural network comprises (i) a convolutional layer that is configured to map frequency domain features to logarithmic scaled frequency domain features, wherein the convolutional layer comprises one or more convolutional layer filters, and (ii) one or more other neural network layers having respective layer parameters that are configured to process the logarithmic scaled frequency domain features to generate the neural network output.
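The mapping described in the abstract, from frequency domain features to logarithmically scaled frequency domain features, can be illustrated with a small sketch. This is not the patented method: the abstract does not say how the convolutional layer filters are shaped, initialised, or trained, so the fixed triangular filters, the 16 kHz sample rate, the 512-sample frame, the 40 bands, and all function names below are assumptions chosen for illustration. The same filters are applied to every time frame, which corresponds to a convolution of width 1 along the time axis.

```python
import numpy as np

def log_spaced_filterbank(num_bins, num_bands, sample_rate, f_min=50.0):
    """Hypothetical filters: triangles with logarithmically spaced edges.

    Returns an array of shape (num_bands, num_bins). In a trained network
    these weights would be learned; here they are fixed stand-ins.
    """
    nyquist = sample_rate / 2.0
    bin_freqs = np.linspace(0.0, nyquist, num_bins)       # linear FFT bin centres
    edges = np.geomspace(f_min, nyquist, num_bands + 2)   # log-spaced band edges
    filters = np.zeros((num_bands, num_bins))
    for b in range(num_bands):
        lo, centre, hi = edges[b], edges[b + 1], edges[b + 2]
        rising = (bin_freqs - lo) / (centre - lo)
        falling = (hi - bin_freqs) / (hi - centre)
        filters[b] = np.clip(np.minimum(rising, falling), 0.0, None)
    return filters

def magnitude_frames(signal, frame_len=512, hop=256):
    """Frequency domain features: magnitude spectrum of Hann-windowed frames."""
    window = np.hanning(frame_len)
    starts = range(0, len(signal) - frame_len + 1, hop)
    frames = np.stack([signal[s:s + frame_len] * window for s in starts])
    return np.abs(np.fft.rfft(frames, axis=1))

def apply_filterbank_layer(spectrogram, filters):
    """Apply the same filters to every frame: (frames, bins) -> (frames, bands)."""
    return spectrogram @ filters.T

# Usage: a low tone and a high tone should peak in different log-scaled bands.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
filters = log_spaced_filterbank(num_bins=257, num_bands=40, sample_rate=sample_rate)
low = apply_filterbank_layer(magnitude_frames(np.sin(2 * np.pi * 440 * t)), filters)
high = apply_filterbank_layer(magnitude_frames(np.sin(2 * np.pi * 4000 * t)), filters)
print(low.shape, low.mean(axis=0).argmax(), high.mean(axis=0).argmax())
```

The output of such a layer would then be passed to the other neural network layers mentioned in the abstract; those are omitted here because the abstract gives no detail about them.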

Published/granted documents

US10460747B2 Frequency based audio analysis using neural networks Publication/grant date: 2019-10-29

IPC classification:

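G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L25/00	Speech or voice analysis techniques not restricted to a single one of groups G10L 15/00-G10L 21/00 (muting semiconductor-based amplifiers when some special characteristics of a signal are sensed by a speech detector, e.g. sensing when no signal is present, H03G3/34)
G10L25/27	.characterised by the analysis technique
G10L25/30	..using neural networks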