- 专利标题: FREQUENCY BASED AUDIO ANALYSIS USING NEURAL NETWORKS
-
申请号: US15151362申请日: 2016-05-10
-
公开(公告)号: US20170330586A1公开(公告)日: 2017-11-16
- 发明人: Dominik Roblek , Matthew Sharifi
- 申请人: Google Inc.
- 主分类号: G10L25/30
- IPC分类号: G10L25/30 ; G06F11/07 ; G06N3/08
摘要:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for frequency based audio analysis using neural networks. One of the methods includes training a neural network that includes a plurality of neural network layers on training data, wherein the neural network is configured to receive frequency domain features of an audio sample and to process the frequency domain features to generate a neural network output for the audio sample, wherein the neural network comprises (i) a convolutional layer that is configured to map frequency domain features to logarithmic scaled frequency domain features, wherein the convolutional layer comprises one or more convolutional layer filters, and (ii) one or more other neural network layers having respective layer parameters that are configured to process the logarithmic scaled frequency domain features to generate the neural network output.
公开/授权文献
- US10460747B2 Frequency based audio analysis using neural networks 公开/授权日:2019-10-29
信息查询