Singing voice separation with deep U-Net convolutional networks

Invention Grant

US10991385B2 Singing voice separation with deep U-Net convolutional networks (in force)

Patent Title: Singing voice separation with deep U-Net convolutional networks
Application No.: US16165498

Application Date: 2018-10-19
Publication No.: US10991385B2

Publication Date: 2021-04-27
Inventor: Andreas Simon Thore Jansson, Angus William Sackfield, Ching Chuan Sung, David Rubinstein
Applicant: Spotify AB
Applicant Address: SE Stockholm
Assignee: Spotify AB
Current Assignee: Spotify AB
Current Assignee Address: SE Stockholm
Agency: Merchant & Gould P.C.
Main IPC: G10L25/81
IPC: G10L25/81; G10L21/06; G10L25/18; G10L15/16; G06N3/08

Abstract:

A system, method and computer product for estimating a component of a provided audio signal. The method comprises converting the provided audio signal to an image, processing the image with a neural network trained to estimate one of vocal content and instrumental content, and storing a spectral mask output from the neural network as a result of the image being processed by the neural network. The neural network is a U-Net. The method also comprises providing the spectral mask to a client media playback device, which applies the spectral mask to a spectrogram of the provided audio signal, to provide a masked spectrogram. The media playback device also transforms the masked spectrogram to an audio signal, and plays back that audio signal via an output user interface.
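
The client-side steps in the abstract (apply the spectral mask to a spectrogram of the provided audio, then transform the masked spectrogram back to an audio signal) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the mask would come from the trained U-Net, which is not reproduced here, so the caller supplies any array of the right shape. The function names `stft`, `istft` and `separate`, the Hann window, the `n_fft` and `hop` values, and the reuse of the original mixture phase for reconstruction are all assumptions that the abstract does not specify.

```python
import numpy as np


def stft(x, n_fft=1024, hop=256):
    """Short-time Fourier transform; returns an array of shape (frames, bins)."""
    window = np.hanning(n_fft)
    # Zero-pad both ends so every input sample is covered by full frames.
    x = np.pad(x, (n_fft, n_fft))
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop:i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)


def istft(spec, n_fft=1024, hop=256, length=None):
    """Inverse STFT by windowed overlap-add with window-energy normalization."""
    window = np.hanning(n_fft)
    frames = np.fft.irfft(spec, n=n_fft, axis=1) * window
    n_frames = spec.shape[0]
    out = np.zeros(n_fft + hop * (n_frames - 1))
    norm = np.zeros_like(out)
    for i in range(n_frames):
        out[i * hop:i * hop + n_fft] += frames[i]
        norm[i * hop:i * hop + n_fft] += window ** 2
    out /= np.maximum(norm, 1e-8)
    out = out[n_fft:]  # drop the leading padding added in stft()
    return out[:length] if length is not None else out


def separate(audio, mask, n_fft=1024, hop=256):
    """Apply a spectral mask to the magnitude spectrogram and resynthesize.

    Assumption: the phase of the original mixture is reused unchanged.
    """
    spec = stft(audio, n_fft, hop)
    magnitude, phase = np.abs(spec), np.angle(spec)
    masked = mask * magnitude  # the "masked spectrogram" of the abstract
    return istft(masked * np.exp(1j * phase), n_fft, hop, length=len(audio))


if __name__ == "__main__":
    t = np.arange(4096) / 8000.0
    mixture = np.sin(2 * np.pi * 440 * t)
    # Placeholder mask: all ones passes the signal through unchanged.
    mask = np.ones(stft(mixture).shape)
    estimate = separate(mixture, mask)
    print(np.max(np.abs(estimate - mixture)))
```

In this sketch the mask must have the same shape as the spectrogram returned by `stft`. A mask of ones returns the input essentially unchanged and a mask of zeros returns silence; a mask estimated by a network would take values in between, bin by bin.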

Published/Granted literature

US20200043517A1 SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS Publication/grant date: 2020-02-06

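IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L25/00	Speech or voice analysis techniques not restricted to groups G10L 15/00-G10L 21/00 (semiconductor-based muting amplifiers in which a speech detector senses certain special characteristics of a signal, e.g. sensing when no signal is present, are classified in H03G3/34)
G10L25/78	. Detection of the presence or absence of voice signals (switching the direction of transmission by voice frequency in two-way loud-speaking telephone systems is classified in H04M9/10)
G10L25/81	.. Discriminating voice from music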