Invention Grant
- Patent Title: Singing voice separation with deep U-Net convolutional networks
- Application No.: US16165498
- Application Date: 2018-10-19
- Publication No.: US10991385B2
- Publication Date: 2021-04-27
- Inventor: Andreas Simon Thore Jansson, Angus William Sackfield, Ching Chuan Sung, David Rubinstein
- Applicant: Spotify AB
- Applicant Address: SE Stockholm
- Assignee: Spotify AB
- Current Assignee: Spotify AB
- Current Assignee Address: SE Stockholm
- Agency: Merchant & Gould P.C.
- Main IPC: G10L25/81
- IPC: G10L25/81; G10L21/06; G10L25/18; G10L15/16; G06N3/08

Abstract:
A system, method and computer product for estimating a component of a provided audio signal. The method comprises converting the provided audio signal to an image, processing the image with a neural network trained to estimate one of vocal content and instrumental content, and storing a spectral mask output from the neural network as a result of the image being processed by the neural network. The neural network is a U-Net. The method also comprises providing the spectral mask to a client media playback device, which applies the spectral mask to a spectrogram of the provided audio signal, to provide a masked spectrogram. The media playback device also transforms the masked spectrogram to an audio signal, and plays back that audio signal via an output user interface.
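The playback-side steps in the abstract (apply the spectral mask to a spectrogram, then transform the masked spectrogram back to audio) can be illustrated with a toy sketch. This is not the patented implementation: the U-Net is not modelled at all, and the mask here is written by hand as a stand-in for the network's output. The names `dft`, `idft`, and `apply_mask`, the single 16-sample frame, and the choice to mask the complex spectrum (which is equivalent to masking the magnitude and reusing the mixture's phase) are all assumptions made for illustration.

```python
import cmath
import math


def dft(frame):
    """Naive discrete Fourier transform of one frame (O(n^2), fine for a toy)."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT; returns the real part of each reconstructed sample."""
    n = len(spectrum)
    return [
        (sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]


def apply_mask(spectrogram, mask):
    """Multiply every time-frequency bin by its mask value (expected in [0, 1])."""
    return [
        [m * s for m, s in zip(mask_row, spec_row)]
        for mask_row, spec_row in zip(mask, spectrogram)
    ]


N = 16  # hypothetical frame length

# Toy "mixture": a target component at bin 2 plus an interfering one at bin 5.
target = [math.sin(2 * math.pi * 2 * t / N) for t in range(N)]
other = [0.5 * math.sin(2 * math.pi * 5 * t / N) for t in range(N)]
mixture = [a + b for a, b in zip(target, other)]

# One-frame "spectrogram" of the mixture (rows are frames, columns are bins).
spectrogram = [dft(mixture)]

# Hand-written binary mask standing in for the network output:
# keep bin 2 and its mirror bin N - 2, zero everything else.
mask = [[1.0 if k in (2, N - 2) else 0.0 for k in range(N)]]

masked = apply_mask(spectrogram, mask)
estimate = idft(masked[0])

# Largest sample-wise deviation between the estimate and the true target.
max_error = max(abs(e - t) for e, t in zip(estimate, target))
```

In a real system the spectrogram would span many overlapping windowed frames and the mask would typically be soft (values between 0 and 1) rather than binary; the element-wise multiplication itself would stay the same.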
Public/Granted literature
- US20200043517A1 SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS, Public/Granted day: 2020-02-06