Patent search ap:("Spotify AB") AND inv:"Andreas Simon Thore JANSSON" Page 1

1.

发明申请
SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS 审中-公开

公开(公告)号：US20200043517A1

公开(公告)日：2020-02-06

申请号：US16165498

申请日：2018-10-19

Applicant: Spotify AB

Inventor： Andreas Simon Thore JANSSON

IPC: G10L25/81 , G10L21/06 , G10L25/18 , G10L15/16 , G06N3/08

Abstract: A system, method and computer product for estimating a component of a provided audio signal. The method comprises converting the provided audio signal to an image, processing the image with a neural network trained to estimate one of vocal content and instrumental content, and storing a spectral mask output from the neural network as a result of the image being processed by the neural network. The neural network is a U-Net. The method also comprises providing the spectral mask to a client media playback device, which applies the spectral mask to a spectrogram of the provided audio signal, to provide a masked spectrogram. The media playback device also transforms the masked spectrogram to an audio signal, and plays back that audio signal via an output user interface.

2.

发明申请
SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS 审中-公开

公开(公告)号：US20200043516A1

公开(公告)日：2020-02-06

申请号：US16055870

申请日：2018-08-06

Applicant: Spotify AB

Inventor： Andreas Simon Thore JANSSON , Angus William Sackfield , Ching Chuan Sung

IPC: G10L25/81 , G06N5/04 , G06N3/08 , G10L15/16 , G10L21/10

Abstract: A system, method and computer product for training a neural network system. The method comprises applying an audio signal to the neural network system, the audio signal including a vocal component and a non-vocal component. The method also comprises comparing an output of the neural network system to a target signal, and adjusting at least one parameter of the neural network system to reduce a result of the comparing, for training the neural network system to estimate one of the vocal component and the non-vocal component. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate vocal or instrumental components of an audio signal, depending on which type of component the system is trained to estimate.

3.

发明申请
SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS 有权

公开(公告)号：US20210256995A1

公开(公告)日：2021-08-19

申请号：US17135127

申请日：2020-12-28

Applicant: Spotify AB

Inventor： Andreas Simon Thore JANSSON , Angus William Sackfield , Ching Chuan Sung

IPC: G10L25/81 , G06N3/08 , G06N5/04 , G10L15/16 , G10L21/10

Abstract: A system, method and computer product for training a neural network system. The method comprises applying an audio signal to the neural network system, the audio signal including a vocal component and a non-vocal component. The method also comprises comparing an output of the neural network system to a target signal, and adjusting at least one parameter of the neural network system to reduce a result of the comparing, for training the neural network system to estimate one of the vocal component and the non-vocal component. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate vocal or instrumental components of an audio signal, depending on which type of component the system is trained to estimate.

4.

发明申请
SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS 有权

公开(公告)号：US20210256994A1

公开(公告)日：2021-08-19

申请号：US17135119

申请日：2020-12-28

Applicant: Spotify AB

Inventor： Andreas Simon Thore JANSSON , Angus William Sackfield , Ching Chuan Sung

IPC: G10L25/81 , G06N3/08 , G06N5/04 , G10L15/16 , G10L21/10

Abstract: A system, method and computer product for training a neural network system. The method comprises applying an audio signal to the neural network system, the audio signal including a vocal component and a non-vocal component. The method also comprises comparing an output of the neural network system to a target signal, and adjusting at least one parameter of the neural network system to reduce a result of the comparing, for training the neural network system to estimate one of the vocal component and the non-vocal component. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate vocal or instrumental components of an audio signal, depending on which type of component the system is trained to estimate.

5.

发明公开
SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS 审中-公开

公开(公告)号：US20240161770A1

公开(公告)日：2024-05-16

申请号：US18514289

申请日：2023-11-20

Applicant: Spotify AB

Inventor： Andreas Simon Thore JANSSON , Angus William Sackfield , Ching Chuan Sung

IPC: G10L25/81 , G06N3/08 , G06N5/046 , G10L15/16 , G10L21/10

CPC classification number: G10L25/81 , G06N3/08 , G06N5/046 , G10L15/16 , G10L21/10

Abstract: A system, method and computer product for training a neural network system. The method comprises applying an audio signal to the neural network system, the audio signal including a vocal component and a non-vocal component. The method also comprises comparing an output of the neural network system to a target signal, and adjusting at least one parameter of the neural network system to reduce a result of the comparing, for training the neural network system to estimate one of the vocal component and the non-vocal component. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate vocal or instrumental components of an audio signal, depending on which type of component the system is trained to estimate.

6.

发明申请
SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS 审中-公开

公开(公告)号：US20200043518A1

公开(公告)日：2020-02-06

申请号：US16242525

申请日：2019-01-08

Applicant: Spotify AB

Inventor： Andreas Simon Thore JANSSON , Angus William Sackfield , Ching Chuan Sung

IPC: G10L25/81 , G06N3/08 , G06N5/04 , G10L21/10 , G10L15/16

Abstract: A system, method and computer product for training a neural network system. The method comprises applying an audio signal to the neural network system, the audio signal including a vocal component and a non-vocal component. The method also comprises comparing an output of the neural network system to a target signal, and adjusting at least one parameter of the neural network system to reduce a result of the comparing, for training the neural network system to estimate one of the vocal component and the non-vocal component. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate vocal or instrumental components of an audio signal, depending on which type of component the system is trained to estimate.

Patent Agency Ranking