Systems and Methods for Jointly Estimating Sound Sources and Frequencies from Audio

Invention Application

US20220351747A1 Systems and Methods for Jointly Estimating Sound Sources and Frequencies from Audio (legal status: in force)

Patent Title: Systems and Methods for Jointly Estimating Sound Sources and Frequencies from Audio
Application No.: US17751471

Application Date: 2022-05-23
Publication No.: US20220351747A1

Publication Date: 2022-11-03
Inventor: Andreas Jansson, Rachel Bittner
Applicant: Spotify AB
Applicant Address: SE Stockholm
Assignee: Spotify AB
Current Assignee: Spotify AB
Current Assignee Address: SE Stockholm
Main IPC: G10L25/51
IPC: G10L25/51; G06N20/00; G06N3/04; G06N3/08; H04L65/75

Abstract:

An electronic device receives a first audio content item that includes a plurality of sound sources. The electronic device generates a representation of the first audio content item. The electronic device determines, from the representation of the first audio content item: a representation of an isolated sound source, and frequency data associated with the isolated sound source. Determining the representation of the isolated sound source and the frequency data associated with the isolated sound source includes using a neural network to jointly determine the representation of the isolated sound source and the frequency data associated with the isolated sound source. The electronic device determines that a portion of a second audio content item matches the first audio content item using the representation of the isolated sound source and/or the frequency data associated with the isolated sound source.
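
The abstract does not specify a network architecture, input representation, or matching rule, so the following is only a minimal sketch of the general idea under stated assumptions: a magnitude spectrogram stands in for the "representation", a tiny untrained two-head network with one shared layer stands in for the jointly estimating neural network, and a sliding cosine similarity stands in for the matching step. All function names, parameter names, and sizes are hypothetical and are not taken from the application.

```python
import numpy as np

def magnitude_spectrogram(signal, frame_len=256, hop=128):
    """Assumed 'representation': short-time Fourier magnitudes, shape (bins, frames)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1)).T

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def init_params(n_bins, n_hidden=32, seed=0):
    """Random, untrained weights; a real system would learn these from data."""
    rng = np.random.default_rng(seed)
    return {
        "w_shared": rng.normal(0, 0.01, (n_hidden, n_bins)),
        "w_mask": rng.normal(0, 0.01, (n_bins, n_hidden)),
        "w_freq": rng.normal(0, 0.01, (n_bins, n_hidden)),
    }

def joint_estimate(spec, params):
    """One shared layer feeds two heads, so both outputs come from one forward pass."""
    hidden = np.maximum(0.0, params["w_shared"] @ spec)  # shared ReLU features
    mask = sigmoid(params["w_mask"] @ hidden)            # head 1: soft source mask
    salience = sigmoid(params["w_freq"] @ hidden)        # head 2: per-bin frequency salience
    return mask * spec, salience                         # isolated source, frequency data

def best_match_offset(query, candidate):
    """Slide `query` along the time axis of `candidate`; return the best frame offset
    and its cosine similarity. Illustrative matching rule only."""
    width = query.shape[1]
    q = query.ravel()
    scores = []
    for start in range(candidate.shape[1] - width + 1):
        c = candidate[:, start : start + width].ravel()
        scores.append(q @ c / (np.linalg.norm(q) * np.linalg.norm(c) + 1e-12))
    best = int(np.argmax(scores))
    return best, float(scores[best])
```

In this sketch, the two outputs of `joint_estimate` for a first audio item could be passed as `query` to `best_match_offset`, with the corresponding features of a second item as `candidate`, to locate the portion of the second item that most resembles the first. Whether the application's matching works this way is not stated in the abstract.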

Published/Granted Literature

US11862187B2 Systems and methods for jointly estimating sound sources and frequencies from audio. Publication/grant date: 2024-01-02

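IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L25/00	Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 (muting semiconductor-based amplifiers when some special characteristics of a signal are sensed by a speech detector, e.g. sensing when no signal is present, H03G3/34)
G10L25/48	. specially adapted for particular use
G10L25/51	.. for comparison or discrimination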