MASKING SYSTEMS AND METHODS
    Invention patent application

    Publication number: US20210183372A1

    Publication date: 2021-06-17

    Application number: US16717507

    Filing date: 2019-12-17

    Applicant: Spotify AB

    Abstract: Term masking is performed by generating a time-alignment value for a plurality of identifiable units of sound in vocal audio content contained in a mixed audio track, force-aligning each of the plurality of identifiable units of sound to the vocal audio content based on the time-alignment value, thereby generating a plurality of force-aligned identifiable units of sound, identifying from the plurality of force-aligned identifiable units of sound a force-aligned identifiable unit of sound to be muddled, and audio muddling the force-aligned identifiable unit of sound to be muddled.
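
The last two steps of this abstract (identifying a force-aligned unit to be muddled, then muddling it) can be illustrated with a minimal sketch. It assumes the force alignment has already produced start and end times for each unit. The `AlignedUnit` type, the `blocked` term set, and the use of simple attenuation as the "muddling" operation are all hypothetical choices for illustration; the abstract does not specify them, and the sketch operates on a plain list of samples rather than on vocal content separated from a mixed track.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass(frozen=True)
class AlignedUnit:
    """A unit of sound (here, a word) with hypothetical force-alignment times."""
    text: str
    start_s: float  # start time in seconds
    end_s: float    # end time in seconds


def select_units_to_muddle(units: Iterable[AlignedUnit],
                           blocked: Set[str]) -> List[AlignedUnit]:
    """Pick the aligned units whose text is in the (assumed) blocked-term set."""
    return [u for u in units if u.text.lower() in blocked]


def muddle(samples: List[float], sample_rate: int,
           units: Iterable[AlignedUnit], gain: float = 0.0) -> List[float]:
    """Return a copy of `samples` with each unit's time span scaled by `gain`.

    Attenuation is only a stand-in for "audio muddling"; other alterations
    (noise, reversal, replacement) would slot in at the same place.
    """
    out = list(samples)
    for u in units:
        lo = max(0, int(round(u.start_s * sample_rate)))
        hi = min(len(out), int(round(u.end_s * sample_rate)))
        for i in range(lo, hi):
            out[i] *= gain
    return out


# Toy usage: three one-second "words" at 10 samples per second.
aligned = [
    AlignedUnit("hello", 0.0, 1.0),
    AlignedUnit("darn", 1.0, 2.0),
    AlignedUnit("world", 2.0, 3.0),
]
targets = select_units_to_muddle(aligned, blocked={"darn"})
masked = muddle([1.0] * 30, sample_rate=10, units=targets)
```

Only the samples inside the selected unit's time span are changed; everything outside the aligned interval is left as it was.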

    Systems and Methods for Jointly Estimating Sound Sources and Frequencies from Audio

    Publication number: US20220351747A1

    Publication date: 2022-11-03

    Application number: US17751471

    Filing date: 2022-05-23

    Applicant: Spotify AB

    Abstract: An electronic device receives a first audio content item that includes a plurality of sound sources. The electronic device generates a representation of the first audio content item. The electronic device determines, from the representation of the first audio content item: a representation of an isolated sound source, and frequency data associated with the isolated sound source. Determining the representation of the isolated sound source and the frequency data associated with the isolated sound source includes using a neural network to jointly determine the representation of the isolated sound source and the frequency data associated with the isolated sound source. The electronic device determines that a portion of a second audio content item matches the first audio content item using the representation of the isolated sound source and/or the frequency data associated with the isolated sound source.

    Systems and methods for embedding data in media content

    Publication number: US10777177B1

    Publication date: 2020-09-15

    Application number: US16588470

    Filing date: 2019-09-30

    Applicant: Spotify AB

    Abstract: An electronic device determines a first audio event of a first media content item and modifies the first media content item by superimposing a first set of data that corresponds to the first media content item over the first audio event. The first audio event has a first audio profile configured to be presented over a first channel for playback. The first set of data has a second audio profile configured to be presented over the first channel for playback. Playback of the second audio profile is configured to be masked by the first audio profile during playback of the first media content item. The electronic device transmits, to a second electronic device, the modified first media content item.
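
As a rough illustration of superimposing data over an audio event, the sketch below picks the highest-energy frame of a signal as the "audio event" and adds a payload there at a small fraction of that frame's peak level. Everything here is an assumption made for illustration: the function names, the energy-based event detection, and the `relative_level` parameter are hypothetical, and scaling the payload below the event's peak is only a crude stand-in for perceptual masking, not a psychoacoustic model.

```python
from typing import List, Sequence, Tuple


def loudest_frame(samples: Sequence[float], frame_len: int) -> int:
    """Return the start index of the non-overlapping frame with the highest energy."""
    if frame_len <= 0 or len(samples) < frame_len:
        raise ValueError("need at least one full frame of samples")
    best_start, best_energy = 0, -1.0
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        energy = sum(x * x for x in samples[start:start + frame_len])
        if energy > best_energy:
            best_start, best_energy = start, energy
    return best_start


def embed(samples: Sequence[float], payload: Sequence[float], frame_len: int,
          relative_level: float = 0.05) -> Tuple[List[float], int]:
    """Superimpose `payload` on the loudest frame, scaled below that frame's peak.

    Returns the modified samples and the start index of the chosen frame.
    """
    start = loudest_frame(samples, frame_len)
    peak = max(abs(x) for x in samples[start:start + frame_len])
    out = list(samples)
    # Payload values are assumed to lie in [-1, 1]; extra values are ignored.
    for i, value in enumerate(payload[:frame_len]):
        out[start + i] += relative_level * peak * value
    return out, start


# Toy usage: a quiet signal with one loud four-sample event in the middle.
signal = [0.1] * 4 + [1.0, -1.0, 1.0, -1.0] + [0.1] * 4
modified, event_start = embed(signal, payload=[1, -1, 1, -1], frame_len=4)
```

The quiet frames are untouched, and the payload rides on the loud event at 5% of its peak amplitude.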

    EXTRACTING SIGNALS FROM PAIRED RECORDINGS
    Invention patent application

    Publication number: US20190043528A1

    Publication date: 2019-02-07

    Application number: US15974767

    Filing date: 2018-05-09

    Applicant: Spotify AB

    Abstract: A system, method and computer product for extracting an activity from recordings. The method comprises searching for signals representing plural versions of a track, determining feature representations of the plural versions of the track identified in the searching, aligning the feature representations determined in the determining, and extracting a time varying activity signal from the feature representations aligned in the aligning.
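
The final extraction step can be sketched under strong simplifying assumptions. Suppose the two versions of a track (for example, one with vocals and one instrumental) have already been reduced to per-frame feature values and aligned frame by frame. A time-varying activity signal can then be approximated by flagging frames where the versions differ by more than a threshold. The function name, the scalar features, and the thresholding rule are hypothetical; the abstract does not state which features or extraction rule are used.

```python
from typing import List, Sequence


def activity_signal(version_a: Sequence[float], version_b: Sequence[float],
                    threshold: float = 0.1) -> List[int]:
    """Return 1 for frames where the aligned versions differ by more than `threshold`.

    Both inputs are assumed to be per-frame feature values (e.g. energies)
    that have already been aligned to the same length.
    """
    if len(version_a) != len(version_b):
        raise ValueError("feature sequences must already be aligned to the same length")
    return [1 if abs(a - b) > threshold else 0 for a, b in zip(version_a, version_b)]


# Toy usage: the versions differ only in the two middle frames.
with_vocals = [0.5, 0.9, 0.8, 0.5]
instrumental = [0.5, 0.4, 0.45, 0.5]
activity = activity_signal(with_vocals, instrumental)
```

Here the activity is 1 only in the two middle frames, where the first version carries content that the second lacks.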

    Systems and methods for jointly estimating sound sources and frequencies from audio

    Publication number: US11862187B2

    Publication date: 2024-01-02

    Application number: US17751471

    Filing date: 2022-05-23

    Applicant: Spotify AB

    CPC classification number: G10L25/51 G06N3/045 G06N3/08 G06N20/00 H04L65/75

    Abstract: An electronic device receives a first audio content item that includes a plurality of sound sources. The electronic device generates a representation of the first audio content item. The electronic device determines, from the representation of the first audio content item: a representation of an isolated sound source, and frequency data associated with the isolated sound source. Determining the representation of the isolated sound source and the frequency data associated with the isolated sound source includes using a neural network to jointly determine the representation of the isolated sound source and the frequency data associated with the isolated sound source. The electronic device determines that a portion of a second audio content item matches the first audio content item using the representation of the isolated sound source and/or the frequency data associated with the isolated sound source.

    Masking systems and methods
    Granted invention patent

    Publication number: US11574627B2

    Publication date: 2023-02-07

    Application number: US17379325

    Filing date: 2021-07-19

    Applicant: Spotify AB

    Abstract: Term masking is performed by generating a time-alignment value for a plurality of units of sound in vocal audio content contained in a mixed audio track, force-aligning each of the plurality of units of sound to the vocal audio content based on the time-alignment value, thereby generating a plurality of force-aligned identifiable units of sound, identifying from the plurality of force-aligned units of sound a force-aligned unit of sound to be altered, and altering the identified force-aligned unit of sound.
