MASKING SYSTEMS AND METHODS
    1.
    发明申请

    公开(公告)号:US20210183372A1

    公开(公告)日:2021-06-17

    申请号:US16717507

    申请日:2019-12-17

    Applicant: Spotify AB

    Abstract: Term masking is performed by generating a time-alignment value for a plurality of identifiable units of sound in vocal audio content contained in a mixed audio track, force-aligning each of the plurality of identifiable units of sound to the vocal audio content based on the time-alignment value, thereby generating a plurality of force-aligned identifiable units of sound, identifying from the plurality of force-aligned identifiable units of sound a force-aligned identifiable unit of sound to be muddled, and audio muddling the force-aligned identifiable unit of sound to be muddled.

    CUEPOINT DETERMINATION SYSTEM
    2.
    发明申请

    公开(公告)号:US20210103422A1

    公开(公告)日:2021-04-08

    申请号:US16595404

    申请日:2019-10-07

    Applicant: SPOTIFY AB

    Abstract: A cuepoint determination system utilizes a convolutional neural network (CNN) to determine cuepoint placements within media content items to facilitate smooth transitions between them. For example, audio content from a media content item is normalized to a plurality of beats, the beats are partitioned into temporal sections, and acoustic feature groups are extracted from each beat in one or more of the temporal sections. The acoustic feature groups include at least downbeat confidence, position in bar, peak loudness, timbre and pitch. The extracted acoustic feature groups for each beat are provided as input to the CNN on a per temporal section basis to predict whether a beat immediately following the temporal section within the media content item is a candidate for cuepoint placement. A cuepoint placement is then determined from among the candidate cuepoint placements predicted by the CNN.

    Cuepoint determination system
    3.
    发明授权

    公开(公告)号:US11714594B2

    公开(公告)日:2023-08-01

    申请号:US16595404

    申请日:2019-10-07

    Applicant: SPOTIFY AB

    Abstract: A cuepoint determination system utilizes a convolutional neural network (CNN) to determine cuepoint placements within media content items to facilitate smooth transitions between them. For example, audio content from a media content item is normalized to a plurality of beats, the beats are partitioned into temporal sections, and acoustic feature groups are extracted from each beat in one or more of the temporal sections. The acoustic feature groups include at least downbeat confidence, position in bar, peak loudness, timbre and pitch. The extracted acoustic feature groups for each beat are provided as input to the CNN on a per temporal section basis to predict whether a beat immediately following the temporal section within the media content item is a candidate for cuepoint placement. A cuepoint placement is then determined from among the candidate cuepoint placements predicted by the CNN.

    CUEPOINT DETERMINATION SYSTEM
    4.
    发明公开

    公开(公告)号:US20230409281A1

    公开(公告)日:2023-12-21

    申请号:US18335060

    申请日:2023-06-14

    Applicant: Spotify AB

    Abstract: A cuepoint determination system utilizes a convolutional neural network (CNN) to determine cuepoint placements within media content items to facilitate smooth transitions between them. For example, audio content from a media content item is normalized to a plurality of beats, the beats are partitioned into temporal sections, and acoustic feature groups are extracted from each beat in one or more of the temporal sections. The acoustic feature groups include at least downbeat confidence, position in bar, peak loudness, timbre and pitch. The extracted acoustic feature groups for each beat are provided as input to the CNN on a per temporal section basis to predict whether a beat immediately following the temporal section within the media content item is a candidate for cuepoint placement. A cuepoint placement is then determined from among the candidate cuepoint placements predicted by the CNN.

    Masking systems and methods
    5.
    发明授权

    公开(公告)号:US11574627B2

    公开(公告)日:2023-02-07

    申请号:US17379325

    申请日:2021-07-19

    Applicant: Spotify AB

    Abstract: Term masking is performed by generating a time-alignment value for a plurality of units of sound in vocal audio content contained in a mixed audio track, force-aligning each of the plurality of units of sound to the vocal audio content based on the time-alignment value, thereby generating a plurality of force-aligned identifiable units of sound, identifying from the plurality of force-aligned units of sound a force-aligned unit of sound to be altered, and altering the identified force-aligned unit of sound.

    Masking systems and methods
    6.
    发明授权

    公开(公告)号:US11087744B2

    公开(公告)日:2021-08-10

    申请号:US16717507

    申请日:2019-12-17

    Applicant: Spotify AB

    Abstract: Term masking is performed by generating a time-alignment value for a plurality of identifiable units of sound in vocal audio content contained in a mixed audio track, force-aligning each of the plurality of identifiable units of sound to the vocal audio content based on the time-alignment value, thereby generating a plurality of force-aligned identifiable units of sound, identifying from the plurality of force-aligned identifiable units of sound a force-aligned identifiable unit of sound to be muddled, and audio muddling the force-aligned identifiable unit of sound to be muddled.

Patent Agency Ranking