Methods and apparatus for dynamic volume adjustment via audio classification

    公开(公告)号:US12061840B2

    公开(公告)日:2024-08-13

    申请号:US18453792

    申请日:2023-08-22

    申请人: Gracenote, Inc.

    IPC分类号: G06F3/16 G10L25/51 G10L25/30

    CPC分类号: G06F3/165 G10L25/51 G10L25/30

    摘要: Methods, apparatus, systems and articles of manufacture are disclosed for dynamic volume adjustment via audio classification. Example apparatus include at least one memory; instructions; and at least one processor to execute the instructions to: analyze, with a neural network, a parameter of an audio signal associated with a first volume level to determine a classification group associated with the audio signal; determine an input volume of the audio signal; determine a classification gain value based on the classification group; determine an intermediate gain value as an intermediate between the input volume and the classification gain value by applying a first weight to the input volume and a second weight to the classification gain value; apply the intermediate gain value to the audio signal, the intermediate gain value to modify the first volume level to a second volume level; and apply a compression value to the audio signal, the compression value to modify the second volume level to a third volume level that satisfies a target volume threshold.

    Matching audio fingerprints
    85.
    发明授权

    公开(公告)号:US11954148B2

    公开(公告)日:2024-04-09

    申请号:US17187431

    申请日:2021-02-26

    申请人: Gracenote, Inc.

    摘要: Methods, apparatus, systems and articles of manufacture are disclosed to select reference sub-fingerprints for comparison to query sub-fingerprints based on a determination that a query sub-fingerprint is a match with a reference sub-fingerprint, generate a count vector that stores total counts of matches between the query sub-fingerprints and different subsets of the reference sub-fingerprints, each of the different subsets being aligned to the query sub-fingerprints at a different offset from a reference point, each of the different offsets being mapped by the count vector to a different total count, calculate a maximum count among the total counts, a median of the total counts, and a difference between the maximum count and the median of the total counts, and classify the reference sub-fingerprints as a match with the query sub-fingerprints based on the difference between the maximum count in the count vector and the median.

    Audio fingerprinting
    88.
    发明授权

    公开(公告)号:US11854557B2

    公开(公告)日:2023-12-26

    申请号:US18049882

    申请日:2022-10-26

    申请人: Gracenote, Inc.

    IPC分类号: G06F17/00 G10L19/018

    CPC分类号: G10L19/018

    摘要: A machine may be configured to generate one or more audio fingerprints of one or more segments of audio data. The machine may access audio data to be fingerprinted and divide the audio data into segments. For any given segment, the machine may generate a spectral representation from the segment; generate a vector from the spectral representation; generate an ordered set of permutations of the vector; generate an ordered set of numbers from the permutations of the vector; and generate a fingerprint of the segment of the audio data, which may be considered a sub-fingerprint of the audio data. In addition, the machine or a separate device may be configured to determine a likelihood that candidate audio data matches reference audio data.