METHODS AND APPARATUS TO FINGERPRINT AN AUDIO SIGNAL VIA NORMALIZATION

    Publication number: US20200082835A1

    Publication date: 2020-03-12

    Application number: US16453654

    Application date: 2019-06-26

    Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to fingerprint audio via mean normalization. An example apparatus for audio fingerprinting includes a frequency range separator to transform an audio signal into a frequency domain, the transformed audio signal including a plurality of time-frequency bins including a first time-frequency bin, an audio characteristic determiner to determine a first characteristic of a first group of time-frequency bins of the plurality of time-frequency bins, the first group of time-frequency bins surrounding the first time-frequency bin and a signal normalizer to normalize the audio signal to thereby generate normalized energy values, the normalizing of the audio signal including normalizing the first time-frequency bin by the first characteristic. The example apparatus further includes a point selector to select one of the normalized energy values and a fingerprint generator to generate a fingerprint of the audio signal using the selected one of the normalized energy values.
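
The normalization step in this abstract can be illustrated with a minimal sketch. It assumes the audio has already been transformed into a grid of energy values indexed as `energy[frequency_bin][time_frame]`. The window size, the choice to exclude the center bin from its own mean, and the function names `normalize_by_local_mean` and `select_points` are illustrative assumptions, not details taken from the filing.

```python
from typing import List, Tuple

Grid = List[List[float]]  # energy[frequency_bin][time_frame]


def normalize_by_local_mean(energy: Grid, half_f: int = 1, half_t: int = 1,
                            eps: float = 1e-12) -> Grid:
    """Divide each bin by the mean energy of the bins surrounding it.

    Assumption: the surrounding group is a rectangular window that excludes
    the center bin; the abstract does not specify the window shape.
    """
    n_f, n_t = len(energy), len(energy[0])
    out = [[0.0] * n_t for _ in range(n_f)]
    for f in range(n_f):
        for t in range(n_t):
            total, count = 0.0, 0
            for ff in range(max(0, f - half_f), min(n_f, f + half_f + 1)):
                for tt in range(max(0, t - half_t), min(n_t, t + half_t + 1)):
                    if (ff, tt) != (f, t):
                        total += energy[ff][tt]
                        count += 1
            mean = total / count if count else 0.0
            out[f][t] = energy[f][t] / (mean + eps)
    return out


def select_points(normalized: Grid, k: int = 2) -> List[Tuple[int, int]]:
    """Return the (frequency, time) coordinates of the k largest normalized values."""
    cells = [(v, f, t) for f, row in enumerate(normalized) for t, v in enumerate(row)]
    cells.sort(key=lambda c: (-c[0], c[1], c[2]))
    return sorted((f, t) for _, f, t in cells[:k])
```

Because every bin is divided by a local mean, scaling the whole signal by a constant leaves the selected points essentially unchanged, which is one plausible reason to normalize before selecting fingerprint points.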

    SYSTEMS, METHODS, AND APPARATUS TO IMPROVE MEDIA IDENTIFICATION

    Publication number: US20200081914A1

    Publication date: 2020-03-12

    Application number: US16528237

    Application date: 2019-07-31

    Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to improve media identification. An example apparatus includes a hash handler to generate a first set of reference matches by performing hash functions on a subset of media data associated with media to generate hashed media data based on a first bucket size, a candidate determiner to identify a second set of reference matches that include ones of the first set, the second set including ones having first quantities of hits that did not satisfy a threshold, determine second quantities of hits for ones of the second set by matching ones to the hash tables based on a second bucket size, and identify one or more candidate matches based on at least one of (1) ones of the first set or (2) ones of the second set, and a report generator to generate a report including a media identification.

    Detecting an event within interactive media

    Publication number: US10156894B2

    Publication date: 2018-12-18

    Application number: US16017170

    Application date: 2018-06-25

    Abstract: As a user is being presented with interactive media by a presenting device, a separate monitoring device may be used to monitor the presentation of the interactive media and detect an event that occurs therein. Such a monitoring device may be configured and positioned to access media content from the presentation of the interactive media. For example, the monitoring device may be configured and positioned to record video content with a camera and record audio content with a microphone. Having accessed this media content, the monitoring device may generate an identifier, such as a fingerprint or watermark, of the media content and compare the generated identifier with a reference identifier that is generated from the source of the media content. Based on the generated identifier matching the reference identifier, the monitoring device may detect that an event has occurred within the interactive media presentation and present a corresponding notification.

    AUTOMATED COVER SONG IDENTIFICATION
    Type: Invention application

    Publication number: US20180189390A1

    Publication date: 2018-07-05

    Application number: US15698557

    Application date: 2017-09-07

    CPC classification number: G06F16/683 G06Q50/184

    Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for automated cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews that occur in cover songs found in content repositories. The systems and methods allow copyright holders to search the content repositories for unlicensed cover songs.
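
One general property that makes a 2DFT representation attractive for this kind of matching is that the magnitude of a 2D discrete Fourier transform does not change when the input patch is circularly shifted along either axis. The sketch below demonstrates that property on a small patch in pure Python. How the filing builds its patches and sequences is not stated in the abstract, so `dft2_magnitude` and `circular_shift` are hypothetical helper names and the direct O(n^4) transform is chosen only for readability.

```python
import cmath
from typing import List

Patch = List[List[float]]  # patch[row][col], e.g. pitch bins by time frames


def dft2_magnitude(patch: Patch) -> Patch:
    """Magnitude of the 2D discrete Fourier transform, computed directly."""
    rows, cols = len(patch), len(patch[0])
    out = [[0.0] * cols for _ in range(rows)]
    for u in range(rows):
        for v in range(cols):
            acc = 0j
            for r in range(rows):
                for c in range(cols):
                    angle = -2 * cmath.pi * (u * r / rows + v * c / cols)
                    acc += patch[r][c] * cmath.exp(1j * angle)
            out[u][v] = abs(acc)
    return out


def circular_shift(patch: Patch, dr: int, dc: int) -> Patch:
    """Shift the patch by dr rows and dc columns, wrapping around the edges."""
    rows, cols = len(patch), len(patch[0])
    return [[patch[(r - dr) % rows][(c - dc) % cols] for c in range(cols)]
            for r in range(rows)]
```

A shift only changes the phase of each Fourier coefficient, so comparing magnitudes lets two patches match even when one is offset in row or column position relative to the other.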

    DIGITAL FINGERPRINT INDEXING
    Type: Invention application

    Publication number: US20170309298A1

    Publication date: 2017-10-26

    Application number: US15134071

    Application date: 2016-04-20

    CPC classification number: G10L25/87 G06F16/683 G10L25/54

    Abstract: A machine accesses audio data that may be included in a media item, and the audio data includes multiple segments. The machine detects a silent segment among non-silent segments of the audio data. The machine generates sub-fingerprints of the non-silent segments by hashing the non-silent segments with a same fingerprinting algorithm, but the machine generates a sub-fingerprint of the silent segment based on a predetermined non-zero value that represents fingerprinted silence. With these sub-fingerprints generated, the machine generates a fingerprint of the audio data, of the media item, or of both, by storing the generated sub-fingerprints mapped to locations of their corresponding segments in the audio data. The machine then indexes the fingerprint by indexing the sub-fingerprints of the non-silent segments, without indexing the sub-fingerprint of the silent segment.
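
The flow described in this abstract (hash non-silent segments, assign a fixed non-zero value to silent ones, then index only the non-silent sub-fingerprints) can be sketched as follows. The silence test, the marker value `SILENCE_FP`, and the truncated SHA-256 stand-in for the fingerprinting algorithm are all assumptions made for illustration; the filing does not name any of them.

```python
import hashlib
from typing import Dict, List, Sequence

SILENCE_FP = 0xFFFFFFFF  # hypothetical non-zero value representing fingerprinted silence


def is_silent(segment: Sequence[float], threshold: float = 1e-4) -> bool:
    """Assumed silence test: mean squared amplitude below a threshold."""
    if not segment:
        return True
    return sum(x * x for x in segment) / len(segment) < threshold


def sub_fingerprint(segment: Sequence[float]) -> int:
    """Silent segments get the fixed marker; others are hashed the same way."""
    if is_silent(segment):
        return SILENCE_FP
    data = ",".join(f"{x:.3f}" for x in segment).encode()
    # Stand-in hash, shifted to 31 bits so it can never equal SILENCE_FP.
    return int.from_bytes(hashlib.sha256(data).digest()[:4], "big") >> 1


def fingerprint(segments: Sequence[Sequence[float]]) -> List[int]:
    """Sub-fingerprints stored by segment position (list index = location)."""
    return [sub_fingerprint(seg) for seg in segments]


def build_index(fp: Sequence[int]) -> Dict[int, List[int]]:
    """Map each non-silent sub-fingerprint to its positions; skip silence."""
    index: Dict[int, List[int]] = {}
    for position, value in enumerate(fp):
        if value != SILENCE_FP:
            index.setdefault(value, []).append(position)
    return index
```

Keeping the silence marker in the fingerprint preserves segment positions, while leaving it out of the index avoids one very common value matching every quiet passage in every media item.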

    AUDIO FINGERPRINTING
    Type: Invention application

    Publication number: US20160217799A1

    Publication date: 2016-07-28

    Application number: US15008042

    Application date: 2016-01-27

    CPC classification number: G10L19/018

    Abstract: A machine may be configured to generate one or more audio fingerprints of one or more segments of audio data. The machine may access audio data to be fingerprinted and divide the audio data into segments. For any given segment, the machine may generate a spectral representation from the segment; generate a vector from the spectral representation; generate an ordered set of permutations of the vector; generate an ordered set of numbers from the permutations of the vector; and generate a fingerprint of the segment of the audio data, which may be considered a sub-fingerprint of the audio data. In addition, the machine or a separate device may be configured to determine a likelihood that candidate audio data matches reference audio data.
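
One way to read the "permutations of the vector" and "ordered set of numbers" steps is as a min-hash over a binary vector derived from the spectrum: each permutation reorders the vector, and the recorded number is the position of the first set bit. That reading is an assumption, as are the names `make_permutations`, `min_hash`, and `match_likelihood`; the abstract does not say how the numbers or the likelihood are computed.

```python
import random
from typing import List, Sequence


def make_permutations(length: int, count: int, seed: int = 0) -> List[List[int]]:
    """Build a fixed, ordered set of pseudo-random permutations of range(length)."""
    rng = random.Random(seed)
    perms = []
    for _ in range(count):
        perm = list(range(length))
        rng.shuffle(perm)
        perms.append(perm)
    return perms


def min_hash(bits: Sequence[int], perms: Sequence[Sequence[int]]) -> List[int]:
    """For each permutation, record where the first set bit lands.

    If no bit is set, the vector length is recorded as a sentinel.
    """
    return [
        next((i for i, src in enumerate(perm) if bits[src]), len(bits))
        for perm in perms
    ]


def match_likelihood(a: Sequence[int], b: Sequence[int]) -> float:
    """Assumed similarity score: fraction of positions where two fingerprints agree."""
    if not a or len(a) != len(b):
        raise ValueError("fingerprints must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)
```

The same permutations must be used for candidate and reference audio, which is why the sketch fixes the random seed.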

    Methods and apparatus to fingerprint an audio signal via exponential normalization

    Publication number: US12235896B2

    Publication date: 2025-02-25

    Application number: US18674678

    Application date: 2024-05-24

    Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to fingerprint an audio signal via exponential normalization. An example apparatus includes an audio segmenter to divide an audio signal into a plurality of audio segments including a first audio segment and a second audio segment, the first audio segment including a first time-frequency bin, the second audio segment including a second time-frequency bin, a mean calculator to determine a first exponential mean value associated with the first time-frequency bin based on a first magnitude of the audio signal associated with the first time-frequency bin and a second exponential mean value associated with the second time-frequency bin based on a second magnitude of the audio signal associated with the second time-frequency bin and the first exponential mean value. The example apparatus further includes a bin normalizer to normalize the first time-frequency bin based on the second exponential mean value and a fingerprint generator to generate a fingerprint of the audio signal based on the normalized first time-frequency bins.
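
The recursion described here, where each exponential mean depends on the current magnitude and the previous mean, and an earlier bin is normalized by a later mean, can be sketched for a single frequency band as below. The smoothing factor `alpha`, the one-frame `lookahead`, and the function names are assumptions; the abstract gives no numeric parameters.

```python
from typing import List, Sequence


def exponential_means(magnitudes: Sequence[float], alpha: float = 0.5) -> List[float]:
    """Running exponential mean: mean[t] = alpha * mag[t] + (1 - alpha) * mean[t - 1].

    The first mean is seeded with the first magnitude (an assumption).
    """
    means: List[float] = []
    prev = None
    for m in magnitudes:
        prev = m if prev is None else alpha * m + (1 - alpha) * prev
        means.append(prev)
    return means


def normalize_with_lookahead(magnitudes: Sequence[float], alpha: float = 0.5,
                             lookahead: int = 1, eps: float = 1e-12) -> List[float]:
    """Normalize each bin by the exponential mean computed `lookahead` frames later."""
    means = exponential_means(magnitudes, alpha)
    last = len(magnitudes) - 1
    return [m / (means[min(i + lookahead, last)] + eps)
            for i, m in enumerate(magnitudes)]
```

For magnitudes `[2.0, 4.0, 4.0]` with `alpha = 0.5`, the means are `[2.0, 3.0, 3.5]`, so the first bin is normalized by 3.0 rather than by its own mean of 2.0.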

    METHODS AND APPARATUS FOR VOLUME ADJUSTMENT

    Publication number: US20250038724A1

    Publication date: 2025-01-30

    Application number: US18917165

    Application date: 2024-10-16

    Abstract: Apparatus, systems, articles of manufacture, and methods for volume adjustment are disclosed herein. An example method includes collecting data corresponding to a volume of an audio signal as the audio signal is output through a device, when an average volume of the audio signal does not satisfy a volume threshold for a specified timespan, determining a difference between the average volume and a desired volume, and applying a gain to the audio signal to adjust the volume of the audio signal to the desired volume, the gain determined based on the difference between the average volume and the desired volume.
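
The gain computation in this abstract amounts to simple arithmetic on levels, sketched below. The sketch assumes that "average volume" means the RMS level of a window of samples expressed in decibels relative to full scale, and that "satisfying the threshold" means lying within a tolerance of the desired level. Both readings, along with the names `level_db`, `required_gain_db`, and `apply_gain`, are illustrative assumptions.

```python
import math
from typing import List, Sequence


def level_db(samples: Sequence[float], floor: float = 1e-12) -> float:
    """RMS level of a window in dB relative to full scale (1.0)."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20.0 * math.log10(max(rms, floor))


def required_gain_db(window: Sequence[float], desired_db: float,
                     tolerance_db: float) -> float:
    """Return the gain needed to reach desired_db, or 0.0 if already within tolerance."""
    difference = desired_db - level_db(window)
    return difference if abs(difference) > tolerance_db else 0.0


def apply_gain(samples: Sequence[float], gain_db: float) -> List[float]:
    """Scale samples by a gain given in dB."""
    factor = 10.0 ** (gain_db / 20.0)
    return [s * factor for s in samples]
```

A window at -20 dB with a desired level of -14 dB yields a gain of 6 dB, which corresponds to multiplying each sample by roughly 2.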

    Transition detector neural network
    Type: Granted invention patent

    Publication number: US12142035B2

    Publication date: 2024-11-12

    Application number: US18539758

    Application date: 2023-12-14

    Abstract: In one aspect, an example method includes (i) extracting a sequence of audio features from a portion of a sequence of media content; (ii) extracting a sequence of video features from the portion of the sequence of media content; (iii) providing the sequence of audio features and the sequence of video features as an input to a transition detector neural network that is configured to classify whether or not a given input includes a transition between different content segments; (iv) obtaining from the transition detector neural network classification data corresponding to the input; (v) determining that the classification data is indicative of a transition between different content segments; and (vi) based on determining that the classification data is indicative of a transition between different content segments, outputting transition data indicating that the portion of the sequence of media content includes a transition between different content segments.
