-
公开(公告)号:US20200082835A1
公开(公告)日:2020-03-12
申请号:US16453654
申请日:2019-06-26
Applicant: Gracenote, Inc.
Inventor: Robert Coover , Zafar Rafii
Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to fingerprint audio via mean normalization. An example apparatus for audio fingerprinting includes a frequency range separator to transform an audio signal into a frequency domain, the transformed audio signal including a plurality of time-frequency bins including a first time-frequency bin, an audio characteristic determiner to determine a first characteristic of a first group of time-frequency bins of the plurality of time-frequency bins, the first group of time-frequency bins surrounding the first time-frequency bin and a signal normalizer to normalize the audio signal to thereby generate normalized energy values, the normalizing of the audio signal including normalizing the first time-frequency bin by the first characteristic. The example apparatus further includes a point selector to select one of the normalized energy values and a fingerprint generator to generate a fingerprint of the audio signal using the selected one of the normalized energy values.
-
公开(公告)号:US20200081914A1
公开(公告)日:2020-03-12
申请号:US16528237
申请日:2019-07-31
Applicant: Gracenote, Inc.
Inventor: Jeffrey Scott , Matthew James Wilkinson , Robert Coover , Shashank Merchant
IPC: G06F16/683 , G06K9/00 , G06F16/65 , G06K9/62 , G06F16/901
Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to improve media identification. An example apparatus includes a hash handler to generate a first set of reference matches by performing hash functions on a subset of media data associated with media to generate hashed media data based on a first bucket size, a candidate determiner to identify a second set of reference matches that include ones of the first set, the second set including ones having first quantities of hits that did not satisfy a threshold, determine second quantities of hits for ones of the second set by matching ones to the hash tables based on a second bucket size, and identify one or more candidate matches based on at least one of (1) ones of the first set or (2) ones of the second set, and a report generator to generate a report including a media identification.
-
公开(公告)号:US10156894B2
公开(公告)日:2018-12-18
申请号:US16017170
申请日:2018-06-25
Applicant: Gracenote, Inc.
Inventor: Jeff Benson , Michael Gubman , Craig Kawahara , Robert Coover , Markus K. Cremer , Andy Mai
Abstract: As a user is being presented with interactive media by a presenting device, a separate monitoring device may be used to monitor the presentation of the interactive media and detect an event that occurs therein. Such a monitoring device may be configured and positioned to access media content from the presentation of the interactive media. For example, the monitoring device may be configured and positioned to record video content with a camera and record audio content with a microphone. Having accessed this media content, the monitoring device may generate an identifier, such as a fingerprint or watermark, of the media content and compare the generated identifier with a reference identifier that is generated from the source of the media content. Based on the generated identifier matching the reference identifier, the monitoring device may detect that an event has occurred within the interactive media presentation and present a corresponding notification.
-
公开(公告)号:US20180189390A1
公开(公告)日:2018-07-05
申请号:US15698557
申请日:2017-09-07
Applicant: Gracenote, Inc.
Inventor: Markus K. Cremer , Zafar Rafii , Robert Coover , Prem Seetharaman
IPC: G06F17/30
CPC classification number: G06F16/683 , G06Q50/184
Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for automated cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews that occur in cover songs found in content repositories. The systems and methods allow copyright holders to search the content repositories for unlicensed cover song.
-
公开(公告)号:US20170309298A1
公开(公告)日:2017-10-26
申请号:US15134071
申请日:2016-04-20
Applicant: Gracenote, Inc.
Inventor: Jeffrey Scott , Markus K. Cremer , Robert Coover
IPC: G10L25/87 , G10L25/54 , G10L19/018
CPC classification number: G10L25/87 , G06F16/683 , G10L25/54
Abstract: A machine accesses audio data that may be included in a media item, and the audio data includes multiple segments. The machine detects a silent segment among non-silent segments of the audio data. The machine generates sub-fingerprints of the non-silent segments by hashing the non-silent segments with a same fingerprinting algorithm, but the machine generates a sub-fingerprint of the silent segment based on a predetermined non-zero value that represents fingerprinted silence. With these sub-fingerprints generated, the machine generates a fingerprint of the audio data, of the media item, or of both, by storing the generated sub-fingerprints mapped to locations of their corresponding segments in the audio data. The machine then indexes the fingerprint by indexing the sub-fingerprints of the non-silent segments, without indexing the sub-fingerprint of the silent segment.
-
公开(公告)号:US20160217799A1
公开(公告)日:2016-07-28
申请号:US15008042
申请日:2016-01-27
Applicant: Gracenote, Inc.
Inventor: Jinyu Han , Robert Coover
IPC: G10L19/018
CPC classification number: G10L19/018
Abstract: A machine may be configured to generate one or more audio fingerprints of one or more segments of audio data. The machine may access audio data to be fingerprinted and divide the audio data into segments. For any given segment, the machine may generate a spectral representation from the segment; generate a vector from the spectral representation; generate an ordered set of permutations of the vector; generate an ordered set of numbers from the permutations of the vector; and generate a fingerprint of the segment of the audio data, which may be considered a sub-fingerprint of the audio data. In addition, the machine or a separate device may be configured to determine a likelihood that candidate audio data matches reference audio data.
-
公开(公告)号:US12235896B2
公开(公告)日:2025-02-25
申请号:US18674678
申请日:2024-05-24
Applicant: Gracenote, Inc.
Inventor: Alexander Berrian , Matthew James Wilkinson , Robert Coover
IPC: G10L25/51 , G06F16/683 , G10L25/21
Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to fingerprint an audio signal via exponential normalization. An example apparatus includes an audio segmenter to divide an audio signal into a plurality of audio segments including a first audio segment and a second audio segment, the first audio segment including a first time-frequency bin, the second audio segment including a second time-frequency bin, a mean calculator to determine a first exponential mean value associated with the first time frequency bin based on a first magnitude of the audio signal associated with the first time frequency bin and a second exponential mean value associated with the second time frequency bin based on a second magnitude of the audio signal associated with the second time frequency bin and the first exponential mean value. The example apparatus further includes a bin normalizer to normalize the first time-frequency bin based on the second exponential mean value and a fingerprint generator to generate a fingerprint of the audio signal based on the normalized first time-frequency bins.
-
公开(公告)号:US20250038724A1
公开(公告)日:2025-01-30
申请号:US18917165
申请日:2024-10-16
Applicant: GRACENOTE, INC.
Inventor: Robert Coover , Jeffrey Scott , Markus K. Cremer , Aneesh Vartakavi
Abstract: Apparatus, systems, articles of manufacture, and methods for volume adjustment are disclosed herein. An example method includes collecting data corresponding to a volume of an audio signal as the audio signal is output through a device, when an average volume of the audio signal does not satisfy a volume threshold for a specified timespan, determining a difference between the average volume and a desired volume, and applying a gain to the audio signal to adjust the volume of the audio signal to the desired volume, the gain determined based on the difference between the average volume and the desired volume.
-
公开(公告)号:US12142035B2
公开(公告)日:2024-11-12
申请号:US18539758
申请日:2023-12-14
Applicant: Gracenote, Inc.
Inventor: Joseph Renner , Aneesh Vartakavi , Robert Coover
IPC: G06V10/82 , G06F18/24 , G06F18/2413 , G06N3/049 , G06N3/08 , G06V10/80 , G06V20/40 , H04N21/234 , H04N21/81
Abstract: In one aspect, an example method includes (i) extracting a sequence of audio features from a portion of a sequence of media content; (ii) extracting a sequence of video features from the portion of the sequence of media content; (iii) providing the sequence of audio features and the sequence of video features as an input to a transition detector neural network that is configured to classify whether or not a given input includes a transition between different content segments; (iv) obtaining from the transition detector neural network classification data corresponding to the input; (v) determining that the classification data is indicative of a transition between different content segments; and (vi) based on determining that the classification data is indicative of a transition between different content segments, outputting transition data indicating that the portion of the sequence of media content includes a transition between different content segments.
-
公开(公告)号:US20240373093A1
公开(公告)日:2024-11-07
申请号:US18776970
申请日:2024-07-18
Applicant: GRACENOTE, INC.
Inventor: Joseph Renner , Robert Coover , Markus Cremer , Cameron Aubrey Summers
IPC: H04N21/442 , G06F3/16 , G06N3/04 , G06N3/08 , G10L25/30 , G10L25/51 , H03F3/181 , H03G5/16 , H04N9/87 , H04N21/439 , H04N21/45 , H04R3/04
Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for audio equalization. Example instructions disclosed herein cause one or more processors to at least: detect an irregularity in a frequency representation of an audio signal in response to a change in volume between a set of frequency values exceeding a threshold; and adjust a volume at a first frequency value of the set of frequency values to reduce the irregularity.
-
-
-
-
-
-
-
-
-