Abstract:
Systems and methods are provided herein relating to video chunking for robust, progressive upload. Video can be parsed to determine byte offsets associated with prospective chunk boundaries. Chunks can be generated based on the prospective chunk boundaries and a preferred chunk size. Sample tables can be generated for each chunk. The chunks can be fully self-contained, in that they can be received and transcoded independently of other chunks. Thus, if one chunk fails, only that chunk needs to be retransmitted rather than the entire video.
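As an illustration only, the grouping of prospective boundaries into chunks might be sketched as follows. The function names `plan_chunks` and `chunks_to_resend`, and the rule of closing a chunk at the first boundary that reaches the preferred size, are assumptions made for this sketch and are not taken from the disclosure.

```python
from typing import Iterable, List, Tuple

ByteRange = Tuple[int, int]  # (start offset, end offset), end exclusive


def plan_chunks(boundaries: List[int], total_size: int,
                preferred_size: int) -> List[ByteRange]:
    """Group prospective boundaries into byte ranges (hypothetical helper).

    Assumption: a chunk is closed at the first prospective boundary that
    makes it at least `preferred_size` bytes long. The final chunk takes
    whatever bytes remain.
    """
    chunks: List[ByteRange] = []
    start = 0
    for offset in sorted(boundaries):
        # Ignore boundaries that do not advance past the current start
        # or that fall at or beyond the end of the file.
        if offset <= start or offset >= total_size:
            continue
        if offset - start >= preferred_size:
            chunks.append((start, offset))
            start = offset
    if start < total_size:
        chunks.append((start, total_size))
    return chunks


def chunks_to_resend(chunks: List[ByteRange],
                     failed_indices: Iterable[int]) -> List[ByteRange]:
    """Return only the byte ranges whose upload failed."""
    return [chunks[i] for i in sorted(failed_indices)]


chunks = plan_chunks([0, 400, 900, 1500, 2100, 2600],
                     total_size=3000, preferred_size=1000)
retry = chunks_to_resend(chunks, {1})
```

Because each range starts and ends on a prospective boundary, a failed range can be resent on its own without touching its neighbors.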
Abstract:
Systems and methods for characterizing interest points within a descriptor are disclosed herein. The systems generate a set of interest points related to an audio sample. A set of gradients relating to respective interest points in the set of interest points can be generated. A set of descriptors can then be generated based upon the set of interest points and the set of gradients and compared to reference descriptors to identify the audio sample. The disclosed systems and methods provide for an audio matching system robust to pitch-shift distortion by using gradients that characterize the time-frequency neighborhood around an interest point rather than relying solely on the interest points themselves. Thus, the disclosed systems and methods result in more accurate audio identification.
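A minimal sketch of pairing interest points with local gradients is given below. The disclosure does not specify how interest points or gradients are computed. The strict local-maximum test, the central-difference gradient, and the names `interest_points`, `gradient_at`, and `descriptors` are all assumptions chosen for illustration.

```python
from typing import List, Tuple

Spectrogram = List[List[float]]  # spec[t][f] = energy at time t, frequency bin f


def interest_points(spec: Spectrogram) -> List[Tuple[int, int]]:
    """Return interior (t, f) cells strictly greater than their 4 neighbors.

    Assumption: interest points are local energy maxima.
    """
    points = []
    for t in range(1, len(spec) - 1):
        for f in range(1, len(spec[t]) - 1):
            v = spec[t][f]
            if (v > spec[t - 1][f] and v > spec[t + 1][f]
                    and v > spec[t][f - 1] and v > spec[t][f + 1]):
                points.append((t, f))
    return points


def gradient_at(spec: Spectrogram, t: int, f: int) -> Tuple[float, float]:
    """Central-difference gradient along time and frequency (assumed form)."""
    d_time = (spec[t + 1][f] - spec[t - 1][f]) / 2.0
    d_freq = (spec[t][f + 1] - spec[t][f - 1]) / 2.0
    return d_time, d_freq


def descriptors(spec: Spectrogram) -> List[Tuple[int, int, float, float]]:
    """Combine each interest point with the gradient of its neighborhood."""
    return [(t, f) + gradient_at(spec, t, f) for t, f in interest_points(spec)]


toy = [
    [0.0, 0.0, 0.0],
    [0.0, 5.0, 1.0],
    [0.0, 2.0, 0.0],
]
result = descriptors(toy)
```

In this toy layout the gradient records how the energy around the peak is skewed in time and frequency, which is information the peak location alone does not carry.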
Abstract:
A technique for inverted client-side fingerprinting and matching provides the benefits of disposable fingerprinting to identify multiple content streams from multiple clients without overloading a fingerprinting system. Rather than tasking a fingerprinting system with the generation and comparison of all fingerprints, the technique distributes some fingerprinting tasks to the clients receiving the content streams. As a result, the fingerprinting system is not bottlenecked by fingerprinting tasks. In one embodiment, the fingerprinting system can provide additional services to the clients.
Abstract:
Systems and methods are provided herein relating to audio matching. Both melody fingerprints and audio-id fingerprints can be used to improve an audio matching system's resistance to pitch shifts. A melody fingerprint can be used to identify a set of potential melody matches. Varying pitch-shifted audio-id reference fingerprints can be generated for audio-id fingerprints associated with the potential matches identified in melody matching. Additional pitch-shifted audio-id fingerprints of a reference sample are generated and used in matching only if an audio sample has previously been matched to a melody fingerprint of the same reference sample. A reference index need not be expanded to include pitch-shifted variations of each reference sample, because pitch-shifted variations of audio-id reference fingerprints are generated and used only if their associated melody fingerprint is deemed a potential match.
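The two-stage flow can be sketched with toy fingerprints. Everything concrete here is an assumption made for illustration: an audio-id fingerprint is modeled as a tuple of frequency-bin indices, a melody fingerprint as the differences between successive bins (which do not change under transposition), and a pitch shift as a constant offset added to every bin. The names `melody_fingerprint`, `pitch_shift`, and `match` are hypothetical.

```python
from typing import Dict, Iterable, Optional, Tuple

Fingerprint = Tuple[int, ...]


def melody_fingerprint(bins: Fingerprint) -> Fingerprint:
    """Toy pitch-invariant fingerprint: differences between successive bins."""
    return tuple(b - a for a, b in zip(bins, bins[1:]))


def pitch_shift(fp: Fingerprint, shift: int) -> Fingerprint:
    """Toy pitch shift: move every bin by the same offset."""
    return tuple(b + shift for b in fp)


def match(probe: Fingerprint, references: Dict[str, Fingerprint],
          shifts: Iterable[int] = range(-2, 3)) -> Optional[Tuple[str, int]]:
    """Return (reference name, shift) for the first match, else None."""
    shifts = list(shifts)
    # Stage 1: melody matching narrows the index to potential matches.
    probe_melody = melody_fingerprint(probe)
    candidates = [name for name, ref in references.items()
                  if melody_fingerprint(ref) == probe_melody]
    # Stage 2: pitch-shifted audio-id variants are generated on demand,
    # and only for the candidates that survived stage 1.
    for name in candidates:
        for shift in shifts:
            if pitch_shift(references[name], shift) == probe:
                return name, shift
    return None


references = {"song_a": (10, 12, 15, 13), "song_b": (10, 11, 12, 13)}
hit = match((11, 13, 16, 14), references)
miss = match((20, 22, 25, 23), references)
```

The reference dictionary stores one unshifted fingerprint per sample. Shifted variants exist only transiently inside `match`, which mirrors the point that the index itself need not grow.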
Abstract:
Systems and methods described herein can assign a confidence score to a match of unstructured descriptive information with structured reference information in a reference database. The systems and methods can take into account the structured nature of the reference information in assigning the score, thereby facilitating increased confidence in the match, and consequently, facilitating improved database organization and content identification.
Abstract:
A matching system receives probe audio samples for comparison to references in a data store. Comparisons are generated to determine a sufficient match for a portion, or first amount, of the probe sample. Ranking scores are assigned to the resulting match references. The match references are retained unless they meet a score threshold. Comparisons continue to be generated with second amounts of the probe sample, and the retained references are updated with further matching references that are assigned ranking scores. The retained results are merged and, once determined to satisfy a score threshold, released as output matching references.
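One way to picture the retain, merge, and release cycle is the sketch below. The disclosure does not state how scores from successive portions are merged. Summing them, and the name `progressive_match`, are assumptions for this illustration only.

```python
from typing import Dict, Iterable, List, Tuple


def progressive_match(portion_results: Iterable[Dict[str, float]],
                      threshold: float) -> Tuple[List[str], Dict[str, float]]:
    """Merge per-portion ranking scores and release references that qualify.

    portion_results: one dict per probe portion, mapping a reference id to
    the ranking score that portion produced for it.
    Returns (released reference ids in release order, still-retained scores).
    """
    retained: Dict[str, float] = {}
    released: List[str] = []
    for matches in portion_results:
        # Merge step (assumed): accumulate scores for references not yet released.
        for ref, score in matches.items():
            if ref in released:
                continue
            retained[ref] = retained.get(ref, 0.0) + score
        # Release step: anything meeting the threshold leaves the retained set.
        for ref in sorted(retained):
            if retained[ref] >= threshold:
                released.append(ref)
        for ref in released:
            retained.pop(ref, None)
    return released, retained


portions = [
    {"a": 0.4, "b": 0.2},  # first amount of the probe sample
    {"a": 0.7, "c": 0.3},  # second amount
    {"b": 0.1},            # third amount
]
released, retained = progressive_match(portions, threshold=1.0)
```

Here reference "a" is released after the second portion, while "b" and "c" stay retained because their merged scores remain below the threshold.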
Abstract:
This disclosure relates to systems and methods for proactively determining identification information for a plurality of audio segments within a plurality of broadcast media streams, and providing identification information associated with specific audio portions of a broadcast media stream automatically or upon request.