专利检索 ap:("Matthew Sharifi" OR "George Tzanetakis" OR "Annie Chen" OR "Dominik Roblek") AND inv:"Dominik Roblek" 第 1 页

1.

发明授权
Frequency ratio fingerprint characterization for audio matching 有权
标题翻译：频率比指纹表征音频匹配

公开(公告)号：US08886543B1

公开(公告)日：2014-11-11

申请号：US13296899

申请日：2011-11-15

申请人： Matthew Sharifi , George Tzanetakis , Annie Chen , Dominik Roblek

发明人： Matthew Sharifi , George Tzanetakis , Annie Chen , Dominik Roblek

IPC分类号： G10L11/00

CPC分类号： G10L19/018

摘要： System and methods for characterizing interest points within a fingerprint are disclosed herein. The systems include generating a set of interest points and an anchor point related to an audio sample. A quantized absolute frequency of an anchor point can be calculated and used to calculate a set of quantized ratios. A fingerprint can then be generated based upon the set of quantized ratios and used in comparison to reference fingerprints to identify the audio sample. The disclosed systems and methods provide for an audio matching system robust to pitch-shift distortion by using quantized ratios within fingerprints rather than solely using absolute frequencies of interest points. Thus, the disclosed system and methods result in more accurate audio identification.

摘要翻译： 本文公开了用于表征指纹内的兴趣点的系统和方法。系统包括产生一组感兴趣点和与音频样本相关的定位点。锚定点的量化绝对频率可以被计算并用于计算一组量化比率。然后可以基于所述量化比率的集合生成指纹，并且与参考指纹进行比较以用于识别音频样本。所公开的系统和方法通过使用指纹内的量化比率而不是仅使用感兴趣点的绝对频率来提供对音调偏移失真鲁棒的音频匹配系统。因此，所公开的系统和方法导致更准确的音频识别。

2.

发明授权
Noise based interest point density pruning 有权
标题翻译：基于噪声的兴趣点密度修剪

公开(公告)号：US08805560B1

公开(公告)日：2014-08-12

申请号：US13276318

申请日：2011-10-18

申请人： George Tzanetakis , Dominik Roblek , Matthew Sharifi

发明人： George Tzanetakis , Dominik Roblek , Matthew Sharifi

IPC分类号： G10L11/00 , G10L21/02

CPC分类号： G06F17/30743 , G06F17/30758 , G10L25/54

摘要： Systems and methods for noise based interest point density pruning are disclosed herein. The systems include determining an amount of noise in an audio sample and adjusting the amount of interest points within an audio sample fingerprint based on the amount of noise. Samples containing high amounts of noise correspondingly generate fingerprints with more interest points. The disclosed systems and methods allow reference fingerprints to be reduced in size while increasing the size of sample fingerprints. The benefits in scalability do not compromise the accuracy of an audio matching system using noise based interest point density pruning.

摘要翻译： 本文公开了基于噪声的兴趣点密度修剪的系统和方法。系统包括确定音频样本中的噪声量，并且基于噪声量来调整音频样本指纹内的兴趣点的数量。含有大量噪声的样本相应地产生了具有更多兴趣点的指纹。所公开的系统和方法允许参考指纹的尺寸减小，同时增加样本指纹的大小。可扩展性的优点不会影响使用基于噪声的兴趣点密度修剪的音频匹配系统的准确性。

3.

发明授权
Magnitude ratio descriptors for pitch-resistant audio matching 有权
标题翻译：用于音高匹配的音高比例描述符

公开(公告)号：US09202472B1

公开(公告)日：2015-12-01

申请号：US13434832

申请日：2012-03-29

申请人： Matthew Sharifi , Dominik Roblek , George Tzanetakis

发明人： Matthew Sharifi , Dominik Roblek , George Tzanetakis

IPC分类号： G06F17/30 , G10L19/018

CPC分类号： G10L19/018 , G10L25/03

摘要： Systems and methods for generating unique pitch-resistant descriptors for audio clips are provided. In one or more embodiments, a descriptor for an audio clip is generated as a function of relative magnitudes between interest points within the audio clip's time-frequency representation. A number of techniques for leveraging the relative magnitudes to generate descriptors are considered. These techniques include ordering of interest points as a function of ascending or descending magnitude, creation of binary vectors based on magnitude comparisons between pairs of points, and calculation of quantized magnitude ratios between pairs of points. Descriptors generated based on relative magnitudes according to the techniques disclosed herein are relatively invariant to common transformations to the original audio clip, such as pitch shifting, time stretching, global volume changes, equalization, and/or dynamic range compression.

摘要翻译： 提供了用于为音频剪辑生成独特的音高描述符的系统和方法。在一个或多个实施例中，音频剪辑的描述符作为音频剪辑的时间 - 频率表示内的兴趣点之间的相对幅度的函数被生成。考虑了利用相对幅度来生成描述符的许多技术。这些技术包括将兴趣点排序为上升或下降幅度的函数，基于点对之间的幅度比较的二进制向量的创建以及点对之间的量化幅度比的计算。基于根据本文公开的技术的相对幅度生成的描述符对于原始音频剪辑的常见变换（例如音调偏移，时间延伸，全局音量变化，均衡和/或动态范围压缩）是相对不变的。

4.

发明授权
Transformation invariant media matching 有权
标题翻译：转换不变媒体匹配

公开(公告)号：US08738633B1

公开(公告)日：2014-05-27

申请号：US13362905

申请日：2012-01-31

申请人： Matthew Sharifi , Sergey Ioffe , Jay Yagnik , Gheorghe Postelnicu , Dominik Roblek , George Tzanetakis

发明人： Matthew Sharifi , Sergey Ioffe , Jay Yagnik , Gheorghe Postelnicu , Dominik Roblek , George Tzanetakis

IPC分类号： G06F17/30

CPC分类号： G06K9/6267 , G06F17/3002 , G06F17/30244 , G06K9/00013

摘要： This disclosure relates to transformation invariant media matching. A fingerprinting component can generate a transformation invariant identifier for media content by adaptively encoding the relative ordering of interest points in media content. The interest points can be grouped into subsets, and stretch invariant descriptors can be generated for the subsets based on ratios of coordinates of interest points included in the subsets. The stretch invariant descriptors can be aggregated into a transformation invariant identifier. An identification component compares the identifier against a set of identifiers for known media content, and the media content can be matched or identified as a function of the comparison.

摘要翻译： 本公开涉及变换不变媒体匹配。指纹分量可以通过对媒体内容中的兴趣点的相对排序进行自适应编码来生成媒体内容的变换不变标识符。可以将兴趣点分组为子集，并且可以基于子集中包括的兴趣点坐标的比例为子集生成拉伸不变描述符。拉伸不变描述符可以聚合成变换不变标识符。识别部件将标识符与已知媒体内容的一组标识符进行比较，并且媒体内容可以作为比较的函数进行匹配或标识。

5.

发明授权
Ensemble interest point detection for audio matching 有权
标题翻译：音乐匹配的集合兴趣点检测

公开(公告)号：US09098576B1

公开(公告)日：2015-08-04

申请号：US13274725

申请日：2011-10-17

申请人： Matthew Sharifi , Gheorghe Postelnicu , George Tzanetakis , Dominik Roblek

发明人： Matthew Sharifi , Gheorghe Postelnicu , George Tzanetakis , Dominik Roblek

IPC分类号： G06F17/00 , G06F17/30

CPC分类号： G06F17/30743 , G06F17/3074

摘要： Systems and methods for audio matching are disclosed herein. In one embodiment, a system includes both interest point mixing and fingerprint mixing by using multiple interest point detection methods in parallel. Since multiple interest point detection methods are used in parallel, accuracy of audio matching is improved across a wide variety of audio signals. In addition the scalability of the disclosed audio matching system is increased by matching the fingerprint of an audio sample with a fingerprint of a reference sample versus matching an entire spectrogram. Accordingly, a more accurate and more general solution to audio matching can be accomplished.

摘要翻译： 本文公开了用于音频匹配的系统和方法。在一个实施例中，系统通过并行使用多个兴趣点检测方法来包括兴趣点混合和指纹混合。由于并行地使用多个兴趣点检测方法，因此在多种音频信号中提高了音频匹配的精度。此外，通过将音频样本的指纹与参考样本的指纹匹配以匹配整个频谱图来增加所公开的音频匹配系统的可扩展性。因此，可以实现更准确和更一般的音频匹配解决方案。

6.

发明授权
Intelligent interest point pruning for audio matching 有权
标题翻译：智能兴趣点修剪音频匹配

公开(公告)号：US08831763B1

公开(公告)日：2014-09-09

申请号：US13276316

申请日：2011-10-18

申请人： Matthew Sharifi , Gheorghe Postelnicu , George Tzanetakis , Dominik Roblek

发明人： Matthew Sharifi , Gheorghe Postelnicu , George Tzanetakis , Dominik Roblek

IPC分类号： G10L15/20 , G10L15/02

CPC分类号： G10L25/54 , G10L25/87

摘要： System and methods for intelligently pruning interest points are disclosed herein. The systems include generating a plurality of distorted audio samples and associated distorted interest points based upon a clean audio sample. Interest points that are common to sets of distorted interest points are retained with interest points not robust to distortion discarded. The disclosed systems and methods therefore can provide for a scalable audio matching solution by eliminating interest points in reference sample fingerprints. The set of pruned interest points are robust to distortion and the benefits of both scalability and accuracy can be had.

摘要翻译： 本文公开了用于智能修剪兴趣点的系统和方法。系统包括基于干净的音频样本产生多个失真的音频样本和相关联的失真的兴趣点。对于一组扭曲的兴趣点常见的兴趣点被保留，对于丢弃的失真不利于兴趣点。因此，所公开的系统和方法可以通过消除参考样本指纹中的兴趣点来提供可扩展的音频匹配解决方案。修剪的兴趣点的集合对于失真是稳健的，并且可以实现可扩展性和准确性的优点。

7.

发明授权
Inverted client-side fingerprinting and matching 有权
标题翻译：反向客户端指纹和匹配

公开(公告)号：US09113202B1

公开(公告)日：2015-08-18

申请号：US13239138

申请日：2011-09-21

申请人： Matthew Wiseman , Matthew Sharifi , Yaniv Bernstein , Annie Chen , Dominik Roblek

发明人： Matthew Wiseman , Matthew Sharifi , Yaniv Bernstein , Annie Chen , Dominik Roblek

IPC分类号： H04N21/439 , H04L7/08 , H04N21/84

CPC分类号： H04N21/4394 , G06F17/30743 , G06K9/00744 , G06K9/00758 , H04L7/08 , H04N21/8358 , H04N21/84

摘要： A technique for inverted client side fingerprinting and matching provides the benefits of disposable fingerprinting to identify multiple content streams from multiple clients without overloading a fingerprinting system. Rather than tasking a fingerprinting system with the generation and comparison of all fingerprints, the technique distributes some fingerprinting tasks to the clients receiving the content streams. As a result, the fingerprinting system is not bottlenecked by fingerprinting tasks. In one embodiment, the fingerprinting system can provide additional services to the clients.

摘要翻译： 用于反向客户端指纹和匹配的技术提供了一次性指纹识别的优点，以便从多个客户端识别多个内容流而不会使指纹系统过载。该指纹技术将指纹识别任务分配给接收内容流的客户端，而不是通过生成和比较所有指纹来对指纹系统进行任务。因此，指纹识别系统不会由于指纹识别任务的瓶颈。在一个实施例中，指纹系统可以向客户端提供附加服务。

8.

发明授权
Real-time audio recognition protocol 有权
标题翻译：实时音频识别协议

公开(公告)号：US08805683B1

公开(公告)日：2014-08-12

申请号：US13404978

申请日：2012-02-24

申请人： Matthew Wiseman , Yaniv Bernstein , Daniel Switkin , Gheorghe M. Postelnicu , Matthew Sharifi , Annie Chen , Dominik Roblek

发明人： Matthew Wiseman , Yaniv Bernstein , Daniel Switkin , Gheorghe M. Postelnicu , Matthew Sharifi , Annie Chen , Dominik Roblek

IPC分类号： G10L15/00

CPC分类号： G10L15/22 , G06F17/30758 , G10L15/30 , G10L25/54

摘要： An audio recognition service recognizes an audio sample across multiple content types. At least a partial set of results generated by the service are returned to a client while the audio sample is still being recorded and/or transmitted. The client additionally displays the results in real-time or near real-time to the user. The audio sample can be sent over a first HTTP connection and the results can be returned over a second HTTP connection. The audio recognition service further processes check-in selections received from the client for content items indicated by the results. Responsive to receiving the check-in selections, the service determines whether a user is eligible for a reward. If the user is eligible, the service provides the reward.

摘要翻译： 音频识别服务识别多种内容类型的音频样本。当音频样本仍然被记录和/或发送时，由服务产生的至少一部分结果返回给客户机。客户端另外向用户实时或接近实时显示结果。音频样本可以通过第一个HTTP连接发送，并且可以通过第二个HTTP连接返回结果。音频识别服务进一步处理从客户端接收的用于由结果指示的内容项的登记选择。响应于接收签入选择，该服务确定用户是否有资格获得奖励。如果用户符合条件，则该服务将提供奖励。

9.

发明申请
DYNAMIC DISPLAY OF CONTENT CONSUMPTION BY GEOGRAPHIC LOCATION 有权
标题翻译：通过地理位置动态显示内容消费

公开(公告)号：US20130235027A1

公开(公告)日：2013-09-12

申请号：US13417598

申请日：2012-03-12

申请人： Matthew Sharifi , Annie Chen , Dominik Roblek

发明人： Matthew Sharifi , Annie Chen , Dominik Roblek

IPC分类号： G06F17/00

CPC分类号： G06F17/30241 , G06F17/3053 , G06F17/30817 , G06F17/30867 , G06F17/3087 , G06Q10/0637 , G09B29/006 , G09B29/007

摘要： This disclosure relates to dynamic display of content consumption by geographic location. A recognition component recognizes content being consumed by a set of users, and identifies geographic locations of the consumption and a set of characteristics associated with the consumption. An aggregation component ranks the consumed content based on a subset of the characteristics associated with the consumption, and a display component generates a map displaying subsets of the consumed content as a function of respective rankings and geographic location.

摘要翻译： 本公开涉及通过地理位置动态显示内容消费。识别组件识别由一组用户消费的内容，并且识别消费的地理位置和与消费相关联的一组特征。聚合组件基于与消费相关联的特征的子集来排列消耗的内容，并且显示组件生成显示作为相应排名和地理位置的函数的消费内容的子集的映射。

10.

发明授权
Real-time audio recognition using multiple recognizers 有权
标题翻译：使用多个识别器实时音频识别

公开(公告)号：US09384734B1

公开(公告)日：2016-07-05

申请号：US13404971

申请日：2012-02-24

申请人： Matthew Wiseman , Gheorghe M. Postelnicu , Dominik Roblek , Yaniv Bernstein , Matthew Sharifi , Annie Chen

发明人： Matthew Wiseman , Gheorghe M. Postelnicu , Dominik Roblek , Yaniv Bernstein , Matthew Sharifi , Annie Chen

IPC分类号： G10L15/26

CPC分类号： G10L15/26 , G06F17/3074 , G06F17/30743 , G06F17/30766 , G06F17/30769 , G10H1/0008 , G10H1/0033 , G10H2240/141 , G10L15/00 , G10L15/265 , G10L17/00 , G10L25/51

摘要： An audio recognition service recognizes an audio sample across multiple content types. At least a partial set of results generated by the service are returned to a client while the audio sample is still being recorded and/or transmitted. The client additionally displays the results in real-time or near real-time to the user. The audio sample can be sent over a first HTTP connection and the results can be returned over a second HTTP connection. The audio recognition service further processes check-in selections received from the client for content items indicated by the results. Responsive to receiving the check-in selections, the service determines whether a user is eligible for a reward. If the user is eligible, the service provides the reward.

摘要翻译： 音频识别服务识别多种内容类型的音频样本。当音频样本仍然被记录和/或发送时，由服务产生的至少一部分结果返回给客户机。客户端另外向用户实时或接近实时显示结果。音频样本可以通过第一个HTTP连接发送，并且可以通过第二个HTTP连接返回结果。音频识别服务进一步处理从客户端接收的用于由结果指示的内容项的登记选择。响应于接收签入选择，该服务确定用户是否有资格获得奖励。如果用户符合条件，则该服务将提供奖励。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类