专利检索 ap:("Ajay Divakaran" OR "Ziyou Xiong" OR "Regunathan Radhakrishnan") AND inv:"Regunathan Radhakrishnan" 第 4 页

31.

发明申请
Scalable Media Fingerprint Extraction 有权
标题翻译：可扩展媒体指纹提取

公开(公告)号：US20110268315A1

公开(公告)日：2011-11-03

申请号：US13142355

申请日：2010-01-07

申请人： Claus Bauer , Regunathan Radhakrishnan , Wenyu Jiang , Glenn N. Dickins

发明人： Claus Bauer , Regunathan Radhakrishnan , Wenyu Jiang , Glenn N. Dickins

IPC分类号： G06K9/00

CPC分类号： G06K9/00744 , G06K9/46 , G06K9/623

摘要： Derivation of a fingerprint includes generating feature matrices based on one or more training images, generating projection matrices based on the feature matrices in a training process, and deriving a fingerprint for one or more images by, at least in part, projecting a feature matrix based on the one or more images onto the projection matrices generated in the training process.

摘要翻译： 指纹的推导包括基于一个或多个训练图像生成特征矩阵，基于训练过程中的特征矩阵生成投影矩阵，以及通过至少部分地基于特征矩阵投影来导出一个或多个图像的指纹，在一个或多个图像上，在训练过程中产生的投影矩阵上。

32.

发明授权
Alignment and re-association of metadata for media streams within a computing device 有权
标题翻译：计算设备内媒体流元数据的对齐和重新关联

公开(公告)号：US09075806B2

公开(公告)日：2015-07-07

申请号：US13402718

申请日：2012-02-22

申请人： Wenyu Jiang , Regunathan Radhakrishnan , Claus Bauer

发明人： Wenyu Jiang , Regunathan Radhakrishnan , Claus Bauer

IPC分类号： G06F17/00 , G06F17/30 , G06F3/16 , H03G3/30

CPC分类号： G06F17/30743 , G06F3/16 , G06F17/30017 , G10L19/167 , H03G3/3005

摘要： Techniques for re-associating dynamic metadata with media data are provided. A media processing system creates, with a first media processing stage, binding information comprising dynamic metadata and a time relationship between the dynamic metadata and media data. The binding information may be derived from the media data. While the first media processing stage delivers the media data to a second media processing stage in a first data path, the first media processing stage passes the binding information to the second media processing stage in a second data path. The media processing system re-associates, with the second media processing stage, the dynamic metadata and the media data using the binding information.

摘要翻译： 提供了将动态元数据与媒体数据重新关联的技术。媒体处理系统在第一媒体处理阶段创建包含动态元数据和动态元数据与媒体数据之间的时间关系的绑定信息。可以从媒体数据导出绑定信息。当第一媒体处理阶段将媒体数据传送到第一数据路径中的第二媒体处理阶段时，第一媒体处理阶段将绑定信息传递到第二数据路径中的第二媒体处理阶段。媒体处理系统使用绑定信息在第二媒体处理阶段重新关联动态元数据和媒体数据。

33.

发明申请
Ranking Representative Segments in Media Data 有权
标题翻译：媒体数据中的代表片段排名

公开(公告)号：US20130289756A1

公开(公告)日：2013-10-31

申请号：US13997866

申请日：2011-12-15

申请人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

发明人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

IPC分类号： G06F17/00

CPC分类号： H04R29/00 , G06F17/00 , G06F17/3053 , G10H1/0008 , G10H2210/061 , G10H2240/151 , G10L25/48

摘要： Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.

摘要翻译： 提供了在媒体数据中排列代表片段的技术。可以从媒体数据中提取许多不同类型的媒体特征。可以将多个排名得分分配给多个候选代表段。基于从媒体数据可提取的一种或多种类型的特征，多个候选代表段中的每个候选代表段包括媒体数据的媒体特征中的一个或多个统计模式中的至少一个场景。可以将多个排名得分中的每个个体排名分数分配给多个候选代表段中的个人候选代表段。可以基于多个排名得分从候选代表段中选择要向最终用户播放的代表段。

34.

发明授权
Extracting features of video and audio signal content to provide reliable identification of the signals 有权
标题翻译：提取视频和音频信号内容的特征，提供信号的可靠识别

公开(公告)号：US08259806B2

公开(公告)日：2012-09-04

申请号：US12312840

申请日：2007-11-29

申请人： Regunathan Radhakrishnan , Claus Bauer , Kent Bennett Terry , Brian David Link , Hyung-Suk Kim , Eric Gsell

发明人： Regunathan Radhakrishnan , Claus Bauer , Kent Bennett Terry , Brian David Link , Hyung-Suk Kim , Eric Gsell

IPC分类号： H04N11/02

CPC分类号： G06T1/005 , G06F17/30743 , G06F17/30787 , G06F17/30799 , G06K9/00744 , G06K9/00758 , G06T1/0028 , G10L25/18 , G10L25/54 , G11B2020/10537

摘要： Signatures that can be used to identify video and audio content are generated from the content by generating measures of dissimilarity between features of corresponding groups of pixels in frames of video content and by generating low-resolution time-frequency representations of audio segments. The signatures are generated by applying a hash function to intermediate values derived from the measures of dissimilarity and to the low-resolution time-frequency representations. The generated signatures may be used in a variety of applications such as restoring synchronization between video and audio content streams and identifying copies of original video and audio content. The generated signatures can provide reliable identifications despite intentional and unintentional modifications to the content.

摘要翻译： 可以用于识别视频和音频内容的签名通过在视频内容的帧中产生相应的像素组的特征之间的不相似度量度和通过生成音频段的低分辨率时间频率表示来从内容产生。通过将散列函数应用于从不相似性的度量导出的中间值和低分辨率时间频率表示来生成签名。生成的签名可以用于各种应用中，例如恢复视频和音频内容流之间的同步，并识别原始视频和音频内容的副本。生成的签名可以提供可靠的标识，尽管有意和无意的修改内容。

35.

发明申请
Automatic Generation of Metadata for Audio Dominance Effects 有权
标题翻译：自动生成音频优势效应的元数据

公开(公告)号：US20120201386A1

公开(公告)日：2012-08-09

申请号：US13501086

申请日：2010-10-05

申请人： Jeffrey C. Riedmiller , Regunathan Radhakrishnan , Hannes Muesch

发明人： Jeffrey C. Riedmiller , Regunathan Radhakrishnan , Hannes Muesch

IPC分类号： H04R29/00 , G10L15/00 , H04H20/88

CPC分类号： G11B27/11 , G10L19/008 , G10L19/167 , G11B27/031 , G11B27/105 , G11B27/28 , G11B27/322

摘要： Metadata comprising a set of gain values for creating a dominance effect is automatically generated. Automatically generating the metadata includes receiving multiple audio streams and a dominance criterion for at least one of the audio streams. A set of gains is computed for one or more audio streams based on the dominance criterion for the at least one audio stream and metadata is generated with the set of gains.

摘要翻译： 包括用于产生优势效果的一组增益值的元数据被自动生成。自动生成元数据包括为至少一个音频流接收多个音频流和优势准则。基于用于至少一个音频流的优势准则为一个或多个音频流计算一组增益，并且利用该组增益生成元数据。

36.

发明申请
Robust Media Fingerprints 有权
标题翻译：强大的媒体指纹

公开(公告)号：US20110153050A1

公开(公告)日：2011-06-23

申请号：US13060032

申请日：2009-08-26

申请人： Claus Bauer , Regunathan Radhakrishnan

发明人： Claus Bauer , Regunathan Radhakrishnan

IPC分类号： G06F17/00

CPC分类号： G10L19/018

摘要： Robust media fingerprints are derived from a portion of audio content. A portion of content in an audio signal is categorized. The audio content is characterized based, at least in part, on one or more of its features. The features may include a component that relates to one of several sound categories, e.g., speech and/or noise, which may be mixed with the audio signal. Upon categorizing the audio content as free of the speech or noise related components, the audio signal component is processed. Upon categorizing the audio content as including the speech related component and/or the noise related components, the speech or noise related components are separated from the audio signal. The audio signal is processed independent of the speech related component and/or the noise related component. Processing the audio signal includes computing the audio fingerprint, which ably corresponds to the audio signal.

摘要翻译： 强大的媒体指纹是从音频内容的一部分导出的。对音频信号中的内容的一部分进行分类。音频内容的特征在于，至少部分地基于其一个或多个特征。特征可以包括与几个声音类别中的一个相关联的组件，例如可以与音频信号混合的语音和/或噪声。在将音频内容分类为没有语音或噪声相关组件的情况下，处理音频信号分量。在将音频内容分类为包括语音相关分量和/或噪声相关分量时，语音或噪声相关分量与音频信号分离。音频信号被独立于语音相关分量和/或噪声相关分量进行处理。处理音频信号包括计算音频指纹，其与音频信号相当。

37.

发明申请
Deriving Video Signatures That Are Insensitive to Picture Modification and Frame-Rate Conversion 失效
标题翻译：导出对图像修改和帧速率转换不敏感的视频签名

公开(公告)号：US20100238350A1

公开(公告)日：2010-09-23

申请号：US12600466

申请日：2008-05-01

申请人： Regunathan Radhakrishnan , Claus Bauer

发明人： Regunathan Radhakrishnan , Claus Bauer

IPC分类号： H04N5/14

CPC分类号： G11B27/28 , G06F17/30799 , G06K9/00744 , G06T1/0028 , G06T1/005 , G06T2201/0051 , G06T2201/0061 , H04N19/467

摘要： A signature that can be used to identify video content in a series of video frames is generated by first calculating the average and variance of picture elements in a low-resolution composite image that represents a temporal and spatial composite of the video content in the series of frames. The signature is generated by applying a hash function to values derived from the average and variance composite representations. The video content of a signal can be represented by a set of signatures that are generated for multiple series of frames within the signal. A set of signatures can provide reliable identifications despite intentional and unintentional modifications to the content.

摘要翻译： 可以通过首先计算低分辨率合成图像中的图像元素的平均值和方差来生成可用于识别一系列视频帧中的视频内容的签名，该复合图像表示该系列视频内容中的视频内容的时间和空间复合框架。通过将散列函数应用于从平均和方差复合表示中导出的值来生成签名。信号的视频内容可以由为信号内的多个帧系列生成的一组签名来表示。尽管有意和无意地修改内容，一组签名可以提供可靠的标识。

38.

发明授权
Adaptive processing with multiple media processing nodes 有权

公开(公告)号：US09842596B2

公开(公告)日：2017-12-12

申请号：US13989256

申请日：2011-12-01

申请人： Jeffrey Riedmiller , Regunathan Radhakrishnan , Marvin Pribadi , Farhad Farahani , Michael Smithers

发明人： Jeffrey Riedmiller , Regunathan Radhakrishnan , Marvin Pribadi , Farhad Farahani , Michael Smithers

IPC分类号： G06F17/00 , G10L19/008 , G10L21/00

CPC分类号： G10L19/008 , G10L19/167 , G10L21/00

摘要： Techniques for adaptive processing of media data based on separate data specifying a state of the media data are provided. A device in a media processing chain may determine whether a type of media processing has already been performed on an input version of media data. If so, the device may adapt its processing of the media data to disable performing the type of media processing. If not, the device performs the type of media processing. The device may create a state of the media data specifying the type of media processing. The device may communicate the state of the media data and an output version of the media data to a recipient device in the media processing chain, for the purpose of supporting the recipient device's adaptive processing of the media data.

39.

发明授权
Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols 有权
标题翻译：音频编码方法和系统，用于通过实现不同解码协议的解码器生成统一的比特流解码

公开(公告)号：US09378743B2

公开(公告)日：2016-06-28

申请号：US14009503

申请日：2012-04-05

申请人： Jeffrey C. Riedmiller , Farhad Farahani , Michael Schug , Regunathan Radhakrishnan , Mark S. Vinton

发明人： Jeffrey C. Riedmiller , Farhad Farahani , Michael Schug , Regunathan Radhakrishnan , Mark S. Vinton

IPC分类号： G10L19/008 , G10L19/002 , G10L19/16

CPC分类号： G10L19/002 , G10L19/167

摘要： In a class of embodiments, an audio encoding system (typically, a perceptual encoding system that is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.

摘要翻译： 在一类实施例中，音频编码系统（通常是感知编码系统，其被配置为生成与第一解码器兼容的（即可解码的）单个（“统一”）比特流，第一解码器被配置为对根据第一编码协议（例如，多频道杜比数字+或DD +协议）和被配置为对根据第二编码协议（例如立体声AAC，HE AAC v1或HE）编码的音频数据进行解码的第二解码器统一比特流可以包括可由第一解码器解码（并由第二解码器忽略）的可编码数据（例如，数据突发）和由第二解码器解码的编码数据（例如，其他数据突发）并且被第一解码器忽略），实际上，当第一解码器对比特流进行解码时，第二编码格式被隐藏在统一比特流内，并且当比特流中第一编码格式被隐藏在统一比特流内时令牌由第二解码器解码。根据本发明生成的统一比特流的格式可以消除在整个媒体链和/或生态系统中对代码转换元素的需要。本发明的其他方面是由本发明编码器的任何实施例执行的编码方法，由本发明解码器的任何实施例执行的解码方法，以及存储用于实现本发明的任何实施例的代码的计算机可读介质（例如，盘）方法。

40.

发明授权
Ranking representative segments in media data 有权
标题翻译：在媒体数据中排列代表性细分

公开(公告)号：US09313593B2

公开(公告)日：2016-04-12

申请号：US13997866

申请日：2011-12-15

申请人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

发明人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

IPC分类号： H04R29/00 , G10H1/00 , G06F17/30 , G06F17/00 , G10L25/48

CPC分类号： H04R29/00 , G06F17/00 , G06F17/3053 , G10H1/0008 , G10H2210/061 , G10H2240/151 , G10L25/48

摘要： Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.

摘要翻译： 提供了在媒体数据中排列代表片段的技术。可以从媒体数据中提取许多不同类型的媒体特征。可以将多个排名得分分配给多个候选代表段。基于从媒体数据可提取的一种或多种类型的特征，多个候选代表段中的每个候选代表段包括媒体数据的媒体特征中的一个或多个统计模式中的至少一个场景。可以将多个排名得分中的每个个体排名分数分配给多个候选代表段中的个人候选代表段。可以基于多个排名得分从候选代表段中选择要向最终用户播放的代表段。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类