专利检索 ap:("Rongshan Yu" OR "Regunathan Radhakrishnan" OR "Robert Andersen" OR "Grant Davidson") AND inv:"Regunathan Radhakrishnan" 第 3 页

21.

发明授权
Alignment and re-association of metadata for media streams within a computing device 有权
标题翻译：计算设备内媒体流元数据的对齐和重新关联

公开(公告)号：US09075806B2

公开(公告)日：2015-07-07

申请号：US13402718

申请日：2012-02-22

申请人： Wenyu Jiang , Regunathan Radhakrishnan , Claus Bauer

发明人： Wenyu Jiang , Regunathan Radhakrishnan , Claus Bauer

IPC分类号： G06F17/00 , G06F17/30 , G06F3/16 , H03G3/30

CPC分类号： G06F17/30743 , G06F3/16 , G06F17/30017 , G10L19/167 , H03G3/3005

摘要： Techniques for re-associating dynamic metadata with media data are provided. A media processing system creates, with a first media processing stage, binding information comprising dynamic metadata and a time relationship between the dynamic metadata and media data. The binding information may be derived from the media data. While the first media processing stage delivers the media data to a second media processing stage in a first data path, the first media processing stage passes the binding information to the second media processing stage in a second data path. The media processing system re-associates, with the second media processing stage, the dynamic metadata and the media data using the binding information.

摘要翻译： 提供了将动态元数据与媒体数据重新关联的技术。媒体处理系统在第一媒体处理阶段创建包含动态元数据和动态元数据与媒体数据之间的时间关系的绑定信息。可以从媒体数据导出绑定信息。当第一媒体处理阶段将媒体数据传送到第一数据路径中的第二媒体处理阶段时，第一媒体处理阶段将绑定信息传递到第二数据路径中的第二媒体处理阶段。媒体处理系统使用绑定信息在第二媒体处理阶段重新关联动态元数据和媒体数据。

22.

发明申请
Ranking Representative Segments in Media Data 有权
标题翻译：媒体数据中的代表片段排名

公开(公告)号：US20130289756A1

公开(公告)日：2013-10-31

申请号：US13997866

申请日：2011-12-15

申请人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

发明人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

IPC分类号： G06F17/00

CPC分类号： H04R29/00 , G06F17/00 , G06F17/3053 , G10H1/0008 , G10H2210/061 , G10H2240/151 , G10L25/48

摘要： Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.

摘要翻译： 提供了在媒体数据中排列代表片段的技术。可以从媒体数据中提取许多不同类型的媒体特征。可以将多个排名得分分配给多个候选代表段。基于从媒体数据可提取的一种或多种类型的特征，多个候选代表段中的每个候选代表段包括媒体数据的媒体特征中的一个或多个统计模式中的至少一个场景。可以将多个排名得分中的每个个体排名分数分配给多个候选代表段中的个人候选代表段。可以基于多个排名得分从候选代表段中选择要向最终用户播放的代表段。

23.

发明授权
Extracting features of video and audio signal content to provide reliable identification of the signals 有权
标题翻译：提取视频和音频信号内容的特征，提供信号的可靠识别

公开(公告)号：US08259806B2

公开(公告)日：2012-09-04

申请号：US12312840

申请日：2007-11-29

申请人： Regunathan Radhakrishnan , Claus Bauer , Kent Bennett Terry , Brian David Link , Hyung-Suk Kim , Eric Gsell

发明人： Regunathan Radhakrishnan , Claus Bauer , Kent Bennett Terry , Brian David Link , Hyung-Suk Kim , Eric Gsell

IPC分类号： H04N11/02

CPC分类号： G06T1/005 , G06F17/30743 , G06F17/30787 , G06F17/30799 , G06K9/00744 , G06K9/00758 , G06T1/0028 , G10L25/18 , G10L25/54 , G11B2020/10537

摘要： Signatures that can be used to identify video and audio content are generated from the content by generating measures of dissimilarity between features of corresponding groups of pixels in frames of video content and by generating low-resolution time-frequency representations of audio segments. The signatures are generated by applying a hash function to intermediate values derived from the measures of dissimilarity and to the low-resolution time-frequency representations. The generated signatures may be used in a variety of applications such as restoring synchronization between video and audio content streams and identifying copies of original video and audio content. The generated signatures can provide reliable identifications despite intentional and unintentional modifications to the content.

摘要翻译： 可以用于识别视频和音频内容的签名通过在视频内容的帧中产生相应的像素组的特征之间的不相似度量度和通过生成音频段的低分辨率时间频率表示来从内容产生。通过将散列函数应用于从不相似性的度量导出的中间值和低分辨率时间频率表示来生成签名。生成的签名可以用于各种应用中，例如恢复视频和音频内容流之间的同步，并识别原始视频和音频内容的副本。生成的签名可以提供可靠的标识，尽管有意和无意的修改内容。

24.

发明申请
Automatic Generation of Metadata for Audio Dominance Effects 有权
标题翻译：自动生成音频优势效应的元数据

公开(公告)号：US20120201386A1

公开(公告)日：2012-08-09

申请号：US13501086

申请日：2010-10-05

申请人： Jeffrey C. Riedmiller , Regunathan Radhakrishnan , Hannes Muesch

发明人： Jeffrey C. Riedmiller , Regunathan Radhakrishnan , Hannes Muesch

IPC分类号： H04R29/00 , G10L15/00 , H04H20/88

CPC分类号： G11B27/11 , G10L19/008 , G10L19/167 , G11B27/031 , G11B27/105 , G11B27/28 , G11B27/322

摘要： Metadata comprising a set of gain values for creating a dominance effect is automatically generated. Automatically generating the metadata includes receiving multiple audio streams and a dominance criterion for at least one of the audio streams. A set of gains is computed for one or more audio streams based on the dominance criterion for the at least one audio stream and metadata is generated with the set of gains.

摘要翻译： 包括用于产生优势效果的一组增益值的元数据被自动生成。自动生成元数据包括为至少一个音频流接收多个音频流和优势准则。基于用于至少一个音频流的优势准则为一个或多个音频流计算一组增益，并且利用该组增益生成元数据。

25.

发明申请
Robust Media Fingerprints 有权
标题翻译：强大的媒体指纹

公开(公告)号：US20110153050A1

公开(公告)日：2011-06-23

申请号：US13060032

申请日：2009-08-26

申请人： Claus Bauer , Regunathan Radhakrishnan

发明人： Claus Bauer , Regunathan Radhakrishnan

IPC分类号： G06F17/00

CPC分类号： G10L19/018

摘要： Robust media fingerprints are derived from a portion of audio content. A portion of content in an audio signal is categorized. The audio content is characterized based, at least in part, on one or more of its features. The features may include a component that relates to one of several sound categories, e.g., speech and/or noise, which may be mixed with the audio signal. Upon categorizing the audio content as free of the speech or noise related components, the audio signal component is processed. Upon categorizing the audio content as including the speech related component and/or the noise related components, the speech or noise related components are separated from the audio signal. The audio signal is processed independent of the speech related component and/or the noise related component. Processing the audio signal includes computing the audio fingerprint, which ably corresponds to the audio signal.

摘要翻译： 强大的媒体指纹是从音频内容的一部分导出的。对音频信号中的内容的一部分进行分类。音频内容的特征在于，至少部分地基于其一个或多个特征。特征可以包括与几个声音类别中的一个相关联的组件，例如可以与音频信号混合的语音和/或噪声。在将音频内容分类为没有语音或噪声相关组件的情况下，处理音频信号分量。在将音频内容分类为包括语音相关分量和/或噪声相关分量时，语音或噪声相关分量与音频信号分离。音频信号被独立于语音相关分量和/或噪声相关分量进行处理。处理音频信号包括计算音频指纹，其与音频信号相当。

26.

发明申请
Deriving Video Signatures That Are Insensitive to Picture Modification and Frame-Rate Conversion 失效
标题翻译：导出对图像修改和帧速率转换不敏感的视频签名

公开(公告)号：US20100238350A1

公开(公告)日：2010-09-23

申请号：US12600466

申请日：2008-05-01

申请人： Regunathan Radhakrishnan , Claus Bauer

发明人： Regunathan Radhakrishnan , Claus Bauer

IPC分类号： H04N5/14

CPC分类号： G11B27/28 , G06F17/30799 , G06K9/00744 , G06T1/0028 , G06T1/005 , G06T2201/0051 , G06T2201/0061 , H04N19/467

摘要： A signature that can be used to identify video content in a series of video frames is generated by first calculating the average and variance of picture elements in a low-resolution composite image that represents a temporal and spatial composite of the video content in the series of frames. The signature is generated by applying a hash function to values derived from the average and variance composite representations. The video content of a signal can be represented by a set of signatures that are generated for multiple series of frames within the signal. A set of signatures can provide reliable identifications despite intentional and unintentional modifications to the content.

摘要翻译： 可以通过首先计算低分辨率合成图像中的图像元素的平均值和方差来生成可用于识别一系列视频帧中的视频内容的签名，该复合图像表示该系列视频内容中的视频内容的时间和空间复合框架。通过将散列函数应用于从平均和方差复合表示中导出的值来生成签名。信号的视频内容可以由为信号内的多个帧系列生成的一组签名来表示。尽管有意和无意地修改内容，一组签名可以提供可靠的标识。

27.

发明授权
Multimedia event detection and summarization 失效
标题翻译：多媒体事件检测与总结

公开(公告)号：US07409407B2

公开(公告)日：2008-08-05

申请号：US10840824

申请日：2004-05-07

申请人： Regunathan Radhakrishnan , Ajay Divakaran

发明人： Regunathan Radhakrishnan , Ajay Divakaran

IPC分类号： G06F17/30 , G06F17/00

CPC分类号： G06F17/30787 , G06F17/30802 , G06F17/30808 , G06F17/30811 , G06F17/30843 , G06K9/00711 , Y10S707/99943 , Y10S707/99945

摘要： A method detects events in multimedia. Features are extracted from the multimedia. The features are sampled using a sliding window to obtain samples. A context model is constructed for each sample. An affinity matrix is determined from the models and a commutative distance metric between each pair of context models. A second generation eigenvector is determined for the affinity matrix, and the samples are then clustered into events according to the second generation eigenvector.

摘要翻译： 一种方法来检测多媒体中的事件。功能从多媒体提取。使用滑动窗口对特征进行采样以获得样品。为每个样本构建上下文模型。从模型和每对上下文模型之间的交换距离度量确定亲和度矩阵。针对亲和度矩阵确定第二代特征向量，然后根据第二代特征向量将样本聚类成事件。

28.

发明申请
Video presentation using compositional structures 审中-公开
标题翻译：使用组合结构的视频演示

公开(公告)号：US20060075346A1

公开(公告)日：2006-04-06

申请号：US10951192

申请日：2004-09-27

申请人： Tom Lanning , Ajay Divakaran , Kadir Peker , Regunathan Radhakrishnan , Ziyou Xiong , Clifton Forlines

发明人： Tom Lanning , Ajay Divakaran , Kadir Peker , Regunathan Radhakrishnan , Ziyou Xiong , Clifton Forlines

IPC分类号： G11B27/00

CPC分类号： G11B19/025 , G11B27/105 , G11B27/107 , G11B27/11 , G11B27/28 , G11B27/34 , G11B2220/20 , G11B2220/65 , G11B2220/90 , H04N5/85 , H04N9/8042 , H04N9/8233 , H04N21/42646 , H04N21/4312 , H04N21/4314 , H04N21/4325 , H04N21/812 , H04N21/84 , H04N21/8456

摘要： A method presents a video according to compositional structures associated with the video. Each compositional structure has a label, and multiple segments that can be organized temporally or hierarchically. A particular compositional structure is selected with a remote controller, and the video is presented by a playback controller on a display device according to the compositional structure.

摘要翻译： 一种方法根据与视频相关联的组合结构呈现视频。每个组成结构都有一个标签，并且可以在时间上或分层上组织的多个片段。利用遥控器选择特定的组成结构，根据组成结构，视频由显示设备上的播放控制器呈现。

29.

发明申请
Audio-visual highlights detection using coupled hidden markov models 审中-公开
标题翻译：使用耦合的隐马尔可夫模型的视听亮点检测

公开(公告)号：US20050125223A1

公开(公告)日：2005-06-09

申请号：US10729164

申请日：2003-12-05

申请人： Ajay Divakaran , Ziyou Xiong , Regunathan Radhakrishnan

发明人： Ajay Divakaran , Ziyou Xiong , Regunathan Radhakrishnan

IPC分类号： G06F17/30 , G06K9/00 , G06K9/20 , G06K9/62 , G06T7/00 , G10L11/00 , G10L15/00 , G10L15/04 , G10L15/10 , G10L15/26 , G10L17/00 , G10L19/12 , H04N5/76

CPC分类号： G06K9/00711 , G06F16/739 , G06F16/7834 , G06F16/786 , G06K9/6297

摘要： A method uses probabilistic fusion to detect highlights in videos using both audio and visual information. Specifically, the method uses coupled hidden Markov models (CHMMs). Audio labels are generated using audio classification via Gaussian mixture models (GMMs), and visual labels are generated by quantizing average motion vector magnitudes. Highlights are modeled using discrete-observation CHMMs trained with labeled videos. The CHMMs have better performance than conventional hidden Markov models (HMMs) trained only on audio signals, or only on video frames.

摘要翻译： 一种方法使用概率融合来检测使用音频和视觉信息的视频中的高光。具体来说，该方法使用耦合的隐马尔可夫模型（CHMM）。使用高斯混合模型（GMM）的音频分类生成音频标签，并且通过量化平均运动矢量幅度来生成视觉标签。亮点是使用用标记视频训练的离散观察CHMM进行建模。 CHMM具有比仅在音频信号上训练的传统隐马尔可夫模型（HMM）更好的性能，或仅在视频帧上训练。

30.

发明授权
Adaptive processing with multiple media processing nodes 有权

公开(公告)号：US09842596B2

公开(公告)日：2017-12-12

申请号：US13989256

申请日：2011-12-01

申请人： Jeffrey Riedmiller , Regunathan Radhakrishnan , Marvin Pribadi , Farhad Farahani , Michael Smithers

发明人： Jeffrey Riedmiller , Regunathan Radhakrishnan , Marvin Pribadi , Farhad Farahani , Michael Smithers

IPC分类号： G06F17/00 , G10L19/008 , G10L21/00

CPC分类号： G10L19/008 , G10L19/167 , G10L21/00

摘要： Techniques for adaptive processing of media data based on separate data specifying a state of the media data are provided. A device in a media processing chain may determine whether a type of media processing has already been performed on an input version of media data. If so, the device may adapt its processing of the media data to disable performing the type of media processing. If not, the device performs the type of media processing. The device may create a state of the media data specifying the type of media processing. The device may communicate the state of the media data and an output version of the media data to a recipient device in the media processing chain, for the purpose of supporting the recipient device's adaptive processing of the media data.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类