专利检索 ap:("Regunathan Radhakrishnan" OR "Jeffrey Riedmiller" OR "Claus Bauer" OR "Wenyu Jiang") AND inv:"Regunathan Radhakrishnan" 第 5 页

41.

发明申请
Video presentation using compositional structures 审中-公开
标题翻译：使用组合结构的视频演示

公开(公告)号：US20060075346A1

公开(公告)日：2006-04-06

申请号：US10951192

申请日：2004-09-27

申请人： Tom Lanning , Ajay Divakaran , Kadir Peker , Regunathan Radhakrishnan , Ziyou Xiong , Clifton Forlines

发明人： Tom Lanning , Ajay Divakaran , Kadir Peker , Regunathan Radhakrishnan , Ziyou Xiong , Clifton Forlines

IPC分类号： G11B27/00

CPC分类号： G11B19/025 , G11B27/105 , G11B27/107 , G11B27/11 , G11B27/28 , G11B27/34 , G11B2220/20 , G11B2220/65 , G11B2220/90 , H04N5/85 , H04N9/8042 , H04N9/8233 , H04N21/42646 , H04N21/4312 , H04N21/4314 , H04N21/4325 , H04N21/812 , H04N21/84 , H04N21/8456

摘要： A method presents a video according to compositional structures associated with the video. Each compositional structure has a label, and multiple segments that can be organized temporally or hierarchically. A particular compositional structure is selected with a remote controller, and the video is presented by a playback controller on a display device according to the compositional structure.

摘要翻译： 一种方法根据与视频相关联的组合结构呈现视频。每个组成结构都有一个标签，并且可以在时间上或分层上组织的多个片段。利用遥控器选择特定的组成结构，根据组成结构，视频由显示设备上的播放控制器呈现。

42.

发明申请
Audio-visual highlights detection using coupled hidden markov models 审中-公开
标题翻译：使用耦合的隐马尔可夫模型的视听亮点检测

公开(公告)号：US20050125223A1

公开(公告)日：2005-06-09

申请号：US10729164

申请日：2003-12-05

申请人： Ajay Divakaran , Ziyou Xiong , Regunathan Radhakrishnan

发明人： Ajay Divakaran , Ziyou Xiong , Regunathan Radhakrishnan

IPC分类号： G06F17/30 , G06K9/00 , G06K9/20 , G06K9/62 , G06T7/00 , G10L11/00 , G10L15/00 , G10L15/04 , G10L15/10 , G10L15/26 , G10L17/00 , G10L19/12 , H04N5/76

CPC分类号： G06K9/00711 , G06F16/739 , G06F16/7834 , G06F16/786 , G06K9/6297

摘要： A method uses probabilistic fusion to detect highlights in videos using both audio and visual information. Specifically, the method uses coupled hidden Markov models (CHMMs). Audio labels are generated using audio classification via Gaussian mixture models (GMMs), and visual labels are generated by quantizing average motion vector magnitudes. Highlights are modeled using discrete-observation CHMMs trained with labeled videos. The CHMMs have better performance than conventional hidden Markov models (HMMs) trained only on audio signals, or only on video frames.

摘要翻译： 一种方法使用概率融合来检测使用音频和视觉信息的视频中的高光。具体来说，该方法使用耦合的隐马尔可夫模型（CHMM）。使用高斯混合模型（GMM）的音频分类生成音频标签，并且通过量化平均运动矢量幅度来生成视觉标签。亮点是使用用标记视频训练的离散观察CHMM进行建模。 CHMM具有比仅在音频信号上训练的传统隐马尔可夫模型（HMM）更好的性能，或仅在视频帧上训练。

43.

发明授权
Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols 有权
标题翻译：音频编码方法和系统，用于通过实现不同解码协议的解码器生成统一的比特流解码

公开(公告)号：US09378743B2

公开(公告)日：2016-06-28

申请号：US14009503

申请日：2012-04-05

申请人： Jeffrey C. Riedmiller , Farhad Farahani , Michael Schug , Regunathan Radhakrishnan , Mark S. Vinton

发明人： Jeffrey C. Riedmiller , Farhad Farahani , Michael Schug , Regunathan Radhakrishnan , Mark S. Vinton

IPC分类号： G10L19/008 , G10L19/002 , G10L19/16

CPC分类号： G10L19/002 , G10L19/167

摘要： In a class of embodiments, an audio encoding system (typically, a perceptual encoding system that is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.

摘要翻译： 在一类实施例中，音频编码系统（通常是感知编码系统，其被配置为生成与第一解码器兼容的（即可解码的）单个（“统一”）比特流，第一解码器被配置为对根据第一编码协议（例如，多频道杜比数字+或DD +协议）和被配置为对根据第二编码协议（例如立体声AAC，HE AAC v1或HE）编码的音频数据进行解码的第二解码器统一比特流可以包括可由第一解码器解码（并由第二解码器忽略）的可编码数据（例如，数据突发）和由第二解码器解码的编码数据（例如，其他数据突发）并且被第一解码器忽略），实际上，当第一解码器对比特流进行解码时，第二编码格式被隐藏在统一比特流内，并且当比特流中第一编码格式被隐藏在统一比特流内时令牌由第二解码器解码。根据本发明生成的统一比特流的格式可以消除在整个媒体链和/或生态系统中对代码转换元素的需要。本发明的其他方面是由本发明编码器的任何实施例执行的编码方法，由本发明解码器的任何实施例执行的解码方法，以及存储用于实现本发明的任何实施例的代码的计算机可读介质（例如，盘）方法。

44.

发明授权
Ranking representative segments in media data 有权
标题翻译：在媒体数据中排列代表性细分

公开(公告)号：US09313593B2

公开(公告)日：2016-04-12

申请号：US13997866

申请日：2011-12-15

申请人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

发明人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

IPC分类号： H04R29/00 , G10H1/00 , G06F17/30 , G06F17/00 , G10L25/48

CPC分类号： H04R29/00 , G06F17/00 , G06F17/3053 , G10H1/0008 , G10H2210/061 , G10H2240/151 , G10L25/48

摘要： Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.

摘要翻译： 提供了在媒体数据中排列代表片段的技术。可以从媒体数据中提取许多不同类型的媒体特征。可以将多个排名得分分配给多个候选代表段。基于从媒体数据可提取的一种或多种类型的特征，多个候选代表段中的每个候选代表段包括媒体数据的媒体特征中的一个或多个统计模式中的至少一个场景。可以将多个排名得分中的每个个体排名分数分配给多个候选代表段中的个人候选代表段。可以基于多个排名得分从候选代表段中选择要向最终用户播放的代表段。

45.

发明授权
Method, apparatus, and medium for detecting frequency extension coding in the coding history of an audio signal 有权
标题翻译：用于在音频信号的编码历史中检测频率扩展编码的方法，装置和介质

公开(公告)号：US09117440B2

公开(公告)日：2015-08-25

申请号：US14116113

申请日：2012-04-30

申请人： Harald H. Mundt , Arijit Biswas , Regunathan Radhakrishnan

发明人： Harald H. Mundt , Arijit Biswas , Regunathan Radhakrishnan

IPC分类号： G10L19/00 , G10L21/02 , G10L25/03 , G10L19/008 , G10L21/038

CPC分类号： G10L19/00 , G10L19/008 , G10L21/02 , G10L21/038 , G10L25/03

摘要： The present document relates to audio forensics, notably the blind detection of traces of parametric audio encoding/decoding. In particular, the present document relates to the detection of parametric frequency extension audio coding, such as spectral band replication (SBR) or spectral extension (SPX), from uncompressed waveforms such as PCM (pulse code modulation) encoded waveforms. A method for detecting frequency extension coding history in a time domain audio signal is described. The method may comprise transforming the time domain audio signal into a frequency domain, thereby generating a plurality of subband signals in a corresponding plurality of subbands comprising low and high frequency subbands; determining a degree of relationship between subband signals in the low frequency subbands and subband signals in the high frequency subbands; wherein the degree of relationship is determined based on the plurality of subband signals; and determining frequency extension coding history if the degree of relationship is greater than a relationship threshold.

摘要翻译： 本文件涉及音频取证，特别是盲目检测参数音频编码/解码的痕迹。特别地，本文件涉及从诸如PCM（脉冲编码调制）编码波形的未压缩波形检测参数频率扩展音频编码，例如频谱带复制（SBR）或频谱扩展（SPX）。描述了用于检测时域音频信号中的频率扩展编码历史的方法。该方法可以包括将时域音频信号变换成频域，从而在包括低频和高频子带的相应多个子带中产生多个子带信号; 确定低频子带中的子带信号与高频子带中的子带信号之间的关系程度; 其中所述关系度基于所述多个子带信号来确定; 以及如果所述关系度大于关系阈值，则确定频率扩展编码历史。

46.

发明申请
AUDIO ENCODING METHOD AND SYSTEM FOR GENERATING A UNIFIED BITSTREAM DECODABLE BY DECODERS IMPLEMENTING DIFFERENT DECODING PROTOCOLS 有权
标题翻译：音视频编码方法和系统，用于生成由解码器实现的不同解码协议解码的统一的双绞线

公开(公告)号：US20140358554A1

公开(公告)日：2014-12-04

申请号：US14009503

申请日：2012-04-05

申请人： Jeffrey C. Riedmiller , Farhad Farahani , Michael Schug , Regunathan Radhakrishnan , Mark S. Vinton

发明人： Jeffrey C. Riedmiller , Farhad Farahani , Michael Schug , Regunathan Radhakrishnan , Mark S. Vinton

IPC分类号： G10L19/002

CPC分类号： G10L19/002 , G10L19/167

摘要： In a class of embodiments, an audio encoding system (typically, a perceptual encoding system that is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.

摘要翻译： 在一类实施例中，音频编码系统（通常是感知编码系统，其被配置为生成与第一解码器兼容的（即可解码的）单个（“统一”）比特流，第一解码器被配置为对根据第一编码协议（例如，多频道杜比数字+或DD +协议）和被配置为对根据第二编码协议（例如立体声AAC，HE AAC v1或HE）编码的音频数据进行解码的第二解码器统一比特流可以包括可由第一解码器解码（并由第二解码器忽略）的可编码数据（例如，数据突发）和由第二解码器解码的编码数据（例如，其他数据突发）并且被第一解码器忽略），实际上，当第一解码器对比特流进行解码时，第二编码格式被隐藏在统一比特流内，并且当比特流中第一编码格式被隐藏在统一比特流内时令牌由第二解码器解码。根据本发明生成的统一比特流的格式可以消除在整个媒体链和/或生态系统中对代码转换元素的需要。本发明的其他方面是由本发明编码器的任何实施例执行的编码方法，由本发明解码器的任何实施例执行的解码方法，以及存储用于实现本发明的任何实施例的代码的计算机可读介质（例如，盘）方法。

47.

发明申请
Scene Change Detection Around a Set of Seed Points in Media Data 有权
标题翻译：媒体数据中一组种子点的场景变化检测

公开(公告)号：US20130287214A1

公开(公告)日：2013-10-31

申请号：US13997860

申请日：2011-12-15

申请人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

发明人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

IPC分类号： H04R29/00

CPC分类号： H04R29/00 , G06F17/00 , G06F17/3053 , G10H1/0008 , G10H2210/061 , G10H2240/151 , G10L25/48

摘要： Techniques for scene change detection around seed points in media data are provided. Media features of many different types may be extracted from the media data. One or more statistical patterns of media features in a plurality of time-wise intervals around a plurality of seed time points of the media data may be determined using one or more types of features extractable from the media data. At least one of the one or more types of features comprises a type of features that captures structural properties, tonality including harmony and melody, timbre, rhythm, loudness, stereo mix, or a quantity of sound sources as related to the media data. A plurality of beginning scene change points and a plurality of ending scene change points in the media data may be detected, based on the one or more statistical patterns, for the plurality of seed time points in the media data.

摘要翻译： 提供媒体数据中种子点周围场景变化检测技术。可以从媒体数据中提取许多不同类型的媒体特征。可以使用从媒体数据可提取的一种或多种类型的特征来确定围绕媒体数据的多个种子时间点的多个时间间隔中的媒体特征的一个或多个统计模式。一种或多种类型的特征中的至少一种包括捕获与媒体数据相关的结构性质，包括和声和旋律的音调，音色，节奏，响度，立体声混合或数量的声源的特征的类型。可以基于媒体数据中的多个种子时间点的一个或多个统计模式来检测媒体数据中的多个起始场景变化点和多个结束场景变化点。

48.

发明申请
Repetition Detection in Media Data 审中-公开
标题翻译：媒体数据中的重复检测

公开(公告)号：US20130275421A1

公开(公告)日：2013-10-17

申请号：US13997847

申请日：2011-12-15

申请人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

发明人： Barbara Resch , Regunathan Radhakrishnan , Arijit Biswas , Jonas Engdegard

IPC分类号： G06F17/30

CPC分类号： G06F16/24578 , G06F17/00 , G10H1/0008 , G10H2210/061 , G10H2240/151 , G10L25/48 , H04R29/00

摘要： Techniques for repetition detection in media data are provided. Media features of many different types may be extracted from the media data. Query sequences of fingerprints may be selected time intervals that begin at query times. Matched sequences of fingerprints may be determined. A set of offset values may be determined based on the matched sequences of fingerprints. This set of offset values may be further refined into a set of significant time points using a relatively targeted search and comparison method based on the media features of a second type extracted from the media data.

摘要翻译： 提供了媒体数据中重复检测技术。可以从媒体数据中提取许多不同类型的媒体特征。指纹的查询序列可以是从查询时间开始的选择的时间间隔。可以确定匹配的指纹序列。可以基于匹配的指纹序列来确定一组偏移值。可以使用基于从媒体数据提取的第二类型的媒体特征的相对有针对性的搜索和比较方法，将这组偏移值进一步细化为一组有效时间点。

49.

发明授权
Multimode coding of speech-like and non-speech-like signals 有权
标题翻译：语音和非语音信号的多模式编码

公开(公告)号：US08392179B2

公开(公告)日：2013-03-05

申请号：US12921752

申请日：2009-03-12

申请人： Rongshan Yu , Regunathan Radhakrishnan , Robert Andersen , Grant Davidson

发明人： Rongshan Yu , Regunathan Radhakrishnan , Robert Andersen , Grant Davidson

IPC分类号： G10L11/06

CPC分类号： G10L19/18 , G10L19/093 , G10L19/12 , G10L2019/0004 , G10L2019/0005

摘要： The invention relates to the coding of audio signals that may include both speech-like and non-speech-like signal components. It describes methods and apparatus for code excited linear prediction (CELP) audio encoding and decoding that employ linear predictive coding (LPC) synthesis filters controlled by LPC parameters, a plurality of codebooks each having codevectors, at least one codebook providing an excitation more appropriate for non-speech-like signals and at least one codebook providing an excitation more appropriate for speech-like signals, and a plurality of gain factors, each associated with a codebook. The encoding methods and apparatus select from the codebooks codevectors and/or associated gain factors by minimizing a measure of the difference between the audio signal and a reconstruction of the audio signal derived from the codebook excitations. The decoding methods and apparatus generate a reconstructed output signal from the LPC parameters, codevectors, and gain factors.

摘要翻译： 本发明涉及可以包括语音类和非语音类信号分量的音频信号的编码。它描述了采用由LPC参数控制的线性预测编码（LPC）合成滤波器的码激励线性预测（CELP）音频编码和解码的方法和装置，每个具有码矢量的多个码本，提供更适合于非语音类信号和至少一个提供更适合于类似语音的信号的激励的码本，以及多个增益因子，每个与码本相关联。编码方法和装置通过最小化音频信号与从码本激励导出的音频信号的重建之间的差异的度量来从码本代码矢量和/或相关联的增益因子中选择。解码方法和装置从LPC参数，代码矢量和增益因子产生重构的输出信号。

50.

发明授权
Detecting and diagnosing faults in HVAC equipment 有权
标题翻译：检测和诊断HVAC设备故障

公开(公告)号：US07444251B2

公开(公告)日：2008-10-28

申请号：US11498289

申请日：2006-08-01

申请人： Daniel N. Nikovski , Ajay Divakaran , Regunathan Radhakrishnan , Kadir A. Peker

发明人： Daniel N. Nikovski , Ajay Divakaran , Regunathan Radhakrishnan , Kadir A. Peker

IPC分类号： G01N37/00 , G08B21/00 , G05B9/02 , G06F19/00 , G01D3/00 , G01L25/00

CPC分类号： G05B23/0254 , F24F11/30 , F24F11/52 , F24F2110/00

摘要： A method and system detects and diagnoses faults in heating, ventilating and air conditioning (HVAC) equipment. Internal state variables of the HVAC equipment are measured under external driving conditions. Expected internal state variables are predicted for the HVAC equipment operating under the external driving conditions using a locally weighted regression model. Features are determined of the HVAC based on differences between the measured and predicted state variables. The features are classified to determine a condition of the HVAC equipment.

摘要翻译： 一种方法和系统检测和诊断加热，通风和空调（HVAC）设备中的故障。 HVAC设备的内部状态变量在外部驾驶条件下测量。对于在外部驾驶条件下使用局部加权回归模型运行的暖通空调设备，预测内部状态变量。特征根据测量和预测状态变量之间的差异确定HVAC。这些特征被分类以确定HVAC设备的状况。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类