Efficient and scalable parametric stereo coding for low bitrate audio coding applications
    51.
    发明授权
    Efficient and scalable parametric stereo coding for low bitrate audio coding applications 有权
    低比特率音频编码应用的高效可扩展的参数立体声编码

    公开(公告)号:US09218818B2

    公开(公告)日:2015-12-22

    申请号:US13458492

    申请日:2012-04-27

    摘要: The present invention provides improvements to prior art audio codecs that generate a stereo-illusion through post-processing of a received mono signal. These improvements are accomplished by extraction of stereo-image describing parameters at the encoder side, which are transmitted and subsequently used for control of a stereo generator at the decoder side. Furthermore, the invention bridges the gap between simple pseudo-stereo methods, and current methods of true stereo-coding, by using a new form of parametric stereo coding. A stereo-balance parameter is introduced, which enables more advanced stereo modes, and in addition forms the basis of a new method of stereo-coding of spectral envelopes, of particular use in systems where guided HFR (High Frequency Reconstruction) is employed. As a special case, the application of this stereo-coding scheme in scalable HFR-based codecs is described.

    摘要翻译: 本发明提供对现有技术的音频编解码器的改进,其通过对接收到的单声道信号的后处理产生立体幻象。 这些改进通过提取在编码器侧描述参数的立体图像来实现,其被传输并随后用于在解码器侧对立体声发生器的控制。 此外,本发明通过使用新形式的参数立体声编码来弥合简单伪立体声方法和现有的真实立体声编码方法之间的差距。 引入立体声平衡参数,其实现更高级的立体声模式,并且还构成了频谱包络的​​立体编码的新方法的基础,在采用引导HFR(高频重建)的系统中特别有用。 作为特殊情况,描述了这种立体声编码方案在可扩展的基于HFR的编解码器中的应用。

    EFFICIENT AND SCALABLE PARAMETRIC STEREO CODING FOR LOW BITRATE AUDIO CODING APPLICATIONS
    52.
    发明申请
    EFFICIENT AND SCALABLE PARAMETRIC STEREO CODING FOR LOW BITRATE AUDIO CODING APPLICATIONS 审中-公开
    低成本音频编码应用的高效和可扩展参数立体声编码

    公开(公告)号:US20120213377A1

    公开(公告)日:2012-08-23

    申请号:US13458492

    申请日:2012-04-27

    IPC分类号: H04R5/00

    摘要: The present invention provides improvements to prior art audio codecs that generate a stereo-illusion through post-processing of a received mono signal. These improvements are accomplished by extraction of stereo-image describing parameters at the encoder side, which are transmitted and subsequently used for control of a stereo generator at the decoder side. Furthermore, the invention bridges the gap between simple pseudo-stereo methods, and current methods of true stereo-coding, by using a new form of parametric stereo coding. A stereo-balance parameter is introduced, which enables more advanced stereo modes, and in addition forms the basis of a new method of stereo-coding of spectral envelopes, of particular use in systems where guided HFR (High Frequency Reconstruction) is employed. As a special case, the application of this stereo-coding scheme in scalable HFR-based codecs is described.

    摘要翻译: 本发明提供对现有技术的音频编解码器的改进,其通过对接收到的单声道信号的后处理产生立体幻象。 这些改进通过提取在编码器侧描述参数的立体图像来实现,其被传输并随后用于在解码器侧对立体声发生器的控制。 此外,本发明通过使用新形式的参数立体声编码来弥合简单伪立体声方法和现有的真实立体声编码方法之间的差距。 引入立体声平衡参数,其实现更高级的立体声模式,并且还构成了频谱包络的​​立体编码的新方法的基础,在采用引导HFR(高频重建)的系统中特别有用。 作为特殊情况,描述了这种立体声编码方案在可扩展的基于HFR的编解码器中的应用。

    Stereo balance interpolation
    53.
    发明授权
    Stereo balance interpolation 有权
    立体声平衡插补

    公开(公告)号:US08073144B2

    公开(公告)日:2011-12-06

    申请号:US11237133

    申请日:2005-09-27

    IPC分类号: H04R5/00

    摘要: The present invention provides improvements to prior art audio codecs that generate a stereo-illusion through post-processing of a received mono signal. These improvements are accomplished by extraction of stereo-image describing parameters at the encoder side, which are transmitted and subsequently used for control of a stereo generator at the decoder side. Furthermore, the invention bridges the gap between simple pseudo-stereo methods, and current methods of true stereo-coding, by using a new form of parametric stereo coding. A stereo-balance parameter is introduced, which enables more advanced stereo modes, and in addition forms the basis of a new method of stereo-coding of spectral envelopes, of particular use in systems where guided HFR (High Frequency Reconstruction) is employed. As a special case, the application of this stereo-coding scheme in scalable HFR-based codecs is described.

    摘要翻译: 本发明提供对现有技术的音频编解码器的改进,其通过对接收到的单声道信号的后处理产生立体幻象。 这些改进通过提取在编码器侧描述参数的立体图像来实现,其被传输并随后用于在解码器侧对立体声发生器的控制。 此外,本发明通过使用新形式的参数立体声编码来弥合简单伪立体声方法和现有的真实立体声编码方法之间的差距。 引入立体声平衡参数,其实现更高级的立体声模式,并且还构成了频谱包络的​​立体编码的新方法的基础,在采用引导HFR(高频重建)的系统中特别有用。 作为特殊情况,描述了这种立体声编码方案在可扩展的基于HFR的编解码器中的应用。

    Efficient and scalable parametric stereo coding for low bitrate audio coding applications
    54.
    发明授权
    Efficient and scalable parametric stereo coding for low bitrate audio coding applications 有权
    低比特率音频编码应用的高效可扩展的参数立体声编码

    公开(公告)号:US07382886B2

    公开(公告)日:2008-06-03

    申请号:US10483453

    申请日:2002-07-10

    摘要: The present invention provides improvements to prior art audio codecs that generate a stereo-illusion through post-processing of a received mono signal. These improvements are accomplished by extraction of stereo-image describing parameters at the encoder side, which are transmitted and subsequently used for control of a stereo generator at the decoder side. Furthermore, the invention bridges the gap between simple pseudo-stereo methods, and current methods of true stereo-coding, by using a new form of parametric stereo coding. A stereo-balance parameter is introduced, which enables more advanced stereo modes, and in addition forms the basis of a new method of stereo-coding of spectral envelopes, of particular use in systems where guided HFR (High Frequency Reconstruction) is employed. As a special case, the application of this stereo-coding scheme in scalable HFR-based codecs is described.

    摘要翻译: 本发明提供对现有技术的音频编解码器的改进,其通过对接收到的单声道信号的后处理产生立体幻象。 这些改进通过提取在编码器侧描述参数的立体图像来实现,其被传输并随后用于在解码器侧对立体声发生器的控制。 此外,本发明通过使用新形式的参数立体声编码来弥合简单伪立体声方法和现有的真实立体声编码方法之间的差距。 引入立体声平衡参数,其实现更高级的立体声模式,并且还构成了频谱包络的​​立体编码的新方法的基础,在采用引导HFR(高频重建)的系统中特别有用。 作为特殊情况,描述了这种立体声编码方案在可扩展的基于HFR的编解码器中的应用。

    Efficient and scalable parametric stereo coding for low bitrate applications
    56.
    发明申请
    Efficient and scalable parametric stereo coding for low bitrate applications 有权
    低比特率应用的高效可扩展的参数立体声编码

    公开(公告)号:US20050053242A1

    公开(公告)日:2005-03-10

    申请号:US10483453

    申请日:2002-07-10

    摘要: The present invention provides improvements to prior art audio codecs that generate a stereo-illusion through post-processing of a received mono signal. These improvements are accomplished by extraction of stereo-image describing parameters at the encoder side, which are transmitted and subsequently used for control of a stereo generator at the decoder side. Furthermore, the invention bridges the gap between simple pseudo-stereo methods, and current methods of true stereo-coding, by using a new form of parametric stereo coding. A stereo-balance parameter is introduced, which enables more advanced stereo modes, and in addition forms the basis of a new method of stereo-coding of spectral envelopes, of particular use in systems where guided HFR (High Frequency Reconstruction) is employed. As a special case, the application of this stereo-coding scheme in scalable HFR-based codecs is described.

    摘要翻译: 本发明提供对现有技术的音频编解码器的改进,其通过对接收到的单声道信号的后处理产生立体幻象。 这些改进通过提取在编码器侧描述参数的立体图像来实现,其被传输并随后用于在解码器侧对立体声发生器的控制。 此外,本发明通过使用新形式的参数立体声编码来弥合简单伪立体声方法和现有的真实立体声编码方法之间的差距。 引入立体声平衡参数,其实现更高级的立体声模式,并且还构成了频谱包络的​​立体编码的新方法的基础,在采用引导HFR(高频重建)的系统中特别有用。 作为特殊情况,描述了这种立体声编码方案在可扩展的基于HFR的编解码器中的应用。

    Ranking representative segments in media data
    57.
    发明授权
    Ranking representative segments in media data 有权
    在媒体数据中排列代表性细分

    公开(公告)号:US09313593B2

    公开(公告)日:2016-04-12

    申请号:US13997866

    申请日:2011-12-15

    摘要: Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.

    摘要翻译: 提供了在媒体数据中排列代表片段的技术。 可以从媒体数据中提取许多不同类型的媒体特征。 可以将多个排名得分分配给多个候选代表段。 基于从媒体数据可提取的一种或多种类型的特征,多个候选代表段中的每个候选代表段包括媒体数据的媒体特征中的一个或多个统计模式中的至少一个场景。 可以将多个排名得分中的每个个体排名分数分配给多个候选代表段中的个人候选代表段。 可以基于多个排名得分从候选代表段中选择要向最终用户播放的代表段。

    Scene Change Detection Around a Set of Seed Points in Media Data
    58.
    发明申请
    Scene Change Detection Around a Set of Seed Points in Media Data 有权
    媒体数据中一组种子点的场景变化检测

    公开(公告)号:US20130287214A1

    公开(公告)日:2013-10-31

    申请号:US13997860

    申请日:2011-12-15

    IPC分类号: H04R29/00

    摘要: Techniques for scene change detection around seed points in media data are provided. Media features of many different types may be extracted from the media data. One or more statistical patterns of media features in a plurality of time-wise intervals around a plurality of seed time points of the media data may be determined using one or more types of features extractable from the media data. At least one of the one or more types of features comprises a type of features that captures structural properties, tonality including harmony and melody, timbre, rhythm, loudness, stereo mix, or a quantity of sound sources as related to the media data. A plurality of beginning scene change points and a plurality of ending scene change points in the media data may be detected, based on the one or more statistical patterns, for the plurality of seed time points in the media data.

    摘要翻译: 提供媒体数据中种子点周围场景变化检测技术。 可以从媒体数据中提取许多不同类型的媒体特征。 可以使用从媒体数据可提取的一种或多种类型的特征来确定围绕媒体数据的多个种子时间点的多个时间间隔中的媒体特征的一个或多个统计模式。 一种或多种类型的特征中的至少一种包括捕获与媒体数据相关的结构性质,包括和声和旋律的音调,音色,节奏,响度,立体声混合或数量的声源的特征的类型。 可以基于媒体数据中的多个种子时间点的一个或多个统计模式来检测媒体数据中的多个起始场景变化点和多个结束场景变化点。

    Repetition Detection in Media Data
    59.
    发明申请
    Repetition Detection in Media Data 审中-公开
    媒体数据中的重复检测

    公开(公告)号:US20130275421A1

    公开(公告)日:2013-10-17

    申请号:US13997847

    申请日:2011-12-15

    IPC分类号: G06F17/30

    摘要: Techniques for repetition detection in media data are provided. Media features of many different types may be extracted from the media data. Query sequences of fingerprints may be selected time intervals that begin at query times. Matched sequences of fingerprints may be determined. A set of offset values may be determined based on the matched sequences of fingerprints. This set of offset values may be further refined into a set of significant time points using a relatively targeted search and comparison method based on the media features of a second type extracted from the media data.

    摘要翻译: 提供了媒体数据中重复检测技术。 可以从媒体数据中提取许多不同类型的媒体特征。 指纹的查询序列可以是从查询时间开始的选择的时间间隔。 可以确定匹配的指纹序列。 可以基于匹配的指纹序列来确定一组偏移值。 可以使用基于从媒体数据提取的第二类型的媒体特征的相对有针对性的搜索和比较方法,将这组偏移值进一步细化为一组有效时间点。

    AUDIO SIGNAL DECODER, AUDIO SIGNAL ENCODER, METHOD FOR PROVIDING AN UPMIX SIGNAL REPRESENTATION, METHOD FOR PROVIDING A DOWNMIX SIGNAL REPRESENTATION, COMPUTER PROGRAM AND BITSTREAM USING A COMMON INTER-OBJECT-CORRELATION PARAMETER VALUE
    60.
    发明申请
    AUDIO SIGNAL DECODER, AUDIO SIGNAL ENCODER, METHOD FOR PROVIDING AN UPMIX SIGNAL REPRESENTATION, METHOD FOR PROVIDING A DOWNMIX SIGNAL REPRESENTATION, COMPUTER PROGRAM AND BITSTREAM USING A COMMON INTER-OBJECT-CORRELATION PARAMETER VALUE 有权
    音频信号解码器,音频信号编码器,用于提供UPMIX信号表示的方法,使用公共对象相关参数值提供下行信号表示的方法,计算机程序和比特

    公开(公告)号:US20120269353A1

    公开(公告)日:2012-10-25

    申请号:US13434450

    申请日:2012-03-29

    IPC分类号: G10L19/00 H04R5/00

    摘要: An audio signal decoder for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information and in dependence on a rendering information has an object parameter determinator. The object parameter determinator is configured to obtain inter-object-correlation values for a plurality of pairs of audio objects. The object parameter determinator is configured to evaluate a bitstream signaling parameter in order to decide whether to evaluate individual inter-object-correlation bitstream parameter values to obtain inter-object-correlation values for a plurality of pairs of related audio objects, or to obtain inter-object-correlation values for a plurality of pairs of related audio objects using a common inter-object-correlation bitstream parameter value. The audio signal decoder also has a signal processor configured to obtain the upmix signal representation on the basis of the downmix signal representation and using the inter-object-correlation values for a plurality of pairs of related objects and the rendering information.

    摘要翻译: 用于根据降混信号表示和对象相关参数信息提供上混合信号表示的音频信号解码器,并且根据呈现信息,具有对象参数确定器。 对象参数确定器被配置为获得多对音频对象的对象间相关值。 对象参数确定器被配置为评估比特流信令参数,以便决定是否评估各个对象间相关比特流参数值以获得多对相关音频对象的对象间相关值, - 使用公共对象间相关比特流参数值的多对相关音频对象的对象相关值。 音频信号解码器还具有信号处理器,其被配置为基于下混信号表示获得上混信号表示,并且使用多对相关对象和呈现信息的对象间相关值。