Enhanced Chroma Extraction from an Audio Codec
    21.
    发明申请
    Enhanced Chroma Extraction from an Audio Codec 有权
    从音频编解码器增强色度提取

    公开(公告)号:US20140310011A1

    公开(公告)日:2014-10-16

    申请号:US14359697

    申请日:2012-11-28

    Abstract: The present document relates to methods and systems for music information retrieval (MIR). In particular, the present document relates to methods and systems for extracting a chroma vector from an audio signal. A method (900) for determining a chroma vector (100) for a block of samples of an audio signal (301) is described. The method (900) comprises receiving (901) a corresponding block of frequency coefficients derived from the block of samples of the audio signal (301) from a core encoder (412) of a spectral band replication based audio encoder (410) adapted to generate an encoded bitstream (305) of the audio signal (301) from the block of frequency coefficients; and determining (904) the chroma vector (100) for the block of samples of the audio signal (301) based on the received block of frequency coefficients.

    Abstract translation: 本文件涉及用于音乐信息检索(MIR)的方法和系统。 特别地,本文件涉及从音频信号中提取色度矢量的方法和系统。 描述了一种用于确定音频信号(301)的采样块的色度矢量(100)的方法(900)。 所述方法(900)包括从适于生成的基于频谱带复制的音频编码器(410)的核心编码器(412)接收(901)从所述音频信号(301)的采样块导出的对应的频率系数块 根据频率系数块的音频信号(301)的编码比特流(305); 以及基于所接收的频率系数块来确定用于所述音频信号(301)的采样块的色度矢量(904)(904)。

    Reduced complexity converter SNR calculation
    28.
    发明授权
    Reduced complexity converter SNR calculation 有权
    降低复杂度转换器SNR计算

    公开(公告)号:US09378748B2

    公开(公告)日:2016-06-28

    申请号:US14439795

    申请日:2013-11-04

    CPC classification number: G10L19/008 G10L19/02 G10L19/032 G10L19/173

    Abstract: The present document relates to audio encoding/decoding. In particular, the present document relates to a method and system for reducing the complexity of a bit allocation process used in the context of audio encoding/decoding. An audio encoder (300) configured to encode an audio signal according to a first audio codec system is described. The audio encoder (300) comprises a transform unit (302) configured to determine a set of spectral coefficients (312) based on the audio signal. Furthermore, the encoder (300) comprises a floating-point encoding unit (304) configured to determine a set of scale factors and a set of scaled values (314), based on the set of spectral coefficients (312); and to encode the set of scale factors to yield a set of encoded scale factors (313).

    Abstract translation: 本文件涉及音频编码/解码。 特别地,本文件涉及用于降低在音频编码/解码的上下文中使用的比特分配处理的复杂度的方法和系统。 描述了被配置为根据第一音频编解码器系统对音频信号进行编码的音频编码器(300)。 音频编码器(300)包括被配置为基于音频信号确定一组频谱系数(312)的变换单元(302)。 此外,编码器(300)包括浮点编码单元(304),其被配置为基于频谱系数集合(312)确定一组缩放因子和一组缩放值(314); 并编码一组比例因子以产生一组编码比例因子(313)。

    System and Method for Non-Destructively Normalizing Loudness of Audio Signals within Portable Devices
    29.
    发明申请
    System and Method for Non-Destructively Normalizing Loudness of Audio Signals within Portable Devices 有权
    便携式设备中音频信号响度的非破坏性系统和方法

    公开(公告)号:US20150043754A1

    公开(公告)日:2015-02-12

    申请号:US14525536

    申请日:2014-10-28

    Abstract: Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio Implementations in encoders, in transcoders and in decoders are disclosed.

    Abstract translation: 许多便携式播放设备不能解码和播放具有宽带宽和宽动态范围的编码音频内容,具有一致的响度和清晰度,除非编码音频内容已经为这些设备专门准备。 通过使用编码的内容包含一些通过相对于另一已知压缩简档的绝对值或差分值来指定合适的动态范围压缩简档的元数据来克服该问题。 播放设备还可以自适应地对播放音频应用增益和限制。在代码转换器和解码器中公开了编码器中的实现。

    Methods and Systems for Efficient Recovery of High Frequency Audio Content
    30.
    发明申请
    Methods and Systems for Efficient Recovery of High Frequency Audio Content 有权
    高效率恢复高频音频内容的方法与系统

    公开(公告)号:US20150003632A1

    公开(公告)日:2015-01-01

    申请号:US14372733

    申请日:2013-02-22

    Abstract: The present document relates to the technical field of audio coding, decoding and processing. It specifically relates to methods of recovering high frequency content of an audio signal from low frequency content of the same audio signal in an efficient manner. A method for determining a first banded tonality value (311, 312) for a first frequency subband (205) of an audio signal is described. The first banded tonality value (311, 312) is used for approximating a high frequency component of the audio signal based on a low frequency component of the audio signal. The method comprises determining a set of transform coefficients in a corresponding set of frequency bins based on a block of samples of the audio signal; determining a set of bin tonality values (341) for the set of frequency bins using the set of transform coefficients, respectively; and combining a first subset of two or more of the set of bin tonality values (341) for two or more corresponding adjacent frequency bins of the set of frequency bins lying within the first frequency subband, thereby yielding the first banded tonality value (311, 312) for the first frequency subband.

    Abstract translation: 本文件涉及音频编码,解码和处理技术领域。 具体涉及以有效的方式从相同音频信号的低频内容中恢复音频信号的高频内容的方法。 描述了一种用于确定音频信号的第一频率子带(205)的第一带状音调值(311,312)的方法。 第一带状音调值(311,312)用于基于音频信号的低频分量近似音频信号的高频分量。 该方法包括基于音频信号的样本块来确定频率组的相应集合中的一组变换系数; 使用所述变换系数集合分别确定所述频率组集合的一组bin音调值(341); 以及将位于所述第一频率子带内的所述频率组集合中的两个或更多个对应的相邻频率块的所述组音调值集合(341)中的两个或更多个的第一子集合,从而产生所述第一带状音调值(311, 312)用于第一频率子带。

Patent Agency Ranking