Digital media universal elementary stream
    41.
    发明授权
    Digital media universal elementary stream 有权
    数字媒体通用基本流

    公开(公告)号:US08861927B2

    公开(公告)日:2014-10-14

    申请号:US13360577

    申请日:2012-01-27

    IPC分类号: H04N9/80 G10L19/16

    CPC分类号: G10L19/167

    摘要: Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.

    摘要翻译: 描述的技术和工具包括用于将给定格式的数字媒体数据(例如,音频,视频,静止图像和/或文本等)映射到用于对光盘上的数据进行编码有用的传输或文件容器格式的技术和工具 例如数字视频盘(DVD)。 数字媒体通用基本流可用于将数字媒体流(例如,音频流,视频流或图像)映射到任何任意的传输或文件容器中,包括光盘格式和其他传输,例如广播流,无线 传输等。用于解码流中数字媒体的任何给定帧的信息可以在每个编码帧中传送。 数字媒体通用基本流包括称为块的流组件。 数字媒体通用基本流的实现将帧中的媒体流的数据排列,帧具有一个或多个块。

    Quantization and inverse quantization for audio
    42.
    发明授权
    Quantization and inverse quantization for audio 有权
    音频的量化和反量化

    公开(公告)号:US08255234B2

    公开(公告)日:2012-08-28

    申请号:US13276163

    申请日:2011-10-18

    CPC分类号: G10L19/032 G10L19/008

    摘要: An audio encoder and decoder use architectures and techniques that improve the efficiency of quantization (e.g., weighting) and inverse quantization (e.g., inverse weighting) in audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder quantizes audio data in multiple channels, applying multiple channel-specific quantizer step modifiers, which give the encoder more control over balancing reconstruction quality between channels. The encoder also applies multiple quantization matrices and varies the resolution of the quantization matrices, which allows the encoder to use more resolution if overall quality is good and use less resolution if overall quality is poor. Finally, the encoder compresses one or more quantization matrices using temporal prediction to reduce the bitrate associated with the quantization matrices. An audio decoder performs corresponding inverse processing and decoding.

    摘要翻译: 音频编码器和解码器使用在音频编码和解码中提高量化(例如,加权)和逆量化(例如,反加权)的效率的架构和技术。 所描述的策略包括可以组合或独立使用的各种技术和工具。 例如,音频编码器对多个信道中的音频数据进行量化,应用多个信道专用量化器步进修改器,这使得编码器更多地控制平衡信道之间的重建质量。 编码器还应用多个量化矩阵并改变量化矩阵的分辨率,这允许编码器在整体质量好的情况下使用更高的分辨率,并且如果整体质量差,则使用较小的分辨率。 最后,编码器使用时间预测来压缩一个或多个量化矩阵,以减少与量化矩阵相关联的比特率。 音频解码器执行相应的反向处理和解码。

    Headphone surround using artificial reverberation
    43.
    发明授权
    Headphone surround using artificial reverberation 有权
    耳机环绕使用人造混响

    公开(公告)号:US08116469B2

    公开(公告)日:2012-02-14

    申请号:US11771886

    申请日:2007-06-29

    IPC分类号: H03G3/00

    CPC分类号: H03G5/005

    摘要: A surround experience for headphones can be created using various techniques and tools. The described techniques and tools can be applied separately or in combination. For example, a surround experience can be created by splitting input audio channel signals into high-pass and low-pass signals. The low-pass signals can undergo cross-mixing and artificial reverberation. The artificial reverberation can apply different delay profiles to the low-pass audio signals. The high-pass and low-pass audio signals can be combined to generate output audio channel signals. A surround experience can be crated using a system comprising an input module, a crossover module, a channel cross-mixing module, a reverberation module, and an adder module.

    摘要翻译: 可以使用各种技术和工具创建耳机的环绕体验。 所描述的技术和工具可以单独使用或组合使用。 例如,可以通过将输入音频信道信号分割成高通和低通信号来创建环绕体验。 低通信号可以进行交叉混合和人为混响。 人造混响可以对低通音频信号应用不同的延迟曲线。 可以组合高通和低通音频信号以产生输出音频信道信号。 可以使用包括输入模块,交叉模块,频道交叉混合模块,混响模块和加法器模块的系统来包装环绕体验。

    Multi-channel audio encoding and decoding

    公开(公告)号:US08099292B2

    公开(公告)日:2012-01-17

    申请号:US12944604

    申请日:2010-11-11

    IPC分类号: G10L19/00 G10L21/00 G10L21/04

    摘要: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.

    Entropy encoding and decoding using direct level and run-length/level context-adaptive arithmetic coding/decoding modes
    45.
    发明授权
    Entropy encoding and decoding using direct level and run-length/level context-adaptive arithmetic coding/decoding modes 有权
    使用直接级和游程长度/级别上下文自适应算术编码/解码模式进行熵编码和解码

    公开(公告)号:US08090574B2

    公开(公告)日:2012-01-03

    申请号:US12907848

    申请日:2010-10-19

    IPC分类号: G10L19/02 H03M7/46 H04B1/66

    摘要: An encoder performs context-adaptive arithmetic encoding of transform coefficient data. For example, an encoder switches between coding of direct levels of quantized transform coefficient data and run-level coding of run lengths and levels of quantized transform coefficient data. The encoder can determine when to switch between coding modes based on a pre-determined switch point or by counting consecutive coefficients having a predominant value (e.g., zero). A decoder performs corresponding context-adaptive arithmetic decoding.

    摘要翻译: 编码器执行变换系数数据的上下文自适应算术编码。 例如,编码器在量化变换系数数据的直接电平的编码和运行长度的运行电平编码和量化的变换系数数据的电平之间切换。 编码器可以基于预定的切换点或通过计算具有主要值(例如,零)的连续系数来确定何时在编码模式之间切换。 解码器执行相应的上下文自适应算术解码。

    Quantization and inverse quantization for audio
    46.
    发明授权
    Quantization and inverse quantization for audio 有权
    音频的量化和反量化

    公开(公告)号:US08069052B2

    公开(公告)日:2011-11-29

    申请号:US12849626

    申请日:2010-08-03

    IPC分类号: G10L19/00 G10L21/00 G10L21/04

    CPC分类号: G10L19/032 G10L19/008

    摘要: An audio encoder and decoder use architectures and techniques that improve the efficiency of quantization (e.g., weighting) and inverse quantization (e.g., inverse weighting) in audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder quantizes audio data in multiple channels, applying multiple channel-specific quantizer step modifiers, which give the encoder more control over balancing reconstruction quality between channels. The encoder also applies multiple quantization matrices and varies the resolution of the quantization matrices, which allows the encoder to use more resolution if overall quality is good and use less resolution if overall quality is poor. Finally, the encoder compresses one or more quantization matrices using temporal prediction to reduce the bitrate associated with the quantization matrices. An audio decoder performs corresponding inverse processing and decoding.

    摘要翻译: 音频编码器和解码器使用在音频编码和解码中提高量化(例如,加权)和逆量化(例如,反加权)的效率的架构和技术。 所描述的策略包括可以组合或独立使用的各种技术和工具。 例如,音频编码器对多个信道中的音频数据进行量化,应用多个信道专用量化器步进修改器,这使得编码器更多地控制平衡信道之间的重建质量。 编码器还应用多个量化矩阵并改变量化矩阵的分辨率,这允许编码器在整体质量好的情况下使用更高的分辨率,并且如果整体质量差,则使用较小的分辨率。 最后,编码器使用时间预测来压缩一个或多个量化矩阵,以减少与量化矩阵相关联的比特率。 音频解码器执行相应的反向处理和解码。

    Shape and scale parameters for extended-band frequency coding
    47.
    发明授权
    Shape and scale parameters for extended-band frequency coding 有权
    扩展频带编码的形状和缩放参数

    公开(公告)号:US07953604B2

    公开(公告)日:2011-05-31

    申请号:US11336618

    申请日:2006-01-20

    IPC分类号: G10L19/00

    CPC分类号: G10L21/038

    摘要: An audio encoder performs frequency extension coding that comprises determining one or more shape parameters using a displacement vector that corresponds to a displacement of an even number (e.g., an even number of sub-bands between a sub-band in a baseband frequency range and a sub-band in an extended-band frequency range). The shape parameters can be determined on a per-audio-block basis. Restricting a displacement to an even number (in frequency extension coding or in other signal modulation schemes) can improve the quality of reconstructed audio. An audio encoder also can perform frequency extension coding that comprises determining one or more scale parameters at one or more audio blocks, and determining one or more anchor points for interpolating the one or more scale parameters.

    摘要翻译: 音频编码器执行频率扩展编码,其包括使用对应于偶数位移的位移矢量来确定一个或多个形状参数(例如,基带频率范围中的子带和偶数个子带之间的偶数个子带) 子带在扩展频带范围内)。 形状参数可以基于每个音频块来确定。 将位移限制为偶数(在频率扩展编码或其他信号调制方案中)可以提高重构音频的质量。 音频编码器还可以执行频率扩展编码,其包括确定一个或多个音频块处的一个或多个缩放参数,以及确定用于内插一个或多个缩放参数的一个或多个定位点。

    MULTI-CHANNEL AUDIO ENCODING AND DECODING
    48.
    发明申请
    MULTI-CHANNEL AUDIO ENCODING AND DECODING 有权
    多通道音频编码和解码

    公开(公告)号:US20110060597A1

    公开(公告)日:2011-03-10

    申请号:US12943701

    申请日:2010-11-10

    IPC分类号: G10L19/00 G10L21/00

    摘要: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.

    摘要翻译: 音频编码器和解码器使用提高多声道音频编码和解码效率的架构和技术。 所描述的策略包括可以组合或独立使用的各种技术和工具。 例如,音频编码器对多声道音频数据执行预处理多声道变换,改变变换以便控制质量。 编码器将来自不同通道的多个窗口分组成一个或多个瓦片并输出瓦片配置信息,这允许编码器隔离出具有小窗口的特定通道中的瞬态,但在其他通道中使用大窗口。 使用各种技术,编码器执行灵活的多通道变换,有效利用信道间相关性。 音频解码器执行相应的处理和解码。 此外,解码器对于多个不同目的中的任一个执行后处理多信道变换。

    COMPLEX-TRANSFORM CHANNEL CODING WITH EXTENDED-BAND FREQUENCY CODING
    49.
    发明申请
    COMPLEX-TRANSFORM CHANNEL CODING WITH EXTENDED-BAND FREQUENCY CODING 有权
    具有扩展频段编码的复杂变换通道编码

    公开(公告)号:US20110035226A1

    公开(公告)日:2011-02-10

    申请号:US12907889

    申请日:2010-10-19

    IPC分类号: G10L19/00

    CPC分类号: G10L21/038 G10L19/008

    摘要: An audio encoder receives multi-channel audio data comprising a group of plural source channels and performs channel extension coding, which comprises encoding a combined channel for the group and determining plural parameters for representing individual source channels of the group as modified versions of the encoded combined channel. The encoder also performs frequency extension coding. The frequency extension coding can comprise, for example, partitioning frequency bands in the multi-channel audio data into a baseband group and an extended band group, and coding audio coefficients in the extended band group based on audio coefficients in the baseband group. The encoder also can perform other kinds of transforms. An audio decoder performs corresponding decoding and/or additional processing tasks, such as a forward complex transform.

    摘要翻译: 音频编码器接收包括一组多个源信道的多声道音频数据,并执行信道扩展编码,其包括对该组的组合信道进行编码,并确定用于表示该组的各个源信道的多个参数,作为编码组合的修改版本 渠道。 编码器还执行频率扩展编码。 频率扩展编码可以包括例如将多声道音频数据中的频带划分为基带组和扩展频带组,并且基于基带组中的音频系数对扩展频带组中的音频系数进行编码。 编码器还可以执行其他类型的转换。 音频解码器执行相应的解码和/或附加处理任务,例如前向复合变换。

    ENTROPY CODING USING ESCAPE CODES TO SWITCH BETWEEN PLURAL CODE TABLES
    50.
    发明申请
    ENTROPY CODING USING ESCAPE CODES TO SWITCH BETWEEN PLURAL CODE TABLES 有权
    使用ESCAPE代码进行熵编码,以切换一级代码表

    公开(公告)号:US20110035225A1

    公开(公告)日:2011-02-10

    申请号:US12907848

    申请日:2010-10-19

    IPC分类号: G10L19/00 G10L21/00

    摘要: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.

    摘要翻译: 音频编码器执行音频数据的自适应熵编码。 例如,音频编码器在量化音频数据的直接电平的可变维矢量霍夫曼编码和游程长度的游程级编码以及量化的音频数据的电平之间切换。 编码器可以使用例如用于对运行长度和电平进行编码的基于上下文的算术编码。 编码器可以通过计算具有主要值(例如,零)的连续系数来确定何时在编码模式之间切换。 音频解码器执行相应的自适应熵解码。