Digital media universal elementary stream
    1.
    Patent application
    Legal status: in force

    Publication No.: US20050234731A1

    Publication date: 2005-10-20

    Application No.: US10966443

    Filing date: 2004-10-14

    CPC classification: G10L19/167

    Abstract: Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.

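The frame-and-chunk arrangement described in this abstract can be illustrated with a minimal sketch. The byte layout below (a one-byte chunk count, then a one-byte type and two-byte length before each payload) is purely an assumption for illustration; the abstract does not give the actual stream syntax, and the names `pack_frame`, `unpack_frame`, and the chunk type ids are hypothetical.

```python
import struct
from typing import List, Tuple

# A chunk is modeled as (type id, payload). Both the type ids and this
# layout are hypothetical; the abstract does not specify the real syntax.
Chunk = Tuple[int, bytes]


def pack_frame(chunks: List[Chunk]) -> bytes:
    """Serialize one frame: a chunk count, then type/length/payload per chunk."""
    out = bytearray(struct.pack(">B", len(chunks)))
    for chunk_type, payload in chunks:
        out += struct.pack(">BH", chunk_type, len(payload))
        out += payload
    return bytes(out)


def unpack_frame(data: bytes) -> List[Chunk]:
    """Parse a frame produced by pack_frame back into its chunks."""
    (count,) = struct.unpack_from(">B", data, 0)
    offset = 1
    chunks: List[Chunk] = []
    for _ in range(count):
        chunk_type, length = struct.unpack_from(">BH", data, offset)
        offset += 3  # one type byte plus two length bytes
        chunks.append((chunk_type, data[offset:offset + length]))
        offset += length
    return chunks
```

Because every chunk carries its own type and length, a reader could skip chunk types it does not understand, which is one plausible way a frame can stay decodable on its own regardless of the container it is carried in.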

    DIGITAL MEDIA UNIVERSAL ELEMENTARY STREAM
    2.
    Patent application
    Legal status: in force

    Publication No.: US20120130721A1

    Publication date: 2012-05-24

    Application No.: US13360577

    Filing date: 2012-01-27

    IPC classification: G10L19/00

    CPC classification: G10L19/167

    Abstract: Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.


    Digital media universal elementary stream
    3.
    Granted patent
    Legal status: in force

    Publication No.: US08131134B2

    Publication date: 2012-03-06

    Application No.: US10966443

    Filing date: 2004-10-15

    IPC classification: H04N5/92 H04N5/93

    CPC classification: G10L19/167

    Abstract: Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.


    Digital media universal elementary stream
    4.
    Granted patent
    Legal status: in force

    Publication No.: US08861927B2

    Publication date: 2014-10-14

    Application No.: US13360577

    Filing date: 2012-01-27

    IPC classification: H04N9/80 G10L19/16

    CPC classification: G10L19/167

    Abstract: Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.


    Multi-channel audio encoding and decoding
    5.
    Granted patent
    Legal status: in force

    Publication No.: US08255230B2

    Publication date: 2012-08-28

    Application No.: US13326315

    Filing date: 2011-12-14

    IPC classification: G10L19/00 G10L21/00 G10L21/04

    Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.

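To make "taking advantage of inter-channel correlation" concrete, here is a minimal sketch of the familiar sum/difference (mid/side) transform on a stereo pair. This is a well-known special case used only for illustration; the abstract does not state which transforms the patent uses, and the function names are hypothetical.

```python
from typing import List, Tuple


def to_mid_side(left: List[float], right: List[float]) -> Tuple[List[float], List[float]]:
    """Forward transform: mid is the channel average, side is half the difference."""
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side


def from_mid_side(mid: List[float], side: List[float]) -> Tuple[List[float], List[float]]:
    """Inverse transform: recover the left and right channels."""
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right
```

When the two channels are strongly correlated, the side channel is close to zero and is therefore cheap to code, which is the general effect the abstract alludes to.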

    QUANTIZATION AND INVERSE QUANTIZATION FOR AUDIO
    6.
    Patent application
    Legal status: in force

    Publication No.: US20120035941A1

    Publication date: 2012-02-09

    Application No.: US13276163

    Filing date: 2011-10-18

    IPC classification: G10L19/00

    CPC classification: G10L19/032 G10L19/008

    Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of quantization (e.g., weighting) and inverse quantization (e.g., inverse weighting) in audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder quantizes audio data in multiple channels, applying multiple channel-specific quantizer step modifiers, which give the encoder more control over balancing reconstruction quality between channels. The encoder also applies multiple quantization matrices and varies the resolution of the quantization matrices, which allows the encoder to use more resolution if overall quality is good and use less resolution if overall quality is poor. Finally, the encoder compresses one or more quantization matrices using temporal prediction to reduce the bitrate associated with the quantization matrices. An audio decoder performs corresponding inverse processing and decoding.

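The idea of channel-specific quantizer step modifiers can be sketched as uniform scalar quantization in which each channel's effective step is a shared base step scaled by that channel's modifier. This is an assumed, simplified model: the abstract does not say how the modifiers are combined with the step size, and `quantize`, `dequantize`, `base_step`, and `modifiers` are hypothetical names.

```python
from typing import List


def quantize(channels: List[List[float]], base_step: float,
             modifiers: List[float]) -> List[List[int]]:
    """Quantize each channel with step = base_step * that channel's modifier."""
    return [
        [round(x / (base_step * m)) for x in samples]
        for samples, m in zip(channels, modifiers)
    ]


def dequantize(levels: List[List[int]], base_step: float,
               modifiers: List[float]) -> List[List[float]]:
    """Inverse quantization: scale each level back by its channel's step."""
    return [
        [q * base_step * m for q in channel_levels]
        for channel_levels, m in zip(levels, modifiers)
    ]
```

A larger modifier gives a channel a coarser step (fewer bits, more distortion), so adjusting the modifiers is one way an encoder could trade reconstruction quality between channels.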

    Techniques for measurement of perceptual audio quality
    7.
    Granted patent
    Legal status: in force

    Publication No.: US07548855B2

    Publication date: 2009-06-16

    Application No.: US11475301

    Filing date: 2006-06-26

    IPC classification: G10L19/00

    CPC classification: G10L25/69

    Abstract: An audio processing tool measures the quality of reconstructed audio data. For example, an audio encoder measures the quality of a block of reconstructed frequency coefficient data in a quantization loop. The invention includes several techniques and tools, which can be used in combination or separately. First, before measuring quality, the tool normalizes the block to account for variation in block sizes. Second, for the quality measurement, the tool processes the reconstructed data by critical bands, which can differ from the quantization bands used to compress the data. Third, the tool accounts for the masking effect of the reconstructed data, not just the masking effect of the original data. Fourth, the tool band weights the quality measurement, which can be used to account for noise substitution or band truncation. Finally, the tool changes quality measurement techniques depending on the channel coding mode.


    Normalizing to compensate for block size variation when computing control parameter values for quality and rate control for digital audio
    8.
    Granted patent
    Legal status: in force

    Publication No.: US07299175B2

    Publication date: 2007-11-20

    Application No.: US11067018

    Filing date: 2005-02-24

    IPC classification: G10L19/14

    CPC classification: G10L19/24 G10L19/002

    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.


    QUALITY IMPROVEMENT TECHNIQUES IN AN AUDIO ENCODER
    9.
    Patent application
    Legal status: in force

    Publication No.: US20070185706A1

    Publication date: 2007-08-09

    Application No.: US11737072

    Filing date: 2007-04-18

    IPC classification: G10L19/00

    Abstract: An audio encoder implements multi-channel coding decision, band truncation, multi-channel rematrixing, and header reduction techniques to improve quality and coding efficiency. In the multi-channel coding decision technique, the audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon (a) energy separation between the coding channels, and (b) the disparity between excitation patterns of the separate input channels. In the band truncation technique, the audio encoder performs open-loop band truncation at a cut-off frequency based on a target perceptual quality measure. In multi-channel rematrixing technique, the audio encoder suppresses certain coefficients of a difference channel by scaling according to a scale factor, which is based on current average levels of perceptual quality, current rate control buffer fullness, coding mode, and the amount of channel separation in the source. In the header reduction technique, the audio encoder selectively modifies the quantization step size of zeroed quantization bands so as to encode in fewer frame header bits.


    Reordering coefficients for waveform coding or decoding
    10.
    Patent application
    Legal status: in force

    Publication No.: US20070016406A1

    Publication date: 2007-01-18

    Application No.: US11183297

    Filing date: 2005-07-15

    IPC classification: G10L21/00

    Abstract: Techniques and tools for reordering of spectral coefficients in encoding and decoding are described herein. For certain types and patterns of content, coefficient reordering reduces redundancy that is due to periodic patterns in the spectral coefficients, making subsequent entropy encoding more efficient. For example, an audio encoder receives spectral coefficients logically organized along one dimension such as frequency, reorders at least some of the spectral coefficients, and entropy encodes the spectral coefficients after the reordering. Or, an audio decoder receives entropy encoded information for such spectral coefficients, entropy decodes the information, and reverses reordering of at least some of the spectral coefficients.

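As a rough illustration of how reordering can expose a periodic pattern to an entropy coder, the sketch below groups coefficients by their phase within an assumed period, so that periodically spaced peaks become one contiguous run. The interleaving scheme, the fixed `period` parameter, and the function names are assumptions for illustration; the abstract does not specify the actual reordering rule.

```python
from typing import List


def reorder(coeffs: List[int], period: int) -> List[int]:
    """Group coefficients by phase: indices 0, period, 2*period, ... come first."""
    return [c for phase in range(period) for c in coeffs[phase::period]]


def inverse_reorder(reordered: List[int], period: int) -> List[int]:
    """Undo reorder() by writing each value back to its original position."""
    n = len(reordered)
    out = [0] * n
    i = 0
    for phase in range(period):
        for pos in range(phase, n, period):
            out[pos] = reordered[i]
            i += 1
    return out
```

For example, `reorder([9, 0, 0, 8, 0, 0, 7, 0, 0], 3)` yields `[9, 8, 7, 0, 0, 0, 0, 0, 0]`: the peaks are gathered at the front and followed by a single long run of zeros, which run-length or context-based entropy coding can represent compactly. The decoder applies `inverse_reorder` with the same period to restore the original order.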