Measuring and using reliability of complexity estimates during quality and rate control for digital audio
    71.
    Granted patent (status: active)

    Publication number: US07277848B2

    Publication date: 2007-10-02

    Application number: US11066859

    Filing date: 2005-02-24

    IPC classification: G10L21/02

    CPC classification: G10L19/24 G10L19/002

    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.
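
One of the listed features, a bit-count control loop bounded by minimum and maximum bit counts, can be illustrated with a minimal Python sketch. It assumes a toy bit-count model and uses geometric bisection as a stand-in for the interpolation rules, which the abstract does not spell out. The names `estimate_bits` and `find_step` are hypothetical and are not taken from the patent.

```python
import math


def estimate_bits(coeffs, step):
    """Toy bit-count model (an assumption, not the patent's model):
    roughly log2(1 + 2|q|) bits per quantized coefficient q."""
    return sum(math.log2(1 + 2 * abs(round(c / step))) for c in coeffs)


def find_step(coeffs, min_bits, max_bits, lo=0.01, hi=1024.0, max_iters=40):
    """Search for a quantizer step whose estimated bit count falls in
    [min_bits, max_bits]. Bisection replaces the unspecified
    interpolation rules and relies on the toy model being monotone."""
    for _ in range(max_iters):
        step = math.sqrt(lo * hi)  # geometric midpoint of the bracket
        bits = estimate_bits(coeffs, step)
        if bits > max_bits:
            lo = step  # too many bits: quantize more coarsely
        elif bits < min_bits:
            hi = step  # too few bits: quantize more finely
        else:
            return step, bits
    # No step hit the window: fall back to the coarse end of the bracket.
    return hi, estimate_bits(coeffs, hi)


coeffs = [12.0, -7.5, 3.2, 0.4, -20.1, 9.9]
step, bits = find_step(coeffs, min_bits=10, max_bits=14)
```

In this sketch the loop looks only at bit counts, which loosely mirrors the idea of keeping the bit-count loop separate from any quality loop; a quality loop would be a second search with its own stopping test.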

    Quality and rate control strategy for digital audio
    72.
    Patent application (status: active)

    Publication number: US20070061138A1

    Publication date: 2007-03-15

    Application number: US11599686

    Filing date: 2006-11-14

    IPC classification: G10L19/00

    CPC classification: G10L19/24 G10L19/002

    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.

    Techniques for measurement of perceptual audio quality

    Publication number: US07146313B2

    Publication date: 2006-12-05

    Application number: US10017861

    Filing date: 2001-12-14

    IPC classification: G01L19/00

    CPC classification: G10L25/69

    Abstract: An audio processing tool measures the quality of reconstructed audio data. For example, an audio encoder measures the quality of a block of reconstructed frequency coefficient data in a quantization loop. The invention includes several techniques and tools, which can be used in combination or separately. First, before measuring quality, the tool normalizes the block to account for variation in block sizes. Second, for the quality measurement, the tool processes the reconstructed data by critical bands, which can differ from the quantization bands used to compress the data. Third, the tool accounts for the masking effect of the reconstructed data, not just the masking effect of the original data. Fourth, the tool band weights the quality measurement, which can be used to account for noise substitution or band truncation. Finally, the tool changes quality measurement techniques depending on the channel coding mode.
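
A band-weighted quality measure of the kind described above might look like the following Python sketch. It is an illustration under stated assumptions, not the patented method: the per-band noise-to-excitation ratio, the use of `max(original, reconstructed)` energy as a crude masking proxy, and the names `band_energies` and `weighted_noise_ratio` are all hypothetical.

```python
def band_energies(coeffs, band_edges):
    """Sum of squared coefficients in each band [edge_i, edge_{i+1})."""
    return [sum(c * c for c in coeffs[a:b])
            for a, b in zip(band_edges, band_edges[1:])]


def weighted_noise_ratio(original, reconstructed, band_edges, weights):
    """Band-weighted ratio of quantization noise to excitation.

    Assumptions: the excitation in a band is approximated by the larger
    of the original and reconstructed energies (so masking by the
    reconstructed signal is considered too), and `weights` has a
    positive sum. A weight of 0 excludes a band, for example one that
    was truncated or replaced by noise substitution."""
    noise = [o - r for o, r in zip(original, reconstructed)]
    e_noise = band_energies(noise, band_edges)
    e_orig = band_energies(original, band_edges)
    e_rec = band_energies(reconstructed, band_edges)
    total = 0.0
    for en, eo, er, w in zip(e_noise, e_orig, e_rec, weights):
        excitation = max(eo, er) + 1e-12  # guard against empty bands
        total += w * en / excitation
    return total / sum(weights)


original = [1.0, 2.0, 3.0, 4.0]
reconstructed = [1.0, 2.0, 3.0, 3.0]
score = weighted_noise_ratio(original, reconstructed, [0, 2, 4], [1.0, 1.0])
```

Lower scores mean less audible noise in this toy measure. The band edges here stand in for critical bands, which the abstract notes need not match the quantization bands.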

    Techniques for measurement of perceptual audio quality

    Publication number: US20060241942A1

    Publication date: 2006-10-26

    Application number: US11475302

    Filing date: 2006-06-26

    IPC classification: G10L19/00

    CPC classification: G10L25/69

    Abstract: An audio processing tool measures the quality of reconstructed audio data. For example, an audio encoder measures the quality of a block of reconstructed frequency coefficient data in a quantization loop. The invention includes several techniques and tools, which can be used in combination or separately. First, before measuring quality, the tool normalizes the block to account for variation in block sizes. Second, for the quality measurement, the tool processes the reconstructed data by critical bands, which can differ from the quantization bands used to compress the data. Third, the tool accounts for the masking effect of the reconstructed data, not just the masking effect of the original data. Fourth, the tool band weights the quality measurement, which can be used to account for noise substitution or band truncation. Finally, the tool changes quality measurement techniques depending on the channel coding mode.

    Digital audio processing
    75.
    Patent application (status: active)

    Publication number: US20060241796A1

    Publication date: 2006-10-26

    Application number: US11114873

    Filing date: 2005-04-25

    IPC classification: G06F17/00

    Abstract: A compressed digital audio signal is transmitted from an audio source along a connection wire to an audio receiver. The digital audio signal can encode digital audio data having different sampling frequencies, frame sizes, and other information. The audio receiver that receives the digital audio signal can decode and convert the compressed digital audio signal into multiple synchronized analog signals, which are used to drive multiple speakers. The audio receiver may also synchronize the audio data with associated video data so that the audio playback and video playback are “in sync”, despite delay introduced by the audio signal decoding at the audio receiver.

    Digital media universal elementary stream
    76.
    Patent application (status: active)

    Publication number: US20050234731A1

    Publication date: 2005-10-20

    Application number: US10966443

    Filing date: 2004-10-14

    CPC classification: G10L19/167

    Abstract: Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.
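
The frame-of-chunks arrangement can be pictured with a short Python sketch. The byte layout below (a one-byte chunk count, then a one-byte type and two-byte big-endian length before each payload) is invented purely for illustration and is not the format defined in the application; `pack_frame` and `unpack_frame` are hypothetical names.

```python
import struct


def pack_frame(chunks):
    """Serialize a frame from (chunk_type, payload_bytes) pairs.

    Hypothetical layout: 1-byte chunk count, then per chunk a 1-byte
    type, a 2-byte big-endian payload length, and the payload."""
    parts = [struct.pack(">B", len(chunks))]
    for chunk_type, payload in chunks:
        parts.append(struct.pack(">BH", chunk_type, len(payload)))
        parts.append(payload)
    return b"".join(parts)


def unpack_frame(data):
    """Parse one frame; return (chunks, number_of_bytes_consumed)."""
    (count,) = struct.unpack_from(">B", data, 0)
    offset = 1
    chunks = []
    for _ in range(count):
        chunk_type, length = struct.unpack_from(">BH", data, offset)
        offset += 3  # size of the per-chunk header
        chunks.append((chunk_type, data[offset:offset + length]))
        offset += length
    return chunks, offset


frame = pack_frame([(1, b"audio"), (2, b"")])
chunks, used = unpack_frame(frame)
```

Because every chunk carries its own type and length, a reader can skip chunk types it does not understand, which is one plausible reason for a chunked design when the same stream must fit many containers.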

    Quantization matrices for digital audio
    77.
    Patent application (status: active)

    Publication number: US20050149324A1

    Publication date: 2005-07-07

    Application number: US11061012

    Filing date: 2005-02-17

    Abstract: Quantization matrices facilitate digital audio encoding and decoding. An audio encoder generates and compresses quantization matrices; an audio decoder decompresses and applies the quantization matrices. The invention includes several techniques and tools, which can be used in combination or separately. For example, the audio encoder can generate quantization matrices from critical band patterns for blocks of audio data. The encoder can compute the quantization matrices directly from the critical band patterns, which can be computed from the same audio data that is being compressed. The audio encoder/decoder can use different modes for generating/applying quantization matrices depending on the coding channel mode of multi-channel audio data. The audio encoder/decoder can use different compression/decompression modes for the quantization matrices, including a parametric compression/decompression mode.
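
As a rough illustration of deriving a quantization matrix from a band pattern computed on the same data, the Python sketch below turns per-band mean energy into one weight per coefficient. The power-law mapping, the peak normalization, and the name `quant_matrix` are assumptions made for this example; the abstract does not give the actual formula.

```python
def quant_matrix(coeffs, band_edges, exponent=0.5):
    """Build one weight per coefficient from per-band mean energy.

    Assumption: weight = energy ** exponent, so louder bands get
    coarser steps; weights are then scaled so the largest is 1.0."""
    matrix = []
    for a, b in zip(band_edges, band_edges[1:]):
        energy = sum(c * c for c in coeffs[a:b]) / (b - a)
        weight = max(energy, 1e-9) ** exponent  # floor avoids zero weights
        matrix.extend([weight] * (b - a))
    peak = max(matrix)
    return [w / peak for w in matrix]


weights = quant_matrix([4.0, 4.0, 1.0, 1.0], band_edges=[0, 2, 4])
```

An encoder in this sketch would multiply a global step size by each weight. Only one weight per band needs to be sent to the decoder, which is a simple form of the matrix compression the abstract mentions.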

    Quality and rate control strategy for digital audio

    Publication number: US20050143993A1

    Publication date: 2005-06-30

    Application number: US11067170

    Filing date: 2005-02-24

    CPC classification: G10L19/24 G10L19/002

    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.

    Quality and rate control strategy for digital audio

    Publication number: US20050143991A1

    Publication date: 2005-06-30

    Application number: US11066898

    Filing date: 2005-02-24

    CPC classification: G10L19/24 G10L19/002

    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.

    Process to extract regions of homogeneous texture in a digital picture
    80.
    Granted patent (status: lapsed)

    Publication number: US06693962B1

    Publication date: 2004-02-17

    Application number: US09495189

    Filing date: 2000-02-01

    IPC classification: H04N7/18

    Abstract: A method of extracting regions of homogeneous texture in a digital picture divides the digital picture into blocks, and for each block generates a feature vector as a function of the data moments. From the feature vectors a gradient for each block is extracted in one of two ways, either using a weighted Euclidean distance between the feature vectors or a probability mass function-based distance metric. The gradients are submitted to morphological preprocessing to remove small bumps in the gradient field. A watershed algorithm is then applied to the preprocessed gradient field to segment the gradient field into a set of spatially connected regions of homogeneous texture.
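
The first two stages of this pipeline (block feature vectors from moments, then a gradient from weighted Euclidean distances) can be sketched in plain Python. This is a simplified illustration, assuming mean and variance as the moments and a 4-neighbour maximum as the gradient; the morphological preprocessing and watershed stages are omitted, and `block_features` and `block_gradient` are hypothetical names.

```python
import math


def block_features(image, block):
    """Return a grid of (mean, variance) tuples, one per block.

    `image` is a list of equal-length rows of numbers; edge pixels that
    do not fill a whole block are ignored."""
    rows, cols = len(image) // block, len(image[0]) // block
    feats = []
    for r in range(rows):
        row = []
        for c in range(cols):
            px = [image[y][x]
                  for y in range(r * block, (r + 1) * block)
                  for x in range(c * block, (c + 1) * block)]
            mean = sum(px) / len(px)
            var = sum((p - mean) ** 2 for p in px) / len(px)
            row.append((mean, var))
        feats.append(row)
    return feats


def block_gradient(feats, weights=(1.0, 1.0)):
    """Gradient per block: largest weighted Euclidean distance between
    its feature vector and those of its 4-connected neighbours."""
    rows, cols = len(feats), len(feats[0])
    grad = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            for dr, dc in ((0, 1), (1, 0), (0, -1), (-1, 0)):
                nr, nc = r + dr, c + dc
                if 0 <= nr < rows and 0 <= nc < cols:
                    dist = math.sqrt(sum(
                        w * (a - b) ** 2
                        for w, a, b in zip(weights, feats[r][c], feats[nr][nc])))
                    grad[r][c] = max(grad[r][c], dist)
    return grad


image = [[0, 0, 10, 10]] * 4  # flat left half, flat right half
grad = block_gradient(block_features(image, block=2))
```

High gradient values mark blocks that sit on a texture boundary; a watershed step would then treat the gradient field as a height map and flood it from its minima to form connected regions.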
