Filter bank optimization for acoustic echo cancellation
    1.
    Granted invention patent
    Status: in force

    Publication No.: US08942370B2

    Publication date: 2015-01-27

    Application No.: US13411865

    Filing date: 2012-03-05

    IPC classification: H04M9/08

    CPC classification: H04M9/082

    Abstract: A communications device is presented for providing bidirectional audio communications between a near-end user and a far-end user via a bidirectional communications channel. The communications device includes an adaptive echo canceller receiving a near-end audio signal and a far-end audio signal and providing an echo-canceled near-end audio signal for transmission to the far-end user via the communications channel. The adaptive echo canceller includes a first bank of analysis filters for filtering the near-end audio signal, a second bank of analysis filters for filtering the far-end audio signal, and a bank of synthesis filters for filtering sub-band echo-canceled signals generated within the adaptive echo canceller. The first and second filter banks have a frequency response optimized to reduce echo residual gain.

    Spatialized audio over headphones
    2.
    Granted invention patent
    Status: in force

    Publication No.: US08737648B2

    Publication date: 2014-05-27

    Application No.: US12472080

    Filing date: 2009-05-26

    IPC classification: H04R5/02

    CPC classification: H04R27/00

    Abstract: A spatial element is added to communications, including telephone conference calls heard through headphones or a stereo speaker setup. Functions are created to modify signals from different callers to create the illusion that the callers are speaking from different parts of the room.
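
The abstract above does not say which functions modify each caller's signal. As a rough illustration only, the sketch below substitutes plain constant-power stereo panning, a much simpler technique than whatever the patent claims. The names `pan_gains` and `mix_callers` and the azimuth convention (-90 degrees is hard left, +90 degrees is hard right) are assumptions made for this example.

```python
import math
from typing import List, Sequence, Tuple


def pan_gains(azimuth_deg: float) -> Tuple[float, float]:
    """Return (left, right) gains for a constant-power pan.

    Assumed convention: -90 is hard left, 0 is centre, +90 is hard right.
    """
    clamped = max(-90.0, min(90.0, azimuth_deg))
    # Map the azimuth onto a quarter circle so left^2 + right^2 == 1.
    theta = (clamped + 90.0) / 180.0 * (math.pi / 2.0)
    return math.cos(theta), math.sin(theta)


def mix_callers(
    callers: Sequence[Tuple[Sequence[float], float]]
) -> Tuple[List[float], List[float]]:
    """Mix (samples, azimuth_deg) pairs into one stereo (left, right) signal."""
    n = max((len(samples) for samples, _ in callers), default=0)
    left = [0.0] * n
    right = [0.0] * n
    for samples, azimuth_deg in callers:
        g_left, g_right = pan_gains(azimuth_deg)
        for i, s in enumerate(samples):
            left[i] += g_left * s
            right[i] += g_right * s
    return left, right
```

Giving each caller a distinct azimuth places the voices at different points between the two ears. Panning alone gives only a left-to-right impression; it does not model room acoustics or head-related filtering.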

    Entropy encoding and decoding using direct level and run-length/level context-adaptive arithmetic coding/decoding modes
    3.
    Granted invention patent
    Status: in force

    Publication No.: US08712783B2

    Publication date: 2014-04-29

    Application No.: US13306761

    Filing date: 2011-11-29

    IPC classification: G10L19/02 H03M7/46 H03M7/48

    Abstract: An encoder performs context-adaptive arithmetic encoding of transform coefficient data. For example, an encoder switches between coding of direct levels of quantized transform coefficient data and run-level coding of run lengths and levels of quantized transform coefficient data. The encoder can determine when to switch between coding modes based on a pre-determined switch point or by counting consecutive coefficients having a predominant value (e.g., zero). A decoder performs corresponding context-adaptive arithmetic decoding.
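
The mode switch described in the abstract above can be illustrated with a small sketch. It covers only the symbol-forming step: coefficients are emitted as direct levels until a run of consecutive zeros is counted, after which (run, level) pairs are emitted. The context-adaptive arithmetic coder itself is omitted. The function name `symbolize`, the tuple layout, the default threshold of 3, and the use of level 0 to mark trailing zeros are all assumptions made for this example, not details from the patent.

```python
from typing import List, Tuple, Union

Symbol = Union[Tuple[str, int], Tuple[str, int, int]]


def symbolize(coeffs: List[int], zero_run_switch: int = 3) -> List[Symbol]:
    """Form symbols in direct-level mode, then switch to run-level mode.

    The switch happens once `zero_run_switch` consecutive zeros have been
    seen (a hypothetical threshold chosen for this sketch).
    """
    symbols: List[Symbol] = []
    zeros_seen = 0
    i = 0
    # Direct-level mode: emit every coefficient value as-is.
    while i < len(coeffs) and zeros_seen < zero_run_switch:
        v = coeffs[i]
        symbols.append(("level", v))
        zeros_seen = zeros_seen + 1 if v == 0 else 0
        i += 1
    # Run-level mode: emit (zero-run length, next non-zero level) pairs.
    run = 0
    for v in coeffs[i:]:
        if v == 0:
            run += 1
        else:
            symbols.append(("run_level", run, v))
            run = 0
    if run:
        # Trailing zeros: level 0 marks "no further non-zero coefficient".
        symbols.append(("run_level", run, 0))
    return symbols
```

For the input `[5, 3, 0, 2, 0, 0, 0, 0, 1, 0, 0]` the first seven values are emitted as direct levels, and the remaining four become the pairs `(1, 1)` and `(2, 0)`. A real codec would pass these symbols to an entropy coder with separate contexts for each mode.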

    Audio transcoder using encoder-generated side information to transcode to target bit-rate
    4.
    Granted invention patent
    Status: in force

    Publication No.: US08457958B2

    Publication date: 2013-06-04

    Application No.: US11938194

    Filing date: 2007-11-09

    IPC classification: G10L19/02 H04B1/66

    CPC classification: G10L19/173

    Abstract: An audio encoder encodes side information into a compressed audio bitstream containing encoding parameters used by the encoder for one or more encoding techniques, such as a noise-mask-ratio curve used for rate control. A transcoder uses the encoder-generated side information to transcode the audio from the original compressed bitstream having an initial bit-rate into a second bitstream having a new bit-rate. Because the side information is derived from the original audio, the transcoder is able to better maintain audio quality of the transcoding. The side information also allows the transcoder to re-encode from an intermediate decoding/encoding stage for faster, lower-complexity transcoding.

    Multi-channel audio encoding and decoding

    Publication No.: US08386269B2

    Publication date: 2013-02-26

    Application No.: US13327138

    Filing date: 2011-12-15

    IPC classification: G10L19/00 G10L21/00 G10L21/04

    Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.

    VIRTUAL AUDIO ENVIRONMENT FOR MULTIDIMENSIONAL CONFERENCING
    6.
    Invention patent application
    Status: in force

    Publication No.: US20120155680A1

    Publication date: 2012-06-21

    Application No.: US12970964

    Filing date: 2010-12-17

    IPC classification: H04R5/02

    Abstract: The disclosed architecture employs signal processing techniques to provide audio perception only, or audio perception that matches the visual perception. This also provides spatial audio reproduction for multiparty teleconferencing such that the teleconferencing participants perceive themselves as if they were sitting in the same room. The solution is based on the premise that people perceive sounds as a reconstructed wavefront, and hence, the wavefronts are used to provide the spatial perceptual cues. The differences between the spatial perceptual cues derived from the reconstructed wavefront of sound waves and the ideal wavefront of sound waves form an objective metric for spatial perceptual quality, and provide the means of evaluating the overall system performance. Additionally, compensation filters are employed to improve the spatial perceptual quality of stereophonic systems by optimizing the objective metrics.

    Complex cross-correlation parameters for multi-channel audio
    7.
    Granted invention patent
    Status: in force

    Publication No.: US08190425B2

    Publication date: 2012-05-29

    Application No.: US11336403

    Filing date: 2006-01-20

    IPC classification: G10L19/02

    CPC classification: H04S3/008

    Abstract: An audio encoder encodes a combined channel (e.g., a sum channel) for a group of plural physical audio channels. The encoder determines plural parameters for representing individual physical channels of the group as modified versions of the encoded combined channel. The plural parameters comprise ratios of power in each individual channel to power in the combined channel (e.g., a ratio of the power of a right channel to the power of the combined channel, and a ratio of the power of the left channel to the power of the combined channel). The plural parameters can include a complex parameter. The combined channel and the plural parameters facilitate reconstruction at the audio decoder of source channels. An audio decoder performs a forward complex transform on the multi-channel audio data and reconstructs plural channels from the multi-channel audio data. The decoder can maintain second-order statistics for the source channels.
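
The power-ratio parameters in the abstract above can be illustrated with a minimal sketch. It works on broadband, real-valued, time-domain samples and ignores the complex cross-correlation parameter and the complex transform that the abstract also mentions. The function names `mean_power`, `encode_pair` and `reconstruct` are hypothetical, and scaling the sum channel by the square root of the ratio is an assumption about how such a ratio could be applied, not the patent's stated method.

```python
import math
from typing import List, Sequence, Tuple


def mean_power(x: Sequence[float]) -> float:
    """Average power (mean of squared samples); 0.0 for an empty signal."""
    return sum(s * s for s in x) / len(x) if x else 0.0


def encode_pair(
    left: Sequence[float], right: Sequence[float]
) -> Tuple[List[float], float, float]:
    """Return the sum channel plus left and right power ratios."""
    combined = [l + r for l, r in zip(left, right)]
    p_combined = mean_power(combined)
    if p_combined == 0.0:
        # Silent sum channel: no meaningful ratio can be formed.
        return combined, 0.0, 0.0
    return (
        combined,
        mean_power(left) / p_combined,
        mean_power(right) / p_combined,
    )


def reconstruct(combined: Sequence[float], power_ratio: float) -> List[float]:
    """Scale the sum channel so its power matches the source channel's."""
    gain = math.sqrt(power_ratio)
    return [gain * s for s in combined]
```

Scaling by the square root of the ratio restores each channel's power, which is one second-order statistic. It does not restore the correlation between channels, which is presumably the gap the additional complex parameter is meant to address.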

    DIGITAL MEDIA UNIVERSAL ELEMENTARY STREAM
    8.
    Invention patent application
    Status: in force

    Publication No.: US20120130721A1

    Publication date: 2012-05-24

    Application No.: US13360577

    Filing date: 2012-01-27

    IPC classification: G10L19/00

    CPC classification: G10L19/167

    Abstract: Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.

    Filter bank optimization for acoustic echo cancellation
    9.
    Granted invention patent
    Status: in force

    Publication No.: US08155304B2

    Publication date: 2012-04-10

    Application No.: US11784893

    Filing date: 2007-04-10

    IPC classification: H04M9/08

    CPC classification: H04M9/082

    Abstract: A communications device is presented for providing bidirectional audio communications between a near-end user and a far-end user via a bidirectional communications channel. The communications device includes an adaptive echo canceller receiving a near-end audio signal and a far-end audio signal and providing an echo-canceled near-end audio signal for transmission to the far-end user via the communications channel. The adaptive echo canceller includes a first bank of analysis filters for filtering the near-end audio signal, a second bank of analysis filters for filtering the far-end audio signal, and a bank of synthesis filters for filtering sub-band echo-canceled signals generated within the adaptive echo canceller. The first and second filter banks have a frequency response optimized to reduce echo residual gain.

    MULTI-CHANNEL AUDIO ENCODING AND DECODING

    Publication No.: US20120082316A1

    Publication date: 2012-04-05

    Application No.: US13326315

    Filing date: 2011-12-14

    IPC classification: H04R5/00

    Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.