-
公开(公告)号:US20150348546A1
公开(公告)日:2015-12-03
申请号:US14650214
申请日:2013-11-27
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Xuejing SUN , Shen HUANG , Poppy CRUM , Hannes MUESCH , Glenn N. DICKINS , Michael ECKERT
IPC: G10L15/20
Abstract: An audio processing apparatus and an audio processing method are described. In one embodiment, the audio processing apparatus include an audio masker separator for separating from a first audio signal an audio material comprising a sound other than stationary noise and utterance meaningful in semantics, as an audio masker candidate. The apparatus also includes a first context analyzer for obtaining statistics regarding contextual information of detected audio masker candidates, and a masker library builder for building a masker library or updating an existing masker library by adding, based on the statistics, at least one audio masker candidate as an audio masker into the masker library, wherein audio maskers in the maker library are used to be inserted into a target position in a second audio signal to conceal defects in the second audio signal.
Abstract translation: 描述音频处理装置和音频处理方法。 在一个实施例中,音频处理设备包括一个音频掩蔽器分离器,用于将音频材料与第一音频信号分离,该音频材料包括除了固定噪声之外的声音以及在语义上有意义的话语作为音频掩蔽者候选者。 该装置还包括用于获得关于检测到的音频掩蔽者候选者的上下文信息的统计信息的第一上下文分析器,以及用于构建掩蔽程序库或通过基于统计信息添加至少一个音频掩码选择器来构建掩蔽程序库或更新现有掩蔽程序库的掩码程序库构建器 作为音频掩蔽器进入掩蔽器库,其中制造商库中的音频掩蔽器被用于插入第二音频信号中的目标位置以隐藏第二音频信号中的缺陷。
-
公开(公告)号:US20240331708A1
公开(公告)日:2024-10-03
申请号:US18658853
申请日:2024-05-08
Inventor: Stefan BRUHN , Michael ECKERT , Juan Felix TORRES , Stefanie BROWN , David S. MCGRATH
IPC: G10L19/008 , H04S3/00
CPC classification number: G10L19/008 , H04S3/008 , H04S2400/01
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
-
公开(公告)号:US20240105192A1
公开(公告)日:2024-03-28
申请号:US18255506
申请日:2021-12-01
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Rishabh TYAGI , Michael ECKERT
IPC: G10L19/03 , G10L19/008 , G10L21/0216
CPC classification number: G10L19/03 , G10L19/008 , G10L21/0216
Abstract: Embodiments are disclosed for spatial noise filling in multi-channel codecs. In an embodiment, a method of regenerating background noise ambience in a multi-channel codec by generating spatial hole filling noise comprises: computing noise estimates based on a primary downmix channel generated from an input audio signal representing a spatial audio scene with background noise ambience; computing spectral shaping filter coefficients based on the noise estimates; spectrally shaping the multi-channel noise signal using the spectral shaping filter coefficients and a noise distribution, the spectral shaping resulting in a diffused, multi-channel noise signal with uncorrelated channels; spatially shaping the diffused, uncorrelated multi-channel noise signal with uncorrelated channels based on a noise ambience of the spatial audio scene; and adding the spatially and spectrally shaped multi-channel noise to a multi-channel codec output to synthesize the background noise ambience of the spatial audio scene.
-
公开(公告)号:US20220215847A1
公开(公告)日:2022-07-07
申请号:US17702698
申请日:2022-03-23
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Shen HUANG , Michael ECKERT , Glenn N. DICKINS
IPC: G10L19/005 , G10L19/008 , G10L19/02 , H04L1/00 , H04S3/00
Abstract: Systems and methods for providing forward error correction for a multi-channel audio signal are described. Blocks of an audio stream are buffered into a frame. A transformation can be applied that compacts the energy of each block into a plurality of transformed channels. The energy compaction transform may compact the most energy of a block into the first transformed channel and to compact decreasing amounts of energy into each subsequent transformed channel. The transformed frame may be encoded using any suitable codec and transmitted in a packet over a network. Improved forward error correction may be provided by attaching a low bit rate encoding of the first transformed channel to a subsequent packet. To reconstruct a lost packet, the low bit rate encoding of the first channel for the lost packet may be combined with a packet loss concealment version of the other channels, constructed from a previously-received packet.
-
公开(公告)号:US20210166708A1
公开(公告)日:2021-06-03
申请号:US17251913
申请日:2019-07-02
Inventor: David S. MCGRATH , Michael ECKERT , Heiko PURNHAGEN , Stefan BRUHN
IPC: G10L19/16 , G10L19/008 , G10L19/18
Abstract: The present document describes a method (700) for encoding a multi-channel input signal (201). The method (700) comprises determining (701) a plurality of downmix channel signals (203) from the multi-channel input signal (201) and performing (702) energy compaction of the plurality of downmix channel signals (203) to provide a plurality of compacted channel signals (404). Furthermore, the method (700) comprises determining (703) joint coding metadata (205) based on the plurality of compacted channel signals (404) and based on the multi-channel input signal (201), wherein the joint coding metadata (205) is such that it allows upmixing of the plurality of compacted channel signals (404) to an approximation of the multi-channel input signal (201). In addition, the method (700) comprises encoding (704) the plurality of compacted channel signals (404) and the joint coding metadata (205).
-
公开(公告)号:US20200342883A1
公开(公告)日:2020-10-29
申请号:US16928918
申请日:2020-07-14
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Shen HUANG , Michael ECKERT , Glenn N. DICKINS
IPC: G10L19/005 , G10L19/008 , H04S3/00 , H04L1/00 , G10L19/02
Abstract: Systems and methods for providing forward error correction for a multi-channel audio signal are described. Blocks of an audio stream are buffered into a frame. A transformation can be applied that compacts the energy of each block into a plurality of transformed channels. The energy compaction transform may compact the most energy of a block into the first transformed channel and to compact decreasing amounts of energy into each subsequent transformed channel. The transformed frame may be encoded using any suitable codec and transmitted in a packet over a network. Improved forward error correction may be provided by attaching a low bit rate encoding of the first transformed channel to a subsequent packet. To reconstruct a lost packet, the low bit rate encoding of the first channel for the lost packet may be combined with a packet loss concealment version of the other channels, constructed from a previously-received packet.
-
公开(公告)号:US20180279063A1
公开(公告)日:2018-09-27
申请号:US15547441
申请日:2016-02-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Xuejing SUN , Richard J. CARTWRIGHT , Michael P. HOLLIER , Michael ECKERT
IPC: H04S7/00 , G10L21/043
CPC classification number: H04S7/302 , G10L21/043 , H04L12/1831 , H04M3/42221 , H04M3/565 , H04M3/568 , H04M2203/305 , H04R27/00 , H04R2227/003 , H04S7/30 , H04S2400/11 , H04S2420/01
Abstract: A method for processing audio data, the method comprising: receiving audio data corresponding to a plurality of instances of audio, including at least one of: (a) audio data from multiple endpoints, recorded separately or (b) audio data from a single endpoint corresponding to multiple talkers and including spatial information for each of the multiple talkers; rendering the audio data in a virtual acoustic space such that each of the instances of audio has a respective different virtual position in the virtual acoustic space; and scheduling the instances of audio to be played back with a playback overlap between at least two of the instances of audio, wherein the scheduling is performed, at least in part, according to a set of perceptually-motivated rules.
-
-
-
-
-
-