-
公开(公告)号:US20210104254A1
公开(公告)日:2021-04-08
申请号:US17075659
申请日:2020-10-20
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Dong SHI , Kai LI , Hannes MUESCH , David GUNAWAN , Paul HOLMBERG , Glenn N. DICKINS
IPC: G10L21/0232 , H04R3/02 , G10L21/0264 , H04R3/04
Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.
-
公开(公告)号:US20190342521A1
公开(公告)日:2019-11-07
申请号:US16518887
申请日:2019-07-22
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Erwin GOESNAR , Hannes MUESCH , David GUNAWAN , Michael ECKERT , Glenn N. DICKINS
Abstract: Systems and methods are described for determining orientation of an external audio device in a video conference, which may be used to provide congruent multimodal representation for a video conference. A camera of a video conferencing system may be used to detect a potential location of an external audio device within a room in which the video conferencing system is providing a video conference. Within the detected potential location, a visual pattern associated with the external audio device may be identified. Using the identified visual pattern, the video conferencing system may estimate an orientation of the external audio device, the orientation being used by the video conferencing system to provide spatial audio video congruence to a far end audience.
-
公开(公告)号:US20190237086A1
公开(公告)日:2019-08-01
申请号:US16228690
申请日:2018-12-20
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Shen HUANG , Michael ECKERT , Glenn N. DICKINS
IPC: G10L19/005 , G10L19/008 , H04S3/00 , G10L19/02 , H04L1/00
CPC classification number: G10L19/005 , G10L19/008 , G10L19/0212 , H04L1/0011 , H04L1/0041 , H04S3/008 , H04S2400/01
Abstract: Systems and methods for providing forward error correction for a multi-channel audio signal are described. Blocks of an audio stream are buffered into a frame. A transformation can be applied that compacts the energy of each block into a plurality of transformed channels. The energy compaction transform may compact the most energy of a block into the first transformed channel and to compact decreasing amounts of energy into each subsequent transformed channel. The transformed frame may be encoded using any suitable codec and transmitted in a packet over a network. Improved forward error correction may be provided by attaching a low bit rate encoding of the first transformed channel to a subsequent packet. To reconstruct a lost packet, the low bit rate encoding of the first channel for the lost packet may be combined with a packet loss concealment version of the other channels, constructed from a previously-received packet.
-
公开(公告)号:US20180374496A1
公开(公告)日:2018-12-27
申请号:US16063225
申请日:2016-12-15
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Dong SHI , David GUNAWAN , Glenn N. DICKINS
IPC: G10L21/0264 , A61B7/00 , G10L21/0232 , G10L25/21 , G10L25/51 , G10L21/0324 , H04M3/56
Abstract: Example embodiments disclosed herein relate to audio signal processing. A method of processing an audio signal is disclosed. The method includes detecting, based on a power distribution of the audio signal, a type of content of a frame of the audio signal, generating a first gain based on a sound level of the frame for adjusting the sound level, processing the audio signal by applying the first gain to the frame; and in response to the type of content being detected to be a breath sound, generating a second gain for mitigating the breath sound and processing the audio signal by applying the second gain to the frame. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US20180301157A1
公开(公告)日:2018-10-18
申请号:US15569555
申请日:2016-04-27
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David GUNAWAN , Dong SHI , Glenn N. DICKINS
IPC: G10L21/0208 , H04R1/32 , G10L21/034
CPC classification number: G10L21/0208 , G10L21/034 , G10L25/48 , H04R1/326 , H04R2410/03
Abstract: Example embodiments disclosed herein relate to impulsive noise suppression. A method of impulsive noise suppression in an audio signal is disclosed. The method includes determining an impulsive noise related feature from a current frame of the audio signal. The method also includes detecting an impulsive noise in the current frame based on the impulsive noise related feature, and in response to detecting the impulsive noise in the current frame, applying a suppression gain to the current frame to suppress the impulsive noise. Corresponding system and computer program product of impulsive noise suppression in an audio signal are also disclosed.
-
公开(公告)号:US20180014139A1
公开(公告)日:2018-01-11
申请号:US15547043
申请日:2016-02-02
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Glenn N. DICKINS , Richard J. CARTWRIGHT
CPC classification number: H04S7/303 , H04R3/005 , H04R5/027 , H04S7/00 , H04S2400/11 , H04S2400/15
Abstract: Described herein is a method for creating an object-based audio signal from an audio input, the audio input including one or more audio channels that are recorded to collectively define an audio scene. The one or more audio channels are captured from a respective one or more spatially separated microphones disposed in a stable spatial configuration. The method includes the steps of: a) receiving the audio input; b) performing spatial analysis on the one or more audio channels to identify one or more audio objects within the audio scene; c) determining contextual information relating to the one or more audio objects; d) defining respective audio streams including audio data relating to at least one of the identified one or more audio objects; and e) outputting an object-based audio signal including the audio streams and the contextual information.
-
17.
公开(公告)号:US20180006837A1
公开(公告)日:2018-01-04
申请号:US15546925
申请日:2016-02-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , Glenn N. DICKINS
Abstract: Some aspects of the present disclosure involve the recording, processing and playback of audio data corresponding to conferences, such as teleconferences. In some teleconference implementations, the audio experience heard when a recording of the conference is played back may be substantially different from the audio experience of an individual conference participant during the original teleconference. In some implementations, the recorded audio data may include at least some audio data that was not available during the teleconference. In some examples, the spatial characteristics of the played-back audio data may be different from that of the audio heard by participants of the teleconference.
-
公开(公告)号:US20160027447A1
公开(公告)日:2016-01-28
申请号:US14774966
申请日:2014-03-04
Inventor: Glenn N. DICKINS , Xuejing SUN , Yen-Liang SHUE , Heiko PURNHAGEN
IPC: G10L19/012 , G10L19/008 , G10L21/0208 , H04M3/56
CPC classification number: G10L19/012 , G10L19/008 , G10L21/0208 , H04L12/1813 , H04L12/1827 , H04M3/568
Abstract: A method, an apparatus, logic (e.g., executable instructions encoded in a non-transitory computer-readable medium to carry out a method), and a non-transitory computer-readable medium configured with such instructions. The method is to generate and spatially render spatial comfort noise at a receiving endpoint of a conference system, such that the comfort noise has target spectral characteristics typical of comfort noise, and at least one spatial property that at least substantially matches at least one target spatial property. On version includes receiving one or more or more audio signals from other endpoints, combining the received audio signals with the spatial comfort noise signals, and rendering the combination of the received audio signals and the spatial comfort noise signals to a set of output signals for loudspeakers, such that the spatial comfort noise signals are continually in the output signal sin addition to output from the received audio signals.
Abstract translation: 一种方法,装置,逻辑(例如,在非暂时性计算机可读介质中编码以执行方法的可执行指令)以及配置有这种指令的非暂时计算机可读介质。 该方法是在会议系统的接收端产生和空间地呈现空间舒适噪声,使得舒适噪声具有典型的舒适噪声的目标频谱特性,以及至少基本匹配至少一个目标空间 属性。 On版本包括从其他端点接收一个或多个或更多个音频信号,将接收到的音频信号与空间舒适噪声信号组合,以及将接收到的音频信号和空间舒适噪声信号的组合呈现给用于扬声器的一组输出信号 ,使得空间舒适噪声信号连续地在输出信号sin中,从接收到的音频信号输出。
-
公开(公告)号:US20150348546A1
公开(公告)日:2015-12-03
申请号:US14650214
申请日:2013-11-27
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Xuejing SUN , Shen HUANG , Poppy CRUM , Hannes MUESCH , Glenn N. DICKINS , Michael ECKERT
IPC: G10L15/20
Abstract: An audio processing apparatus and an audio processing method are described. In one embodiment, the audio processing apparatus include an audio masker separator for separating from a first audio signal an audio material comprising a sound other than stationary noise and utterance meaningful in semantics, as an audio masker candidate. The apparatus also includes a first context analyzer for obtaining statistics regarding contextual information of detected audio masker candidates, and a masker library builder for building a masker library or updating an existing masker library by adding, based on the statistics, at least one audio masker candidate as an audio masker into the masker library, wherein audio maskers in the maker library are used to be inserted into a target position in a second audio signal to conceal defects in the second audio signal.
Abstract translation: 描述音频处理装置和音频处理方法。 在一个实施例中,音频处理设备包括一个音频掩蔽器分离器,用于将音频材料与第一音频信号分离,该音频材料包括除了固定噪声之外的声音以及在语义上有意义的话语作为音频掩蔽者候选者。 该装置还包括用于获得关于检测到的音频掩蔽者候选者的上下文信息的统计信息的第一上下文分析器,以及用于构建掩蔽程序库或通过基于统计信息添加至少一个音频掩码选择器来构建掩蔽程序库或更新现有掩蔽程序库的掩码程序库构建器 作为音频掩蔽器进入掩蔽器库,其中制造商库中的音频掩蔽器被用于插入第二音频信号中的目标位置以隐藏第二音频信号中的缺陷。
-
公开(公告)号:US20250104728A1
公开(公告)日:2025-03-27
申请号:US18906046
申请日:2024-10-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Xuejing SUN , Glenn N. DICKINS
IPC: G10L21/0364 , G10K11/16 , G10L21/0224 , G10L21/0316 , G10L21/034 , G10L25/78 , H03G3/30 , H03G3/32
Abstract: A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.
-
-
-
-
-
-
-
-
-