-
公开(公告)号:US12046247B2
公开(公告)日:2024-07-23
申请号:US17702698
申请日:2022-03-23
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Shen Huang , Michael Eckert , Glenn N. Dickins
IPC: G10L19/005 , G10L19/008 , G10L19/02 , H04L1/00 , H04S3/00
CPC classification number: G10L19/005 , G10L19/008 , G10L19/0212 , H04L1/0011 , H04L1/0041 , H04S3/008 , H04S2400/01
Abstract: Systems and methods for providing forward error correction for a multi-channel audio signal are described. Blocks of an audio stream are buffered into a frame. A transformation can be applied that compacts the energy of each block into a plurality of transformed channels. The energy compaction transform may compact the most energy of a block into the first transformed channel and to compact decreasing amounts of energy into each subsequent transformed channel. The transformed frame may be encoded using any suitable codec and transmitted in a packet over a network. Improved forward error correction may be provided by attaching a low bit rate encoding of the first transformed channel to a subsequent packet. To reconstruct a lost packet, the low bit rate encoding of the first channel for the lost packet may be combined with a packet loss concealment version of the other channels, constructed from a previously-received packet.
-
公开(公告)号:US11803351B2
公开(公告)日:2023-10-31
申请号:US17601199
申请日:2020-04-03
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Glenn N. Dickins , Feng Deng , Michael Eckert , Craig Johnston , Paul Holmberg
IPC: G06F3/16 , H04L65/403 , H04M3/56
CPC classification number: G06F3/165 , H04L65/403 , H04M3/568
Abstract: A communication system, method, and computer-readable medium therefor comprise a media server configured to receive a plurality of audio streams from a corresponding plurality of client devices, the media server including circuitry configured to rank the plurality of audio streams based on a predetermined metric, group a first portion of the plurality of audio streams into a first set, the first portion of the plurality of audio streams being the N highest-ranked audio streams, group a second portion of the plurality of audio streams into a second set, the second portion of the plurality of audio streams being the M lowest-ranked audio streams, forward respective audio streams of the first set to a receiver device, and discard respective audio streams of the second set, wherein N and M are independent integers.
-
公开(公告)号:US11699451B2
公开(公告)日:2023-07-11
申请号:US17251913
申请日:2019-07-02
Inventor: David S. McGrath , Michael Eckert , Heiko Purnhagen , Stefan Bruhn
IPC: G10L19/16 , G10L19/008 , G10L19/18 , H04S3/00
CPC classification number: G10L19/167 , G10L19/008 , G10L19/18
Abstract: The present document describes a method (700) for encoding a multi-channel input signal (201). The method (700) comprises determining (701) a plurality of downmix channel signals (203) from the multi-channel input signal (201) and performing (702) energy compaction of the plurality of downmix channel signals (203) to provide a plurality of compacted channel signals (404). Furthermore, the method (700) comprises determining (703) joint coding metadata (205) based on the plurality of compacted channel signals (404) and based on the multi-channel input signal (201), wherein the joint coding metadata (205) is such that it allows upmixing of the plurality of compacted channel signals (404) to an approximation of the multi-channel input signal (201). In addition, the method (700) comprises encoding (704) the plurality of compacted channel signals (404) and the joint coding metadata (205).
-
4.
公开(公告)号:US20150078594A1
公开(公告)日:2015-03-19
申请号:US14385083
申请日:2013-03-21
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: David S. Mcgrath , Glenn N. Dickins , Paul Holmberg , Gary Spittle , Michael Eckert
CPC classification number: H04S3/002 , H04M3/568 , H04S7/30 , H04S2400/11 , H04S2400/15 , H04S2420/01
Abstract: A method of outputting audio in a teleconferencing environment includes receiving audio streams, processing the audio streams according to information regarding effective spatial positions, and outputting, by at least three speakers arranged in more than one dimension, the audio streams having been processed. The information regarding the plurality of effective spatial positions corresponds to a perceived spatial scene that extends beyond the speakers in at least two dimensions. In this manner, participants in the teleconference perceive the audio from the remote participants as originating at different positions in the teleconference room.
Abstract translation: 在电话会议环境中输出音频的方法包括接收音频流,根据关于有效空间位置的信息来处理音频流,并且通过至少三个扬声器以多于一维的方式输出已经处理的音频流。 关于多个有效空间位置的信息对应于在至少两个维度上延伸超出扬声器的感知空间场景。 以这种方式,电话会议中的参与者将来自远程参与者的音频感知为来自电话会议室中的不同位置。
-
公开(公告)号:US11289103B2
公开(公告)日:2022-03-29
申请号:US16928918
申请日:2020-07-14
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Shen Huang , Michael Eckert , Glenn N. Dickins
IPC: G10L19/005 , G10L19/008 , H04S3/00 , H04L1/00 , G10L19/02
Abstract: Systems and methods for providing forward error correction for a multi-channel audio signal are described. Blocks of an audio stream are buffered into a frame. A transformation can be applied that compacts the energy of each block into a plurality of transformed channels. The energy compaction transform may compact the most energy of a block into the first transformed channel and to compact decreasing amounts of energy into each subsequent transformed channel. The transformed frame may be encoded using any suitable codec and transmitted in a packet over a network. Improved forward error correction may be provided by attaching a low bit rate encoding of the first transformed channel to a subsequent packet. To reconstruct a lost packet, the low bit rate encoding of the first channel for the lost packet may be combined with a packet loss concealment version of the other channels, constructed from a previously-received packet.
-
公开(公告)号:US10051400B2
公开(公告)日:2018-08-14
申请号:US14385083
申请日:2013-03-21
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: David S. McGrath , Glenn N. Dickins , Paul Holmberg , Gary Spittle , Michael Eckert
CPC classification number: H04S3/002 , H04M3/568 , H04S7/30 , H04S2400/11 , H04S2400/15 , H04S2420/01
Abstract: A method of outputting audio in a teleconferencing environment includes receiving audio streams, processing the audio streams according to information regarding effective spatial positions, and outputting, by at least three speakers arranged in more than one dimension, the audio streams having been processed. The information regarding the plurality of effective spatial positions corresponds to a perceived spatial scene that extends beyond the speakers in at least two dimensions. In this manner, participants in the teleconference perceive the audio from the remote participants as originating at different positions in the teleconference room.
-
公开(公告)号:US10015443B2
公开(公告)日:2018-07-03
申请号:US15527272
申请日:2015-11-18
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Xuejing Sun , Michael Eckert
CPC classification number: H04N7/147 , H04S7/30 , H04S2420/01 , H04S2420/11
Abstract: Example embodiments disclosed herein relate to spatial congruency adjustment. A method for adjusting spatial congruency in a video conference is disclosed. The method in unwarping a visual scene captured by a video endpoint device into at least one rectilinear scene, the video endpoint device being configured to capture the visual scene in an omnidirectional manner, detecting spatial congruency between the at least one rectilinear scene and an auditory scene captured by an audio endpoint device that is positioned in relation to the video endpoint device. The spatial congruency being a degree of alignment between the auditory scene and the at least one rectilinear scene and in response to the detected spatial congruency being below the threshold, adjusting the spatial congruency. Corresponding system and computer program products are also disclosed.
-
公开(公告)号:US09558744B2
公开(公告)日:2017-01-31
申请号:US14650214
申请日:2013-11-27
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Xuejing Sun , Shen Huang , Poppy Crum , Hannes Muesch , Glenn N. Dickins , Michael Eckert
Abstract: An audio processing apparatus and an audio processing method are described. In one embodiment, the audio processing apparatus include an audio masker separator for separating from a first audio signal an audio material comprising a sound other than stationary noise and utterance meaningful in semantics, as an audio masker candidate. The apparatus also includes a first context analyzer for obtaining statistics regarding contextual information of detected audio masker candidates, and a masker library builder for building a masker library or updating an existing masker library by adding, based on the statistics, at least one audio masker candidate as an audio masker into the masker library, wherein audio maskers in the maker library are used to be inserted into a target position in a second audio signal to conceal defects in the second audio signal.
Abstract translation: 描述音频处理装置和音频处理方法。 在一个实施例中,音频处理设备包括一个音频掩蔽器分离器,用于将音频材料与第一音频信号分离,该音频材料包括除了固定噪声之外的声音以及在语义上有意义的话语作为音频掩蔽者候选者。 该装置还包括用于获得关于检测到的音频掩蔽者候选者的上下文信息的统计信息的第一上下文分析器,以及用于构建掩蔽程序库或通过基于统计信息添加至少一个音频掩码选择器来构建掩蔽程序库或更新现有掩蔽程序库的掩码程序库构建器 作为音频掩蔽器进入掩蔽器库,其中制造商库中的音频掩蔽器被用于插入第二音频信号中的目标位置以隐藏第二音频信号中的缺陷。
-
公开(公告)号:US12014745B2
公开(公告)日:2024-06-18
申请号:US17882900
申请日:2022-08-08
Inventor: Stefan Bruhn , Michael Eckert , Juan Felix Torres , Stefanie Brown , David S. McGrath
IPC: G10L19/008 , H04S3/00
CPC classification number: G10L19/008 , H04S3/008 , H04S2400/01
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
-
公开(公告)号:US20230335142A1
公开(公告)日:2023-10-19
申请号:US18043905
申请日:2021-09-07
Inventor: Dirk Jeroen Breebaart , Michael Eckert , Heiko Purnhagen
IPC: G10L19/008 , G10L19/22
CPC classification number: G10L19/008 , G10L19/22
Abstract: A method comprising receiving a first input bit stream for a first parametrically coded input audio signal, the first input bit stream including data representing a first input core audio signal and a first set including at least one spatial parameter relating to the first parametrically coded input audio signal. A first covariance matrix of the first parametrically coded audio signal is determined based on the spatial parameter(s) of the first set. A modified set including at least one spatial parameter is determined based on the determined first covariance matrix, wherein the modified set is different from the first set. An output core audio signal is determined, which is based on, or constituted by, the first input core audio signal. An output bit stream for a parametrically coded output audio signal is generated, the output bit stream including data representing the output core audio signal and the modified set.
-
-
-
-
-
-
-
-
-