-
公开(公告)号:US11770666B2
公开(公告)日:2023-09-26
申请号:US17397887
申请日:2021-08-09
CPC分类号: H04S1/005 , H04M3/568 , H04R3/12 , H04R5/033 , H04R5/04 , H04S7/30 , H04S7/304 , H04S2400/01 , H04S2400/03 , H04S2400/11 , H04S2420/01 , H04S2420/11
摘要: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
-
公开(公告)号:US11694711B2
公开(公告)日:2023-07-04
申请号:US17723317
申请日:2022-04-18
发明人: Xuejing Sun , Glenn N. Dickins
IPC分类号: G10L21/0208 , G10L25/78 , G10L21/0364 , G10L21/0316 , G10K11/16 , H03G3/32 , G10L21/0224 , G10L21/034 , H03G3/30
CPC分类号: G10L21/0364 , G10K11/16 , G10L21/0224 , G10L21/034 , G10L21/0316 , G10L25/78 , H03G3/301 , H03G3/32
摘要: A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.
-
公开(公告)号:US11107481B2
公开(公告)日:2021-08-31
申请号:US16379479
申请日:2019-04-09
IPC分类号: G10L19/005 , G10L19/022 , G10L19/16 , H04L29/06 , G10L19/02
摘要: Systems and methods are described for concealing packet loss in a received audio stream. Packets of the audio stream may be received in a non-lapped transform domain format, where at least one packet is missing in the stream. The received packets are decoded, and each missing packet in the decoded stream is replaced by a reduced-energy signal block. Each reduced-energy signal block may also be modified at a beginning or ending boundary, and shifted such that a start or end of each missing packet does not coincide with a peak of a transform window of a lapped transform domain format. The raw audio signal may then be encoded into transform windows having the lapped transform domain format. Packet loss concealment may then be performed for selected transform windows that include modified reduced-energy blocks, either prior to transmission or after transmission by the receiving endpoint.
-
公开(公告)号:US11017793B2
公开(公告)日:2021-05-25
申请号:US16061771
申请日:2016-12-14
发明人: Dong Shi , David Gunawan , Glenn N. Dickins
IPC分类号: G10L21/02 , G10L21/0232 , G10L25/72 , G10L25/18 , G10L21/0208
摘要: Example embodiments disclosed herein relate to audio signal processing. A method of indicating a presence of a nuisance in an audio signal is disclosed. The method includes determining a probability of the presence of the nuisance in a frame of the audio signal based on a feature of the audio signal, the nuisance representing an unwanted sound made by a user, in response to the probability of the presence of the nuisance exceeding a threshold, tracking the audio signal based on a metric over a plurality of frames following the frame, determining, based on the tracking, that the presence of the nuisance is to be indicated to the user, and in response to the determination, presenting to the user a notification of the presence of the nuisance. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US10924872B2
公开(公告)日:2021-02-16
申请号:US16079071
申请日:2017-02-16
发明人: David Gunawan , Glenn N. Dickins
摘要: Described herein are audio capture systems and methods. One embodiment provides an audio capture system (1) including: microphones (9-11) positioned to capture respective audio signals from different directions or locations within an audio environment; a mixing module (7) configured to mix the audio signals in accordance with a mixing control signal to produce an output audio mix, wherein, upon the detection of vibration activity, the mixing control signal controls the mixing module (7) to selectively temporarily modify one or more of the audio signals to reduce the presence of noise associated with vibration activity in the output audio mix.
-
公开(公告)号:US10812759B2
公开(公告)日:2020-10-20
申请号:US16518887
申请日:2019-07-22
IPC分类号: H04N7/15 , H04M3/56 , H04N7/14 , H04L12/18 , G01S3/80 , G06T7/70 , G01S5/18 , H04R3/12 , H04S7/00 , G06K9/32 , G06K9/00
摘要: Systems and methods are described for determining orientation of an external audio device in a video conference, which may be used to provide congruent multimodal representation for a video conference. A camera of a video conferencing system may be used to detect a potential location of an external audio device within a room in which the video conferencing system is providing a video conference. Within the detected potential location, a visual pattern associated with the external audio device may be identified. Using the identified visual pattern, the video conferencing system may estimate an orientation of the external audio device, the orientation being used by the video conferencing system to provide spatial audio video congruence to a far end audience.
-
公开(公告)号:US10560661B2
公开(公告)日:2020-02-11
申请号:US15918214
申请日:2018-03-12
摘要: Systems and methods are described for detecting and remedying potential incongruence in a video conference. A camera of a video conferencing system may capture video images of a conference room. A processor of the video conferencing system may identify locations of a plurality of participants within an image plane of a video image. Using face and shape detection, a location of a center point of each identified participant's torso may be calculated. A region of congruence bounded by key parallax lines may be calculated, the key parallax lines being a subset of all parallax lines running through the center points of each identified participant. When the audio device location is not within the region of congruence, audio captured by an audio device may be adjusted to reduce effects of incongruence when the captured audio is replayed at a far end of the video conference.
-
公开(公告)号:US10446166B2
公开(公告)日:2019-10-15
申请号:US15648111
申请日:2017-07-12
发明人: Glenn N. Dickins , Paul Holmberg , Dong Shi
IPC分类号: G10L21/02 , G10L21/0232 , H04R29/00 , G10L25/48 , H04S7/00 , G10L21/0216 , G10L21/0208
摘要: Example embodiments disclosed herein relate to assessment and adjustment for an audio environment. A computer-implemented method is provided. The method includes obtaining a first audio signal captured by a device located in an environment. The method also includes analyzing a characteristic of the first audio signal to determine an acoustic performance metric for the environment. The method further includes, in response to the acoustic performance metric being below a threshold, providing a first task for a user to perform based on the characteristic of the first audio signal. The first task is related to an adjustment to a setting of the environment. Embodiments in this regard further provide a corresponding computer program product. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US20190281404A1
公开(公告)日:2019-09-12
申请号:US16424409
申请日:2019-05-28
摘要: Described herein is a method for creating an object-based audio signal from an audio input, the audio input including one or more audio channels that are recorded to collectively define an audio scene. The one or more audio channels are captured from a respective one or more spatially separated microphones disposed in a stable spatial configuration. The method includes the steps of: a) receiving the audio input; b) performing spatial analysis on the one or more audio channels to identify one or more audio objects within the audio scene; c) determining contextual information relating to the one or more audio objects; d) defining respective audio streams including audio data relating to at least one of the identified one or more audio objects; and e) outputting an object-based audio signal including the audio streams and the contextual information.
-
公开(公告)号:US10393571B2
公开(公告)日:2019-08-27
申请号:US15580242
申请日:2016-07-06
发明人: Dong Shi , David Gunawan , Glenn N. Dickins , Kai Li
摘要: Example embodiments disclosed herein relate to a estimation of reverberant energy components from audio sources. A method of estimating a reverberant energy component from an active audio source (100) is disclosed. The method comprises determining a correspondence between the active audio source and a plurality of sample sources by comparing one or more spatial features of the active audio source with one or more spatial features of the plurality of sample sources, each of the sample sources being associated with an adaptive filtering model (101); obtaining an adaptive filtering model for the active audio source based on the determined correspondence (102); and estimating the reverberant energy component from the active audio source over time based on the adaptive filtering model (103). Corresponding system (800) and computer program product (900) are also disclosed.
-
-
-
-
-
-
-
-
-