Bitstream Syntax for Spatial Voice Coding
    31.
    发明申请
    Bitstream Syntax for Spatial Voice Coding 有权
    用于空间语音编码的位流语法

    公开(公告)号:US20160155447A1

    公开(公告)日:2016-06-02

    申请号:US14392287

    申请日:2014-06-26

    Abstract: An encoding system (100) encodes a first (E1) and further (E2, E3) audio signals as a layered bitstream (B), wherein a quantizer for each frequency band of each signal is selected using a rate allocation rule based on signal-specific rate allocation data, a spectral envelope of the signal and a reference level (EnvE1Max), which is determined based on the spectral envelope of the first signal and is not necessarily included in the bitstream. Further disclosed is a decoding system for reconstructing the audio signals based on the bitstream. In embodiments, the bitstream has a basic layer (BE1), which contains data that enable decoding of the first audio signal, and a spatial layer (Bspatial) facilitating decoding of the further audio signal(s). In embodiments, the encoding system prepares the bitstream subject to a basic-layer bitrate constraint and a total bitrate constraint.

    Abstract translation: 编码系统(100)将第一(E1)和另外(E2,E3)音频信号编码为分层比特流(B),其中使用基于信号的比特率的速率分配规则来选择每个信号的每个频带的量化器, 特定速率分配数据,信号的频谱包络和基于第一信号的频谱包络确定的参考电平(EnvE1Max),并且不一定包括在比特流中。 还公开了一种用于基于比特流重建音频信号的解码系统。 在实施例中,比特流具有包含能够对第一音频信号进行解码的数据的基本层(BE1)以及便于对其它音频信号进行解码的空间层(B空间)。 在实施例中,编码系统根据基本层比特率约束和总比特率约束准备比特流。

    SCALABLE VOICE SCENE MEDIA SERVER
    34.
    发明申请

    公开(公告)号:US20220197592A1

    公开(公告)日:2022-06-23

    申请号:US17601199

    申请日:2020-04-03

    Abstract: A communication system, method, and computer-readable medium therefor comprise a media server configured to receive a plurality of audio streams from a corresponding plurality of client devices, the media server including circuitry configured to rank the plurality of audio streams based on a predetermined metric, group a first portion of the plurality of audio streams into a first set, the first portion of the plurality of audio streams being the N highest-ranked audio streams, group a second portion of the plurality of audio streams into a second set, the second portion of the plurality of audio streams being the M lowest-ranked audio streams, forward respective audio streams of the first set to a receiver device, and discard respective audio streams of the second set, wherein N and M are independent integers.

    Detecting and Mitigating Audio-Visual Incongruence

    公开(公告)号:US20200177837A1

    公开(公告)日:2020-06-04

    申请号:US16786799

    申请日:2020-02-10

    Abstract: Systems and methods are described for detecting and remedying potential incongruence in a video conference. A camera of a video conferencing system may capture video images of a conference room. A processor of the video conferencing system may identify locations of a plurality of participants within an image plane of a video image. Using face and shape detection, a location of a center point of each identified participant's torso may be calculated. A region of congruence bounded by key parallax lines may be calculated, the key parallax lines being a subset of all parallax lines running through the center points of each identified participant. When the audio device location is not within the region of congruence, audio captured by an audio device may be adjusted to reduce effects of incongruence when the captured audio is replayed at a far end of the video conference.

    Post-Teleconference Playback Using Non-Destructive Audio Transport

    公开(公告)号:US20200092422A1

    公开(公告)日:2020-03-19

    申请号:US16691487

    申请日:2019-11-21

    Abstract: Teleconference audio data including a plurality of individual uplink data packet streams, may be received during a teleconference. Each uplink data packet stream may corresponding to a telephone endpoint used by one or more teleconference participants. The teleconference audio data may be analyzed to determine a plurality of suppressive gain coefficients, which may be applied to first instances of the teleconference audio data during the teleconference, to produce first gain-suppressed audio data provided to the telephone endpoints during the teleconference. Second instances of the teleconference audio data, as well as gain coefficient data corresponding to the plurality of suppressive gain coefficients, may be sent to a memory system as individual uplink data packet streams. The second instances of the teleconference audio data may be less gain-suppressed than the first gain-suppressed audio data.

    ADAPTIVE AUDIO FILTERING
    38.
    发明申请

    公开(公告)号:US20190392855A1

    公开(公告)日:2019-12-26

    申请号:US16564532

    申请日:2019-09-09

    Abstract: In an audio processing system (300), a filtering section (350, 400): receives subband signals (410, 420, 430) corresponding to audio content of a reference signal (301) in respective frequency subbands; receives subband signals (411, 421, 431) corresponding to audio content of a response signal (304) in the respective subbands; and forms filtered inband references (412, 422, 432) by applying respective filters (413, 423, 433) to the subband signals of the reference signal. For a frequency subband: filtered crossband references (424, 425) are formed by multiplying, by scalar factors (426, 427), filtered inband references of other subbands; a composite filtered reference (428) is formed by summing the filtered inband reference of the subband (422) and the filtered crossband references; a residual signal (429) is computed as a difference between the composite filtered reference and the subband signal of the response signal corresponding to the subband; and the scalar factors and the filter applied to the subband signal of the reference signal corresponding to the subband are adjusted based on the residual signal.

    POST-PROCESSING GAINS FOR SIGNAL ENHANCEMENT
    39.
    发明申请

    公开(公告)号:US20190287548A1

    公开(公告)日:2019-09-19

    申请号:US16429552

    申请日:2019-06-03

    Abstract: A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.

    Auxiliary Signal for Detecting Microphone Impairment

    公开(公告)号:US20190045312A1

    公开(公告)日:2019-02-07

    申请号:US16079071

    申请日:2017-02-16

    Abstract: Described herein are audio capture systems and methods. One embodiment provides an audio capture system (1) including: microphones (9-11) positioned to capture respective audio signals from different directions or locations within an audio environment; a mixing module (7) configured to mix the audio signals in accordance with a mixing control signal to produce an output audio mix, wherein, upon the detection of vibration activity, the mixing control signal controls the mixing module (7) to selectively temporarily modify one or more of the audio signals to reduce the presence of noise associated with vibration activity in the output audio mix.

Patent Agency Ranking