Selective forward error correction for spatial audio codecs

    公开(公告)号:US12046247B2

    公开(公告)日:2024-07-23

    申请号:US17702698

    申请日:2022-03-23

    Abstract: Systems and methods for providing forward error correction for a multi-channel audio signal are described. Blocks of an audio stream are buffered into a frame. A transformation can be applied that compacts the energy of each block into a plurality of transformed channels. The energy compaction transform may compact the most energy of a block into the first transformed channel and to compact decreasing amounts of energy into each subsequent transformed channel. The transformed frame may be encoded using any suitable codec and transmitted in a packet over a network. Improved forward error correction may be provided by attaching a low bit rate encoding of the first transformed channel to a subsequent packet. To reconstruct a lost packet, the low bit rate encoding of the first channel for the lost packet may be combined with a packet loss concealment version of the other channels, constructed from a previously-received packet.

    Scalable voice scene media server

    公开(公告)号:US11803351B2

    公开(公告)日:2023-10-31

    申请号:US17601199

    申请日:2020-04-03

    CPC classification number: G06F3/165 H04L65/403 H04M3/568

    Abstract: A communication system, method, and computer-readable medium therefor comprise a media server configured to receive a plurality of audio streams from a corresponding plurality of client devices, the media server including circuitry configured to rank the plurality of audio streams based on a predetermined metric, group a first portion of the plurality of audio streams into a first set, the first portion of the plurality of audio streams being the N highest-ranked audio streams, group a second portion of the plurality of audio streams into a second set, the second portion of the plurality of audio streams being the M lowest-ranked audio streams, forward respective audio streams of the first set to a receiver device, and discard respective audio streams of the second set, wherein N and M are independent integers.

    System and Method of Speaker Cluster Design and Rendering
    4.
    发明申请
    System and Method of Speaker Cluster Design and Rendering 审中-公开
    扬声器群集设计与渲染的系统与方法

    公开(公告)号:US20150078594A1

    公开(公告)日:2015-03-19

    申请号:US14385083

    申请日:2013-03-21

    Abstract: A method of outputting audio in a teleconferencing environment includes receiving audio streams, processing the audio streams according to information regarding effective spatial positions, and outputting, by at least three speakers arranged in more than one dimension, the audio streams having been processed. The information regarding the plurality of effective spatial positions corresponds to a perceived spatial scene that extends beyond the speakers in at least two dimensions. In this manner, participants in the teleconference perceive the audio from the remote participants as originating at different positions in the teleconference room.

    Abstract translation: 在电话会议环境中输出音频的方法包括接收音频流,根据关于有效空间位置的信息来处理音频流,并且通过至少三个扬声器以多于一维的方式输出已经处理的音频流。 关于多个有效空间位置的信息对应于在至少两个维度上延伸超出扬声器的感知空间场景。 以这种方式,电话会议中的参与者将来自远程参与者的音频感知为来自电话会议室中的不同位置。

    Selective forward error correction for spatial audio codecs

    公开(公告)号:US11289103B2

    公开(公告)日:2022-03-29

    申请号:US16928918

    申请日:2020-07-14

    Abstract: Systems and methods for providing forward error correction for a multi-channel audio signal are described. Blocks of an audio stream are buffered into a frame. A transformation can be applied that compacts the energy of each block into a plurality of transformed channels. The energy compaction transform may compact the most energy of a block into the first transformed channel and to compact decreasing amounts of energy into each subsequent transformed channel. The transformed frame may be encoded using any suitable codec and transmitted in a packet over a network. Improved forward error correction may be provided by attaching a low bit rate encoding of the first transformed channel to a subsequent packet. To reconstruct a lost packet, the low bit rate encoding of the first channel for the lost packet may be combined with a packet loss concealment version of the other channels, constructed from a previously-received packet.

    Adjusting spatial congruency in a video conferencing system

    公开(公告)号:US10015443B2

    公开(公告)日:2018-07-03

    申请号:US15527272

    申请日:2015-11-18

    CPC classification number: H04N7/147 H04S7/30 H04S2420/01 H04S2420/11

    Abstract: Example embodiments disclosed herein relate to spatial congruency adjustment. A method for adjusting spatial congruency in a video conference is disclosed. The method in unwarping a visual scene captured by a video endpoint device into at least one rectilinear scene, the video endpoint device being configured to capture the visual scene in an omnidirectional manner, detecting spatial congruency between the at least one rectilinear scene and an auditory scene captured by an audio endpoint device that is positioned in relation to the video endpoint device. The spatial congruency being a degree of alignment between the auditory scene and the at least one rectilinear scene and in response to the detected spatial congruency being below the threshold, adjusting the spatial congruency. Corresponding system and computer program products are also disclosed.

    Audio processing apparatus and audio processing method
    8.
    发明授权
    Audio processing apparatus and audio processing method 有权
    音频处理装置和音频处理方法

    公开(公告)号:US09558744B2

    公开(公告)日:2017-01-31

    申请号:US14650214

    申请日:2013-11-27

    CPC classification number: G10L15/20 G10L21/02 H04M3/568

    Abstract: An audio processing apparatus and an audio processing method are described. In one embodiment, the audio processing apparatus include an audio masker separator for separating from a first audio signal an audio material comprising a sound other than stationary noise and utterance meaningful in semantics, as an audio masker candidate. The apparatus also includes a first context analyzer for obtaining statistics regarding contextual information of detected audio masker candidates, and a masker library builder for building a masker library or updating an existing masker library by adding, based on the statistics, at least one audio masker candidate as an audio masker into the masker library, wherein audio maskers in the maker library are used to be inserted into a target position in a second audio signal to conceal defects in the second audio signal.

    Abstract translation: 描述音频处理装置和音频处理方法。 在一个实施例中,音频处理设备包括一个音频掩蔽器分离器,用于将音频材料与第一音频信号分离,该音频材料包括除了固定噪声之外的声音以及在语义上有意义的话语作为音频掩蔽者候选者。 该装置还包括用于获得关于检测到的音频掩蔽者候选者的上下文信息的统计信息的第一上下文分析器,以及用于构建掩蔽程序库或通过基于统计信息添加至少一个音频掩码选择器来构建掩蔽程序库或更新现有掩蔽程序库的掩码程序库构建器 作为音频掩蔽器进入掩蔽器库,其中制造商库中的音频掩蔽器被用于插入第二音频信号中的目标位置以隐藏第二音频信号中的缺陷。

    PROCESSING PARAMETRICALLY CODED AUDIO
    10.
    发明公开

    公开(公告)号:US20230335142A1

    公开(公告)日:2023-10-19

    申请号:US18043905

    申请日:2021-09-07

    CPC classification number: G10L19/008 G10L19/22

    Abstract: A method comprising receiving a first input bit stream for a first parametrically coded input audio signal, the first input bit stream including data representing a first input core audio signal and a first set including at least one spatial parameter relating to the first parametrically coded input audio signal. A first covariance matrix of the first parametrically coded audio signal is determined based on the spatial parameter(s) of the first set. A modified set including at least one spatial parameter is determined based on the determined first covariance matrix, wherein the modified set is different from the first set. An output core audio signal is determined, which is based on, or constituted by, the first input core audio signal. An output bit stream for a parametrically coded output audio signal is generated, the output bit stream including data representing the output core audio signal and the modified set.

Patent Agency Ranking