SUBBAND DOMAIN ACOUSTIC ECHO CANCELLER BASED ACOUSTIC STATE ESTIMATOR

    Publication number: US20230421952A1

    Publication date: 2023-12-28

    Application number: US18255573

    Filing date: 2021-12-02

    CPC classification number: H04R3/02

    Abstract: Some implementations involve receiving, from a first subband domain acoustic echo canceller (AEC) of a first audio device in an audio environment, first adaptive filter management data from each of a plurality of first adaptive filter management modules, each first adaptive filter management module corresponding to a subband of the first subband domain AEC, each first adaptive filter management module being configured to control a first plurality of adaptive filters. The first plurality of adaptive filters may include at least a first adaptive filter type and a second adaptive filter type. Some implementations involve extracting, from the first adaptive filter management data, a first plurality of extracted features corresponding to a plurality of subbands of the first subband domain AEC and estimating a current local acoustic state based, at least in part, on the first plurality of extracted features.
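
A minimal sketch of the pipeline the abstract outlines: per-subband filter management data in, features across subbands out, then a state estimate. Everything concrete here is an assumption for illustration: the management data is reduced to the name of the filter type each subband's manager currently favours, the filter type names `conservative` and `aggressive`, the state labels, and the 0.5 threshold are all hypothetical and do not come from the application.

```python
from collections import Counter


def extract_features(selected_filter_per_subband):
    """Return the fraction of subbands that currently favour each filter type.

    `selected_filter_per_subband` holds one hypothetical filter-type name
    per subband, standing in for the adaptive filter management data.
    """
    n = len(selected_filter_per_subband)
    if n == 0:
        return {}
    counts = Counter(selected_filter_per_subband)
    return {name: count / n for name, count in counts.items()}


def estimate_state(features, threshold=0.5):
    """Toy rule (an assumption): many subbands favouring the fast-adapting
    filter type is read as a sign that the local acoustic state changed."""
    if features.get("aggressive", 0.0) >= threshold:
        return "changed"
    return "stable"
```

For example, `estimate_state(extract_features(["aggressive"] * 3 + ["conservative"]))` returns `"changed"` under these assumed labels.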

    System and method for spatial processing of soundfield signals

    Publication number: US10959032B2

    Publication date: 2021-03-23

    Application number: US16077040

    Filing date: 2017-02-09

    Abstract: A method for interactive and user guided manipulation of multichannel audio content, the method including the steps of: providing a content preview facility for replay and review of multichannel audio content by a user; providing a user interface for the user selection of a segment of multichannel audio content having an unsatisfactory audio content; processing the audio content to include associated audio object activity spatial or signal space regions, to create a time line of activity where one or more spatial or signal space regions are active at any given time; matching the user's gesture input against at least one of the active spatial or signal space regions; signal processing the audio emanating from selected active spatial or signal space region using a number of differing techniques to determine at least one processed alternative; providing the user with an interactive playback facility to listen to the processed alternative.

    Audio capture and render device having a visual display and user interface for use for audio conferencing

    Publication number: US10079941B2

    Publication date: 2018-09-18

    Application number: US14788963

    Filing date: 2015-07-01

    CPC classification number: H04M9/085 G10L21/0272 H04M3/567 H04M3/568

    Abstract: A method in a soundfield-capturing endpoint and the capturing endpoint that comprises a microphone array capturing soundfield, and an input processor pre-processing and performing auditory scene analysis to detect local sound objects and positions, de-clutter the sound objects, and integrate with auxiliary audio signals to form a de-cluttered local auditory scene that has a measure of plausibility and perceptual continuity. The input processor also codes the resulting de-cluttered auditory scene to form coded scene data comprising mono audio and additional scene data to send to others. The endpoint includes an output processor generating signals for a display unit that displays a summary of the de-cluttered local auditory scene and/or a summary of activity in the communication system from received data, the display including a shaped ribbon display element that has an extent with locations on the extent representing locations and other properties of different sound objects.

    Sound field analysis system
    Patent type: granted invention patent (status: in force)

    Publication number: US09451379B2

    Publication date: 2016-09-20

    Application number: US14187616

    Filing date: 2014-02-24

    CPC classification number: H04S7/30 G01S3/802 G10L19/008 H04S2400/15

    Abstract: In one embodiment, a sound field is mapped by extracting spatial angle information, diffusivity information, and optionally, sound level information. The extracted information is mapped for representation in the form of a Riemann sphere, wherein spatial angle varies longitudinally, diffusivity varies latitudinally, and level varies radially along the sphere. A more generalized mapping employs mapping the spatial angle and diffusivity information onto a representative region exhibiting variations in direction of arrival that correspond to the extracted spatial information and variations in distance that correspond to the extracted diffusivity information.
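
The sphere mapping in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions: diffusivity is taken to lie in [0, 1], with 0 placed on the equator and 1 at the pole (where longitude stops mattering), and level is used directly as the radius. The function name `map_to_sphere` and these conventions are not from the patent.

```python
import math


def map_to_sphere(angle_rad, diffusivity, level=1.0):
    """Map (spatial angle, diffusivity, level) to a 3D point.

    Longitude carries the spatial angle, latitude carries diffusivity,
    and the radius carries level. The latitude convention is an assumption.
    """
    if not 0.0 <= diffusivity <= 1.0:
        raise ValueError("diffusivity is assumed to lie in [0, 1]")
    latitude = diffusivity * math.pi / 2.0  # 0 -> equator, 1 -> pole
    x = level * math.cos(latitude) * math.cos(angle_rad)
    y = level * math.cos(latitude) * math.sin(angle_rad)
    z = level * math.sin(latitude)
    return (x, y, z)
```

With this convention a fully directional source lands on the equator at its angle of arrival, while a fully diffuse one collapses to the pole regardless of angle.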

    Method and system for signal transmission control
    Patent type: granted invention patent (status: in force)

    Publication number: US09373343B2

    Publication date: 2016-06-21

    Application number: US14382667

    Filing date: 2013-03-21

    CPC classification number: G10L25/84 G10L25/78 G10L2025/783

    Abstract: An audio signal with a temporal sequence of blocks or frames is received or accessed. Features are determined as characterizing aggregately the sequential audio blocks/frames that have been processed recently, relative to current time. The feature determination exceeds a specificity criterion and is delayed, relative to the recently processed audio blocks/frames. Voice activity indication is detected in the audio signal. VAD is based on a decision that exceeds a preset sensitivity threshold and is computed over a brief time period, relative to blocks/frames duration, and relates to current block/frame features. The VAD and the recent feature determination are combined with state related information, which is based on a history of previous feature determinations that are compiled from multiple features, determined over a time prior to the recent feature determination time period. Decisions to commence or terminate the audio signal, or related gains, are outputted based on the combination.
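
One way to picture the combination of a fast, sensitive VAD with a delayed, highly specific feature decision is a small state machine. The sketch below is an assumption-laden illustration, not the patented method: the class name `TransmissionController`, the rule that only the specific decision may open transmission, and the `hold_frames` hangover are all hypothetical.

```python
class TransmissionController:
    """Toy start/stop logic for transmitting an audio signal.

    - `confirmed_speech`: delayed, high-specificity decision over recent frames.
    - `vad_active`: instantaneous, high-sensitivity decision for this frame.
    The stored `transmitting` flag and hangover counter play the role of
    state-related information built from earlier decisions.
    """

    def __init__(self, hold_frames=3):
        self.hold_frames = hold_frames
        self.transmitting = False
        self._hang = 0

    def step(self, vad_active, confirmed_speech):
        if confirmed_speech:
            # Only the specific decision is allowed to open transmission.
            self.transmitting = True
            self._hang = self.hold_frames
        elif self.transmitting:
            if vad_active:
                # The sensitive VAD keeps an already open channel open.
                self._hang = self.hold_frames
            else:
                self._hang -= 1
                if self._hang <= 0:
                    self.transmitting = False
        return self.transmitting
```

In this sketch a burst that trips only the sensitive VAD never starts transmission, but once speech has been confirmed the VAD alone is enough to keep the channel open.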

    Method and system for bias corrected speech level determination
    Patent type: granted invention patent (status: in force)

    Publication number: US09373341B2

    Publication date: 2016-06-21

    Application number: US14384586

    Filing date: 2013-03-21

    CPC classification number: G10L21/0316 G10L25/18 G10L25/21 G10L25/48 G10L25/78

    Abstract: Method for measuring level of speech determined by an audio signal in a manner which corrects for and reduces the effect of modification of the signal by the addition of noise thereto and/or amplitude compression thereof, and a system configured to perform any embodiment of the method. In some embodiments, the method includes steps of generating frequency banded, frequency-domain data indicative of an input speech signal, determining from the data a Gaussian parametric spectral model of the speech signal, and determining from the parametric spectral model an estimated mean speech level and a standard deviation value for each frequency band of the data; and generating speech level data indicative of a bias corrected mean speech level for each frequency band, including using at least one correction value to correct the estimated mean speech level for the frequency band, where each correction value has been predetermined using a reference speech model.
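
The per-band steps in this abstract (estimate a mean and standard deviation, then apply a predetermined correction) can be sketched as follows. This is a simplified illustration: levels are assumed to be in dB, the Gaussian model is reduced to the sample mean and population standard deviation per band, and the correction is assumed to be additive. How the correction values are derived from a reference speech model is not shown; `corrections_db` is a hypothetical input supplied by the caller.

```python
import statistics


def band_level_stats(frames_db):
    """Return (mean, standard deviation) per frequency band.

    `frames_db` is a list of frames, each a list of per-band levels in dB.
    """
    bands = list(zip(*frames_db))  # regroup values by band
    return [(statistics.fmean(b), statistics.pstdev(b)) for b in bands]


def bias_corrected_levels(frames_db, corrections_db):
    """Apply one predetermined correction (assumed additive, in dB) per band."""
    stats = band_level_stats(frames_db)
    if len(stats) != len(corrections_db):
        raise ValueError("need exactly one correction value per band")
    return [mean + corr for (mean, _std), corr in zip(stats, corrections_db)]
```

The standard deviation is returned alongside the mean because the abstract determines both per band; a fuller implementation would presumably use it when choosing the correction value.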

    HOWL DETECTION IN CONFERENCE SYSTEMS

    Publication number: US20220201125A1

    Publication date: 2022-06-23

    Application number: US17691966

    Filing date: 2022-03-10

    Abstract: Some disclosed teleconferencing methods may involve detecting a howl state during a teleconference. The teleconference may involve two or more teleconference client locations and a teleconference server. The teleconference server may be configured for providing full-duplex audio connectivity between the teleconference client locations. The howl state may be a state of acoustic feedback involving two or more teleconference devices in a teleconference client location. Detecting the howl state may involve an analysis of both spectral and temporal characteristics of teleconference audio data. Some disclosed teleconferencing methods may involve determining which client location is causing the howl state. Some such methods may involve mitigating the howl state and/or sending a howl state detection message.
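
A rough sketch of what analysing both spectral and temporal characteristics could look like. The spectral cue used here is the peak-to-average power ratio across bands, and the temporal cue is persistence of the same peak band over consecutive frames. Both cues, the function name `detect_howl`, and the default thresholds are assumptions chosen for illustration; the application does not specify them in this abstract.

```python
import math


def detect_howl(frames, papr_threshold_db=10.0, min_frames=5):
    """Return True if a narrowband peak persists long enough to look like howl.

    `frames` is an iterable of per-band power lists (linear scale).
    """
    run = 0           # consecutive peaky frames sharing one peak band
    last_peak = None  # index of the peak band in the previous peaky frame
    for powers in frames:
        mean_power = sum(powers) / len(powers) if powers else 0.0
        if mean_power <= 0.0:
            run, last_peak = 0, None
            continue
        peak_band = max(range(len(powers)), key=powers.__getitem__)
        papr_db = 10.0 * math.log10(powers[peak_band] / mean_power)
        if papr_db >= papr_threshold_db:
            # Spectral cue met; temporal cue requires the same band to persist.
            run = run + 1 if peak_band == last_peak else 1
            last_peak = peak_band
        else:
            run, last_peak = 0, None
        if run >= min_frames:
            return True
    return False
```

Speech also produces strong spectral peaks, but they move from frame to frame, which is why the persistence check matters in this sketch.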

    Multitalker optimised beamforming system and method

    Publication number: US10412490B2

    Publication date: 2019-09-10

    Application number: US16078563

    Filing date: 2017-02-23

    Abstract: A method of processing a series of microphone inputs of an audio conference, the method including the steps of: (a) conducting a spatial analysis and feature extraction of the audio conference based on current audio activity; (b) aggregating historical information to obtain information about the approximate relative location of recent sound objects relative to the series of microphone inputs; (c) utilizing the relative location or distance of the sound objects from the series of microphone inputs to determine if beam forming should be utilized to enhance the audio reception from recent sound objects.
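
Steps (b) and (c) can be sketched as a small history buffer plus a decision rule. The rule below, which enables beamforming when the median recent talker distance exceeds a threshold, is purely an assumption for illustration, as are the class name `BeamformingAdvisor`, the history length, and the 1.5 m default. Step (a), the spatial analysis that produces the distance estimates, is not shown.

```python
from collections import deque
from statistics import median


class BeamformingAdvisor:
    """Aggregate recent distance estimates and suggest whether to beamform."""

    def __init__(self, history=50, far_threshold_m=1.5):
        self.far_threshold_m = far_threshold_m
        self._distances = deque(maxlen=history)  # oldest entries drop out

    def observe(self, distance_m):
        """Record the estimated distance of the latest active sound object."""
        self._distances.append(distance_m)

    def should_beamform(self):
        """Hypothetical rule: beamform when recent talkers are mostly far away."""
        if not self._distances:
            return False
        return median(self._distances) > self.far_threshold_m
```

Using the median rather than the latest estimate keeps a single outlier from toggling the beamformer, which is one plausible reason to aggregate history before deciding.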
