Nearby talker obscuring, duplicate dialogue amelioration and automatic muting of acoustically proximate participants

    公开(公告)号:US10142484B2

    公开(公告)日:2018-11-27

    申请号:US15549581

    申请日:2016-02-08

    Abstract: In an audio conferencing environment, including multiple users participating by means of a series of associated audio input devices for the provision of audio input, and a series of audio output devices for the output of audio output streams to the multiple users, with the audio input and output devices being interconnected to a mixing control server for the control and mixing of the audio inputs from each audio input devices to present a series of audio streams to the audio output devices, a method of reducing the effects of cross talk pickup of at least a first audio conversation by multiple audio input devices, the method including the steps of: (a) monitoring the series of audio input devices for the presence of a duplicate audio conversation input from at least two input audio sources in an audio output stream; and (b) where a duplicate audio conversation input is detected, suppressing the presence of the duplicate audio conversation input in the audio output stream.

    Determining a harmonicity measure for voice processing
    13.
    发明授权
    Determining a harmonicity measure for voice processing 有权
    确定语音处理的谐波度量

    公开(公告)号:US09520144B2

    公开(公告)日:2016-12-13

    申请号:US14384842

    申请日:2013-03-21

    CPC classification number: G10L25/84 G10L2025/937

    Abstract: A method, an apparatus, and a computer-readable medium configured with instructions that when executed carry out the method for determining a measure of harmonicity. In one embodiment the method includes selecting candidate fundamental frequencies within a range, and for candidate determining a mask or retrieving a pre-calculated mask that has positive value for each frequency that contributed to harmonicity, and negative value for each frequency that contributes to inharmonicity. A candidate harmonicity measure is calculated for each candidate fundamental by summing the product of the mask and the magnitude measure spectrum. The harmonicity measure is selected as the maximum of the candidate harmonicity measures.

    Abstract translation: 一种配置有指令的方法,装置和计算机可读介质,其在执行时执行用于确定谐波度量的方法。 在一个实施例中,该方法包括选择范围内的候选基本频率,以及候选者确定掩模或检索对有助于谐波的每个频率具有正值的预先计算的掩模以及有助于不协调性的每个频率的负值。 通过对掩模和幅度测量光谱的乘积相加来计算每个候选基波的候选谐波度量。 选择谐波度量作为候选谐波度量的最大值。

    Method and System for Object-Dependent Adjustment of Levels of Audio Objects
    14.
    发明申请
    Method and System for Object-Dependent Adjustment of Levels of Audio Objects 有权
    用于对象相关调整音频对象级别的方法和系统

    公开(公告)号:US20150228293A1

    公开(公告)日:2015-08-13

    申请号:US14428419

    申请日:2013-09-11

    CPC classification number: G10L21/0324 G10L21/0272 G10L21/034 H03G3/32

    Abstract: In some embodiments, a method for adaptive control of gain applied to an audio signal, including steps of analyzing segments of the signal to identify audio objects (e.g., voices of participants in a voice conference); storing information regarding each distinct identified object; using at least some of the information to determine at least one of a target gain, or a gain change rate for reaching a target gain, for each identified object; and applying gain to segments of the signal indicative of an identified object such that the gain changes (typically, at the gain change rate for the object) from an initial gain to the target gain for the object. The information stored may include a scene description. Aspects of the invention include a system configured (e.g., programmed) to per form any embodiment of the inventive method.

    Abstract translation: 在一些实施例中,一种用于对应用于音频信号的增益进行自适应控制的方法,包括分析信号的段以识别音频对象(例如语音会议中的参与者的语音)的步骤; 存储关于每个不同的识别对象的信息; 使用所述信息中的至少一些来为每个识别的对象确定目标增益或用于达到目标增益的增益变化率中的至少一个; 以及将增益应用于指示所识别的对象的信号的段,使得所述增益从所述对象的初始增益改变到所述目标增益(通常以所述对象的增益变化率)改变。 存储的信息可以包括场景描述。 本发明的方面包括构造(例如,被编程)以形成本发明方法的任何实施例的系统。

    Determining a Harmonicity Measure for Voice Processing
    15.
    发明申请
    Determining a Harmonicity Measure for Voice Processing 有权
    确定语音处理的谐波度量

    公开(公告)号:US20150032447A1

    公开(公告)日:2015-01-29

    申请号:US14384842

    申请日:2013-03-21

    CPC classification number: G10L25/84 G10L2025/937

    Abstract: A method, an apparatus, and a computer-readable medium configured with instructions that when executed carry out the method for determining a measure of harmonicity. In one embodiment the method includes selecting candidate fundamental frequencies within a range, and for candidate determining a mask or retrieving a pre-calculated mask that has positive value for each frequency that contributed to harmonicity, and negative value for each frequency that contributes to inharmonicity. A candidate harmonicity measure is calculated for each candidate fundamental by summing the product of the mask and the magnitude measure spectrum. The harmonicity measure is selected as the maximum of the candidate harmonicity measures.

    Abstract translation: 一种配置有指令的方法,装置和计算机可读介质,其在执行时执行用于确定谐波度量的方法。 在一个实施例中,该方法包括选择范围内的候选基本频率,以及候选者确定掩模或检索对有助于谐波的每个频率具有正值的预先计算的掩模以及有助于不协调性的每个频率的负值。 通过对掩模和幅度测量光谱的乘积相加来计算每个候选基波的候选谐波度量。 选择谐波度量作为候选谐波度量的最大值。

    METHOD AND SYSTEM FOR SIGNAL TRANSMISSION CONTROL
    16.
    发明申请
    METHOD AND SYSTEM FOR SIGNAL TRANSMISSION CONTROL 有权
    信号传输控制方法与系统

    公开(公告)号:US20150032446A1

    公开(公告)日:2015-01-29

    申请号:US14382667

    申请日:2013-03-21

    CPC classification number: G10L25/84 G10L25/78 G10L2025/783

    Abstract: An audio signal with a temporal sequence of blocks or frames is received or accessed. Features are determined as characterizing aggregately the sequential audio blocks/frames that have been processed recently, relative to current time. The feature determination exceeds a specificity criterion and is delayed, relative to the recently processed audio blocks/frames. Voice activity indication is detected in the audio signal. VAD is based on a decision that exceeds a preset sensitivity threshold and is computed over a brief time period, relative to blocks/frames duration, and relates to current block/frame features. The VAD and the recent feature determination are combined with state related information, which is based on a history of previous feature determinations that are compiled from multiple features, determined over a time prior to the recent feature determination time period. Decisions to commence or terminate the audio signal, or related gains, are outputted based on the combination.

    Abstract translation: 具有块或帧的时间序列的音频信号被接收或访问。 确定特征是综合表征最近相对于当前时间最近处理的顺序音频块/帧。 相对于最近处理的音频块/帧,特征确定超过特定性标准并被延迟。 在音频信号中检测到语音活动指示。 VAD基于超过预设灵敏度阈值的决定,并且相对于块/帧持续时间在短时间段内计算,并且涉及当前块/帧特征。 VAD和最近的特征确定与状态相关信息相结合,状态相关信息基于在最近的特征确定时间段之前的时间确定的从多个特征编译的先前特征确定的历史。 基于该组合输出开始或终止音频信号或相关增益的决定。

Patent Agency Ranking