METHOD AND APPARATUS FOR VOICE ACTIVITY DETERMINATION

    公开(公告)号:EP3392668A1

    公开(公告)日:2018-10-24

    申请号:EP18174931.8

    申请日:2009-04-24

    CPC classification number: G10L25/78 G10L2021/02165 G10L2021/02166

    Abstract: In accordance with an example embodiment of the invention, there is provided an apparatus for detecting voice activity in an audio signal. The apparatus comprises a first voice activity detector (6b) for making a first voice activity detection decision (D2) based at least in part on the voice activity of a first audio signal (A1) received from a first microphone (1a). The apparatus also comprises a second voice activity detector (6a) for making a second voice activity detection decision (D1) based at least in part on an estimate of a direction of the first audio signal (A1) and an estimate of a direction of a second audio signal (A2) received from a second microphone (1b). The apparatus further comprises a classifier (6c) for making a third voice activity detection decision (D3) based at least in part on the first and second voice activity detection decisions.

    Sound field spatial stabilizer
    6.
    发明授权
    Sound field spatial stabilizer 有权
    声场空间稳定器

    公开(公告)号:EP2760021B1

    公开(公告)日:2018-01-17

    申请号:EP13153065.1

    申请日:2013-01-29

    CPC classification number: G10L21/0208 G10L2021/02082 G10L2021/02165

    Abstract: In a system and method for maintaining the spatial stability of a sound field a balance gain may be calculated for two or more microphone signals. The balance gain may be associated with a spatial image in the sound field. Signal values may be calculated for each of the microphone. The signal values may be signal estimates or signal gains calculated to improve a characteristic of the microphone signals. The differences between the signal values associated with each microphone signal may be limited although some difference between signal values may be allowable. One or more microphone signals are adjusted responsive to the two or more balance gains and the signal gains to maintain the spatial stability of the sound field. The adjustments of one or more microphone signals may include mixing of two or more microphone. The signal gains are applied to the two or more microphone signals.

    VOICE CLARIFICATION DEVICE AND COMPUTER PROGRAM THEREFOR
    7.
    发明公开
    VOICE CLARIFICATION DEVICE AND COMPUTER PROGRAM THEREFOR 有权
    SPRACHKLÄRUNGSVORRICHTUNGUND COMPUTERPROGRAMMDAFÜR

    公开(公告)号:EP3113183A4

    公开(公告)日:2017-07-26

    申请号:EP15755932

    申请日:2015-02-12

    Inventor: SHIGA YOSHINORI

    Abstract: [Object] To provide a speech intelligibility improving apparatus capable of generating highly intelligible speech in various environments without unnecessarily amplifying sound volume. [Solution] A speech intelligibility improving apparatus 250 includes: an envelope surface extracting unit 292 extracting, from a spectrum of speech signal 254 as an object of processing, a curve representing a general outline of peaks of spectral envelope in contact with or along local peaks of spectral envelope of the spectrum; a noise adapting unit 300 modifying spectrum of speech signal 254 based on the curve extracted by envelope surface extracting unit 292; and a sinusoidal wave speech synthesizing unit 305 generating a modified speech signal 260 for the speech improved in intelligibility based on the spectrum modified by noise adapting unit 300.

    Abstract translation: 本发明的目的在于提供一种能够在各种环境下生成高度清晰的语音而不会不必要地放大音量的语音清晰度改善装置。 解决方案语音清晰度改善装置250包括:包络表面提取单元292,从作为处理对象的语音信号254的频谱中提取表示与局部峰值接触或沿着局部峰值的频谱包络的​​峰值的总体轮廓的曲线 谱的谱包络; 噪声适应单元300,其基于由包络面提取单元292提取的曲线来修改语音信号254的频谱; 以及正弦波语音合成单元305,其基于由噪声适应单元300修改的频谱来生成用于可理解性提高的语音的修改语音信号260。

    AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND RECORDING MEDIUM STORING A PROGRAM
    8.
    发明公开
    AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND RECORDING MEDIUM STORING A PROGRAM 审中-公开
    一个音频信号,TONSIGNALVERARBEITUNGSVERFAHREN和记录介质,在一个程序保存

    公开(公告)号:EP3147901A1

    公开(公告)日:2017-03-29

    申请号:EP16185469.0

    申请日:2016-08-24

    Inventor: Matsuo, Naoshi

    Abstract: An audio signal processing device that includes: a processor configured to execute a procedure, the procedure comprising: detecting a speech segment of an audio signal; suppressing noise in the audio signal; and adjusting an amount of suppression of noise such that the amount of suppression during a specific period, which starts from a position based on a terminal end of the detected speech segment and is a period shorter than a period spanning from the terminal end of the detected speech segment to a starting end of a next speech segment, becomes greater than in other segments, and a memory configured to store audio signals before and after noise suppression and the amount of suppression before and after adjustment.

    Abstract translation: 一种音频信号处理装置,并包括:被配置为执行一个过程,该过程包括处理器:检测音频信号的语音段; 抑制所述音频信号的噪声; 和调整达噪声的抑制检查确实抑制在特定期间的量,这从基于检测到的语音段的末端的位置开始,并且周期比一个周期从所检测的终端跨越短 语音段到下一个语音段的起始端,变得比在其它段时,和存储器,被配置之前和噪声抑制和抑制的前和调整后的量之后存储的音频信号。

    NOISE SUPPRESSION
    10.
    发明公开
    NOISE SUPPRESSION 审中-公开
    RAUSCHUNTERDRÜCKUNG

    公开(公告)号:EP3120355A2

    公开(公告)日:2017-01-25

    申请号:EP15707356.0

    申请日:2015-03-02

    Abstract: A noise suppressor comprises a first (401) and a second transformer (403) for generating a first and second frequency domain signal from a frequency transform of a first and second microphone signal. A gain unit (405, 407, 409) determines time frequency tile gains in response to a difference measure for magnitude time frequency tile values of the first frequency domain signal and magnitude time frequency tile values of the second frequency domain signal. A scaler (411) generates a third frequency domain signal by scaling time frequency tile values of the first frequency domain signal by the time frequency tile gains; and the resulting signal is converted to the time domain by a third transformer (413). A designator (405, 407, 415) designates time frequency tiles of the first frequency domain signal as speech tiles or noise tiles; and the gain unit (409) determines the gains in response to the designation of the time frequency tiles as speech tiles or noise tiles.

    Abstract translation: 噪声抑制器包括用于从第一和第二麦克风信号的频率变换产生第一和第二频域信号的第一(401)和第二变换器(403)。 增益单元(405,407,409)响应于所述第一频域信号的幅度时间频率瓦片值和所述第二频域信号的幅度时间频率瓦片值的差测量来确定时间频率瓦片增益。 缩放器(411)通过按照时间频率瓦片增益缩放第一频域信号的时间频率瓦片值来产生第三频域信号; 并且由第三变压器(413)将所得到的信号转换成时域。 指示符(405,407,415)指定第一频域信号的时间频率瓦片作为语音瓦片或噪声瓦片; 并且增益单元(409)响应于将时间频率瓦片指定为语音瓦片或噪声瓦片来确定增益。

Patent Agency Ranking