Sound processing device, correcting device, correcting method and recording medium
    2.
    发明授权
    Sound processing device, correcting device, correcting method and recording medium 有权
    声音处理装置,校正装置,校正方法和记录介质

    公开(公告)号:US08615092B2

    公开(公告)日:2013-12-24

    申请号:US12788107

    申请日:2010-05-26

    申请人: Naoshi Matsuo

    发明人: Naoshi Matsuo

    IPC分类号: H04R3/00

    CPC分类号: H04S7/30

    摘要: A sound processing device includes: a plurality of sound input units; a detecting unit for detecting a frequency component of each sound input to the plurality of sound signal unit, the each sound arriving from a direction approximately perpendicular to a line determined by arrangement positions of two sound input units among the plurality of sound input units; a correction coefficient unit for obtaining a correction coefficient for correcting a level of at least one of the sound signals generated from the input sounds by the two sound input units so as to match the levels of the sound signals with each other based on the sound of the detected frequency component; a correcting unit for correcting the level of at least one of the sound signals using the obtained correction coefficient; and a processing unit for performing a sound process based on the sound signal with the corrected level.

    摘要翻译: 声音处理装置包括:多个声音输入单元; 检测单元,用于检测输入到多个声音信号单元的每个声音的频率分量,每个声音从大致垂直于由多个声音输入单元中的两个声音输入单元的布置位置确定的线的方向到达; 校正系数单元,用于获得用于校正由两个声音输入单元从输入声音产生的至少一个声音信号的电平的校正系数,以便基于声音的声音来匹配声音信号的电平 检测频率分量; 校正单元,用于使用所获得的校正系数校正至少一个声音信号的电平; 以及处理单元,用于基于具有校正电平的声音信号执行声音处理。

    STATE DETECTING APPARATUS, COMMUNICATION APPARATUS, AND STORAGE MEDIUM STORING STATE DETECTING PROGRAM
    3.
    发明申请
    STATE DETECTING APPARATUS, COMMUNICATION APPARATUS, AND STORAGE MEDIUM STORING STATE DETECTING PROGRAM 有权
    状态检测装置,通信装置和存储媒体存储状态检测程序

    公开(公告)号:US20130006630A1

    公开(公告)日:2013-01-03

    申请号:US13446019

    申请日:2012-04-13

    IPC分类号: G10L15/06

    摘要: A state detecting apparatus includes: a processor to execute acquiring utterance data related to uttered speech, computing a plurality of statistical quantities for feature parameters regarding features of the utterance data, creating, on the basis of the plurality of statistical quantities regarding the utterance data and another plurality of statistical quantities regarding reference utterance data based on other uttered speech, pseudo-utterance data having at least one statistical quantity equal to a statistical quantity in the other plurality of statistical quantities, computing a plurality of statistical quantities for synthetic utterance data synthesized on the basis of the pseudo-utterance data and the utterance data, and determining, on the basis of a comparison between statistical quantities of the synthetic utterance data and statistical quantities of the reference utterance data, whether the speaker who produced the uttered speech is in a first state or a second state; and a memory.

    摘要翻译: 一种状态检测装置,包括:处理器,用于执行获取与发话语音有关的话语数据,计算关于话语数据特征的特征参数的多个统计量,基于关于话语数据的多个统计量,以及 关于基于其他语音的参考话语数据的另外多个统计量,具有至少一个等于其他多个统计量中的统计量的统计量的伪话语数据,计算合成语音数据的多个统计量 伪话音数据和发声数据的基础,并且基于合成发音数据的统计量与参考语音数据的统计量之间的比较,确定产生发言语音的演讲者是否处于 第一状态或第二状态; 和记忆。

    MICROPHONE ARRAY DEVICE
    4.
    发明申请
    MICROPHONE ARRAY DEVICE 有权
    麦克风阵列设备

    公开(公告)号:US20110286604A1

    公开(公告)日:2011-11-24

    申请号:US13107497

    申请日:2011-05-13

    申请人: Naoshi MATSUO

    发明人: Naoshi MATSUO

    IPC分类号: G10K11/16

    摘要: A microphone array device includes a first sound reception unit configured to obtain a first sound signal that is input from a first microphone, a second sound reception unit configured to obtain a second sound signal that is input from a second microphone, a noise state evaluation unit configured to compare the first sound signal and the second sound signal and to obtain an evaluation parameter to evaluate an influence of a non-target sound included in the second sound signal on a target sound included in the first sound signal according to a result of the comparison, a subtraction adjustment unit configured to set a suppression amount for the second sound signal based on the evaluation parameter and to generate a third sound signal; and a subtraction unit configured to generate a signal to be output based on the third sound signal and the first sound signal.

    摘要翻译: 麦克风阵列装置包括:第一声音接收单元,被配置为获得从第一麦克风输入的第一声音信号;第二声音接收单元,被配置为获得从第二麦克风输入的第二声音信号;噪声状态评估单元 被配置为比较第一声音信号和第二声音信号,并且获得评估参数,以根据第一声音信号的结果来评估包括在第二声音信号中的非目标声音对包括在第一声音信号中的目标声音的影响 比较,减法调整单元,被配置为基于评估参数设置第二声音信号的抑制量并产生第三声音信号; 以及减法单元,被配置为基于所述第三声音信号和所述第一声音信号来生成要输出的信号。

    Utterance state detection device and utterance state detection method
    5.
    发明申请
    Utterance state detection device and utterance state detection method 有权
    发音状态检测装置和发声状态检测方法

    公开(公告)号:US20110282666A1

    公开(公告)日:2011-11-17

    申请号:US13064871

    申请日:2011-04-21

    IPC分类号: G10L17/00

    CPC分类号: G10L17/26 G10L25/48

    摘要: An utterance state detection device includes an user voice stream data input unit that gets user voice stream data of an user, a frequency element extraction unit that extracts high frequency elements by frequency-analyzing the user voice stream data, a fluctuation degree calculation unit that calculates a fluctuation degree of the high frequency elements thus extracted every unit time, a statistic calculation unit that calculates a statistic every certain interval based on a plurality of the fluctuation degrees in a certain period of time, and an utterance state detection unit that detects an utterance state of a specified user based on the statistic obtained from user voice stream data of the specified user.

    摘要翻译: 发声状态检测装置包括:用户语音流数据输入单元,其获取用户的用户语音流数据;频率元素提取单元,其通过对用户语音流数据进行频率分析来提取高频元素;波动度计算单元, 每单位时间提取的高频元件的波动程度,统计量计算单元,其基于一定时间段内的多个波动度计算每一定间隔的统计量;以及发声状态检测单元,其检测发音 基于从指定用户的用户语音流数据获得的统计量来指定用户的状态。

    Speech recognition system, speech recognition method and storage medium
    6.
    发明授权
    Speech recognition system, speech recognition method and storage medium 有权
    语音识别系统,语音识别方法和存储介质

    公开(公告)号:US08010359B2

    公开(公告)日:2011-08-30

    申请号:US11165120

    申请日:2005-06-24

    申请人: Naoshi Matsuo

    发明人: Naoshi Matsuo

    IPC分类号: G10L15/04

    CPC分类号: G10L21/028 G10L2015/228

    摘要: Provided are a speech recognition system, a method and a storage medium capable of, even in a case where plural speakers input superimposed speeches, recognizing a speech of an individual each speaker and making a single application program sharable among the speakers in execution. In a speech recognition system receiving speeches of plural speakers to execute a predetermined application program, the received speeches are separated according to the respective speakers if necessary, the received speeches of individual speakers are speech-recognized, results of speech recognition are matched with data items necessary for executing the application program, one of results of recognition of plural speeches which are found as a result of the matching to be overlapping is selected, and the results of recognition of plural speeches which are found as a result of the matching not to be overlapping are linked to the selected result of speech recognition.

    摘要翻译: 提供了一种语音识别系统,方法和存储介质,即使在多个扬声器输入叠加语音的情况下,识别每个扬声器的个人的语音并且使得在执行中的扬声器之间可共享一个应用程序。 在接收到多个扬声器的演讲以执行预定的应用程序的语音识别系统中,如果需要,则根据各个扬声器分离所接收的演讲,所接收到的各个扬声器的讲话被语音识别,语音识别结果与数据项匹配 选择执行应用程序所必需的结果之一,作为匹配结果被重复发现的多个演讲的识别结果之一被重叠,并且作为匹配结果而被找到的多个演讲的识别结果不是 重叠与所选择的语音识别结果相关联。

    SIGNAL PROCESSING APPARATUS, MICROPHONE ARRAY DEVICE, AND STORAGE MEDIUM STORING SIGNAL PROCESSING PROGRAM
    7.
    发明申请
    SIGNAL PROCESSING APPARATUS, MICROPHONE ARRAY DEVICE, AND STORAGE MEDIUM STORING SIGNAL PROCESSING PROGRAM 审中-公开
    信号处理设备,麦克风阵列设备和存储介质存储信号处理程序

    公开(公告)号:US20110158426A1

    公开(公告)日:2011-06-30

    申请号:US12977341

    申请日:2010-12-23

    申请人: Naoshi MATSUO

    发明人: Naoshi MATSUO

    IPC分类号: H04R3/00 H04B15/00

    CPC分类号: H04R3/005

    摘要: A signal processing apparatus includes: two sound input units, an orthogonal transformer to transform two sound signals input from the two sound input units into respective spectral signals in a frequency domain, a phase difference calculator to calculate a phase difference between the spectral signals in the frequency domain, a range determiner to determine a coefficient responsive to a frequency in the phase difference as a function of frequency, and determine a suppression range related to a phase on a per frequency basis of the frequency responsive to the coefficient; and a filter to phase-shift a component of one of the spectral signals on a per frequency basis in order to generate a phase-shifted spectral signal when the phase difference at each frequency falls within the suppression range, synthesizing the phase-shifted spectral signal and the other of the spectral signals in order to generate a filtered spectral signal.

    摘要翻译: 信号处理装置包括:两个声音输入单元,将从两个声音输入单元输入的两个声音信号变换成频域的各个频谱信号的正交变换器,相位差计算器,用于计算频域信号中的频谱信号之间的相位差 频域,范围确定器,用于确定响应于作为频率的函数的相位差中的频率的系数,并且响应于系数确定与基于每频率的频率相关的抑制范围; 以及滤波器,用于在每个频率基础上相移一个频谱信号的分量,以便当每个频率的相位差落入抑制范围内时产生相移频谱信号,合成相移频谱信号 和另一个光谱信号,以便产生滤波的光谱信号。

    SOUND PROCESSOR, SOUND PROCESSING METHOD AND RECORDING MEDIUM STORING SOUND PROCESSING PROGRAM
    8.
    发明申请
    SOUND PROCESSOR, SOUND PROCESSING METHOD AND RECORDING MEDIUM STORING SOUND PROCESSING PROGRAM 有权
    声音处理器,声音处理方法和录音中央存储声音处理程序

    公开(公告)号:US20110019832A1

    公开(公告)日:2011-01-27

    申请号:US12860322

    申请日:2010-08-20

    IPC分类号: H04B3/20

    摘要: A sound processor includes a conversion unit converts a reference sound signal corresponding to a base of sound to be output and an observation sound signal based on each of sound signals output by a plurality of sound receiving units into frequency components, an echo suppression unit estimates echo derived from sound based on a converted reference sound signal and suppressing the estimated echo in a converted observation sound signal, a noise suppression unit estimates noise based on an arrival direction of sound and suppressing the estimated noise in the converted observation sound signal and an integrating process unit suppresses, with respect to each frequency component, echo and noise in the converted sound signal based on a observation sound signal obtained after echo suppression and a observation sound signal obtained after noise suppression.

    摘要翻译: 声音处理器包括转换单元,将基于要输出的声音的基准的参考声音信号和基于由多个声音接收单元输出的每个声音信号的观测声音信号转换为频率分量,回波抑制单元估计回波 基于转换的参考声音信号导出的声音并抑制转换的观测声音信号中的估计回波,噪声抑制单元基于声音的到达方向估计噪声,并抑制转换的观测声音信号中的估计噪声,以及积分处理 基于在回波抑制之后获得的观察声音信号和在噪声抑制之后获得的观察声音信号,单位对于每个频率分量抑制转换的声音信号中的回波和噪声。

    ECHO SUPPRESSING SYSTEM, ECHO SUPPRESSING METHOD, RECORDING MEDIUM, ECHO SUPPRESSOR, SOUND OUTPUT DEVICE, AUDIO SYSTEM, NAVIGATION SYSTEM AND MOBILE OBJECT
    9.
    发明申请
    ECHO SUPPRESSING SYSTEM, ECHO SUPPRESSING METHOD, RECORDING MEDIUM, ECHO SUPPRESSOR, SOUND OUTPUT DEVICE, AUDIO SYSTEM, NAVIGATION SYSTEM AND MOBILE OBJECT 有权
    ECHO抑制系统,ECHO抑制方法,记录介质,ECHO抑制器,声音输出设备,音频系统,导航系统和移动对象

    公开(公告)号:US20100191527A1

    公开(公告)日:2010-07-29

    申请号:US12756768

    申请日:2010-04-08

    IPC分类号: G10L21/02 H04B3/20

    摘要: An echo suppressing system includes: a sound output device for outputting sound based on a sound signal, including a passing section for allowing passage of a component of a different frequency band, and a plurality of sound output sections, each of which outputs sound based on each of the plurality of sound signals passed through the passing section; a summer for summing the plurality of sound signals to generate a reference sound signal; a sound input device for converting input sound into a sound signal; and an echo suppressor for suppressing echo based on the sound output by the sound output device, including an input section to which a sound signal is input from the sound input device as an observation sound signal, and a correction section for correcting the observation sound signal so as to suppress echo included in the observation sound signal.

    摘要翻译: 回波抑制系统包括:声音输出装置,用于输出基于声音信号的声音,包括允许不同频带的分量通过的通过部分和多个声音输出部分,每个声音输出部分基于 多个声音信号中的每一个通过通过部分; 用于对所述多个声音信号求和以产生参考声音信号的加法器; 用于将输入声音转换成声音信号的声音输入装置; 以及基于声音输出装置输出的声音抑制回波的回波抑制器,包括从声音输入装置输入声音信号的输入部分作为观察声音信号,以及校正部分,用于校正观察声音信号 以抑制观察声信号中包含的回波。

    COMPUTER-READABLE MEDIUM FOR RECORDING AUDIO SIGNAL PROCESSING ESTIMATING PROGRAM AND AUDIO SIGNAL PROCESSING ESTIMATING DEVICE
    10.
    发明申请
    COMPUTER-READABLE MEDIUM FOR RECORDING AUDIO SIGNAL PROCESSING ESTIMATING PROGRAM AND AUDIO SIGNAL PROCESSING ESTIMATING DEVICE 有权
    用于记录音频信号处理估计程序和音频信号处理估计设备的计算机可读介质

    公开(公告)号:US20100138220A1

    公开(公告)日:2010-06-03

    申请号:US12621918

    申请日:2009-11-19

    IPC分类号: G10L21/02

    摘要: A computer-readable medium recording a program allowing a computer to execute: setting a plurality of frames on a common time axis between a first waveform of an input to the audio processing and a second waveform of an output from the audio processing, detecting a voice frame and a noise frame in the first and second waveform, calculating a first and second spectrum from the first and second waveform, adjusting the level of the first or second spectrum of the noise frame, and setting the adjusted first and second spectrum of the noise frame as a third and fourth spectrum, calculating a distortion amount of the noise frame from the third and fourth spectrum, estimating a noise model spectrum from the first or second spectrum, and calculating a distortion amount of the voice frame from the first and second spectrum of the voice frame at the selected frequency.

    摘要翻译: 一种记录程序的计算机可读介质,其允许计算机执行:在音频处理的输入的第一波形与来自音频处理的输出的第二波形之间的公共时间轴上设置多个帧,检测声音 在第一和第二波形中的帧和噪声帧,从第一和第二波形计算第一和第二频谱,调整噪声帧的第一或第二频谱的电平,以及设置调整的噪声的第一和第二频谱 帧,作为第三和第四频谱,计算来自第三和第四频谱的噪声帧的失真量,估计来自第一或第二频谱的噪声模型频谱,以及从第一和第二频谱计算语音帧的失真量 在所选频率下的语音帧。