-
公开(公告)号:US20210158834A1
公开(公告)日:2021-05-27
申请号:US17046777
申请日:2019-04-17
申请人: Ninispeech Ltd.
发明人: Yoav MEDAN , Shai SHAPIRA
IPC分类号: G10L21/057 , G10L17/04 , G10L17/26 , G10L15/06 , G10L13/033
摘要: There are provided herein, a method and system for creating a speech/language pathologies classifier, the method comprising: producing a pathological speech repository of pathological speech samples of multiple impairments; computing speech qualities/pathologies, based on data receive from the pathological speech repository; producing a text repository, the text repository comprises multiple known text passages; converting each one of a selection of the text passages from the multiple known text passages, to a speech segment, while introducing to the speech segment one or more of the computed speech pathologies, thereby creating multiple synthetic impaired speech segments; and training a classifier with the multiple synthetic impaired speech segments thereby creating a speech/language pathologies classifier.
-
公开(公告)号:US10079028B2
公开(公告)日:2018-09-18
申请号:US14963175
申请日:2015-12-08
IPC分类号: H04R1/40 , G10L21/057 , G10L25/48 , H04S7/00 , H03G5/00 , G10L21/0208
CPC分类号: G10L21/057 , G10L21/02 , G10L21/028 , G10L25/48 , G10L2021/02082 , H04S7/305 , H04S2400/15
摘要: Embodiments of the present invention relate to enhancing sound through reverberation matching. In sonic implementations, a first sound recording recorded in a first environment is received. The first sound recording is decomposed to a first clean signal and a first reverb kernel. A second reverb kernel corresponding with a second sound recording recorded in a second environment is accessed, for example, based on a user indication to enhance the first sound recording to sound as though recorded in the second environment. An enhanced sound recording is generated based on the first clean signal and the second reverb kernel. The enhanced sound recording is a modification of the first sound recording to sound as though recorded in the second environment.
-
公开(公告)号:US09672809B2
公开(公告)日:2017-06-06
申请号:US14260449
申请日:2014-04-24
申请人: FUJITSU LIMITED
发明人: Taro Togawa , Chisato Shioda , Takeshi Otani
IPC分类号: G10L13/027 , G10L15/02 , G10L15/04 , G10L15/08 , G10L21/0364 , G10L21/057
CPC分类号: G10L13/027 , G10L15/02 , G10L15/04 , G10L15/08 , G10L21/0364 , G10L21/057
摘要: A speech processing device includes a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute: obtaining input speech, detecting a vowel segment contained in the input speech, estimating an accent segment contained in the input speech, calculating a first vowel segment length containing the accent segment and a second vowel segment length excluding the accent segment, and controlling at least one of the first vowel segment length and the second vowel segment length.
-
公开(公告)号:US09532897B2
公开(公告)日:2017-01-03
申请号:US14332679
申请日:2014-07-16
IPC分类号: H04R25/02 , A61F5/58 , G09B19/04 , G09B21/00 , H04R3/00 , G10L21/0364 , G10L21/057 , H04R1/10
CPC分类号: A61F5/58 , G09B19/04 , G09B21/00 , G10L2021/03646 , G10L2021/0575 , H04R1/1016 , H04R1/1091 , H04R3/00 , H04R2420/07 , H04R2420/09 , H04R2460/13
摘要: A voice enhancement device including an earpiece configured to be positioned in an ear canal of a user. A microcontroller is operatively coupled to the earpiece. The microcontroller is configured to selectively provide at least multitalker babble. An accelerometer is located within the earpiece and operatively coupled to the microcontroller. The accelerometer is configured to detect speech by the user and communicate with the microcontroller to provide the multitalker babble to the earpiece during the detected speech by the user. A method of making the voice enhancement device, and a method for increasing vocal loudness in a patient using the voice enhancement device are also disclosed.
摘要翻译: 一种语音增强设备,包括被配置为定位在用户的耳道中的耳机。 微控制器可操作地耦合到耳机。 微控制器被配置为选择性地提供至少多节点跳跃。 加速度计位于听筒内并且可操作地耦合到微控制器。 加速度计被配置为由用户检测语音并与微控制器通信,以在用户检测到的语音期间向听筒提供多声道跳跃。 还公开了一种制作语音增强设备的方法以及使用该语音增强设备增加患者声乐响度的方法。
-
公开(公告)号:US09286889B2
公开(公告)日:2016-03-15
申请号:US13752503
申请日:2013-01-29
IPC分类号: G10L21/00 , G10L21/02 , G10L15/08 , G10L15/18 , G10L15/26 , G10L21/003 , G10L21/057 , G10L25/60 , H04M3/56
CPC分类号: G10L15/08 , G10L15/18 , G10L15/26 , G10L21/003 , G10L21/02 , G10L21/057 , G10L25/60 , H04M3/56
摘要: Systems and methods for improving communication over a network are provided. A system for improving communication over a network, comprises a detection module capable of detecting data indicating a problem with a communication between at least two participants communicating via communication devices over the network, a management module capable of analyzing the data to determine whether a participant is dissatisfied with the communication, wherein the management module includes a determining module capable of determining that the participant is dissatisfied, and identifying an event causing the dissatisfaction, and a resolution module capable of providing a solution for eliminating the problem.
-
公开(公告)号:US09129610B2
公开(公告)日:2015-09-08
申请号:US13590675
申请日:2012-08-21
申请人: James Mulvey , Joseph Gaalaas
发明人: James Mulvey , Joseph Gaalaas
IPC分类号: H04B15/00 , G10L21/057
CPC分类号: G10L21/057
摘要: Processing a signal includes: receiving data that includes an input signal; filtering the input signal to generate a filtered signal, such that if the input signal includes at least one instance of a nonlinear distortion of a desired signal then the filtered signal includes a signature signal corresponding to the nonlinear distortion, the nonlinear distortion characterized by a time duration that is within a predetermined range; and detecting whether or not the filtered signal includes the signature signal.
摘要翻译: 处理信号包括:接收包括输入信号的数据; 对输入信号进行滤波以产生经滤波的信号,使得如果输入信号包括期望信号的非线性失真的至少一个情况,则滤波后的信号包括对应于非线性失真的签名信号,以时间为特征的非线性失真 持续时间在预定范围内; 以及检测滤波后的信号是否包括签名信号。
-
公开(公告)号:US20150120310A1
公开(公告)日:2015-04-30
申请号:US14582871
申请日:2014-12-24
申请人: Roger ROBERTS
发明人: Roger ROBERTS
IPC分类号: G10L21/057 , G10L21/043 , G10L25/48
CPC分类号: G10L21/057 , G10L21/02 , G10L21/0316 , G10L21/043 , G10L25/48 , H04R1/1008 , H04R1/1091 , H04R25/04 , H04R25/552 , H04R25/554 , H04R25/70 , H04R2225/41 , H04R2225/61 , H04R2420/07
摘要: An audio input device is provided which can include a number of features. In some embodiments, the audio input device includes a housing, a microphone carried by the housing, and a processor carried by the housing and configured to modify an input sound signal so as to amplify frequencies corresponding to a target human voice and diminish frequencies not corresponding to the target human voice. In another embodiment, an audio input device is configured to treat an auditory gap condition of a user by extending gaps in continuous speech and outputting the modified speech to the user. In another embodiment, the audio input device is configured to treat a dichotic hearing condition of a user. Methods of use are also described.
摘要翻译: 提供了可以包括多个特征的音频输入设备。 在一些实施例中,音频输入设备包括外壳,由外壳承载的麦克风和由外壳承载并被配置为修改输入声音信号的处理器,以便放大对应于目标人声的频率并减少不对应的频率 到目标人的声音。 在另一个实施例中,音频输入设备被配置为通过扩展连续语音中的间隙并将修改的语音输出给用户来处理用户的听觉间隙状况。 在另一个实施例中,音频输入设备被配置为治疗用户的双耳听觉状况。 还描述了使用方法。
-
8.
公开(公告)号:US20150030171A1
公开(公告)日:2015-01-29
申请号:US14381989
申请日:2013-01-23
申请人: CLARION CO., LTD
发明人: Takeshi Hashimoto , Tetsuo Watanabe
IPC分类号: G10L21/057 , G10K11/175
CPC分类号: G10L21/057 , G10H1/0091 , G10H1/02 , G10H2210/281 , G10K11/175 , G10L19/025 , G10L21/02 , G10L21/0364 , H04R3/04 , H04R2227/007
摘要: Provided is an acoustic signal processing device for producing an output sound meeting listener's preferences by adjusting attack sound, reverberation, and noise component. The device includes: an FFT section for transforming an input audio signal from a time-domain to a frequency-domain to calculate a frequency spectrum signal and for generating a first amplitude spectrum signal and a phase spectrum signal; an attack component controller (10) for controlling an attack component of the first amplitude spectrum signal to generate a second amplitude spectrum signal; a reverberation component controller (20) for controlling a reverberation component of the first amplitude spectrum signal to generate a third amplitude spectrum signal; a first adding section (40) for synthesizing the first amplitude spectrum signal, the second amplitude spectrum signal, and the third amplitude spectrum signal to generate a fourth amplitude spectrum signal; and an IFFT section for generating an audio signal transformed from a frequency domain to a time domain based on the fourth amplitude spectrum signal and the phase spectrum signal generated by the FFT section.
摘要翻译: 提供一种声音信号处理装置,用于通过调整攻击声,混响和噪声分量来产生会议听众的喜好的输出声音。 该装置包括:FFT部分,用于将输入音频信号从时域变换到频域,以计算频谱信号,并产生第一振幅谱信号和相位谱信号; 攻击部件控制器(10),用于控制第一幅度频谱信号的攻击分量以产生第二幅度频谱信号; 混响分量控制器(20),用于控制第一幅度频谱信号的混响分量以产生第三幅度频谱信号; 用于合成第一幅度频谱信号的第一加法部分(40),第二幅度频谱信号和第三幅度频谱信号以产生第四幅度频谱信号; 以及IFFT部分,用于基于由FFT部分生成的第四幅度频谱信号和相位频谱信号,生成从频域变换到时域的音频信号。
-
公开(公告)号:US20240005944A1
公开(公告)日:2024-01-04
申请号:US17810172
申请日:2022-06-30
申请人: David R. Baraff
发明人: David R. Baraff , Gene Kang
IPC分类号: G10L21/057 , G06N3/04 , H03M1/82
CPC分类号: G10L21/057 , G06N3/0454 , H03M1/82
摘要: Real-time speech output with improved intelligibility are described. One example embodiment includes a device. The device includes a microphone configured to capture one or more frames of unintelligible speech from a user. The device also includes an analog-to-digital converter (ADC) configured to convert the one or more captured frames of unintelligible speech into a digital representation. Additionally, the device includes a computing device. The computing device is configured to receive the digital representation from the ADC. The computing device is also configured to apply a machine-learned model to the digital representation to generate one or more frames with improved intelligibility. Further, the computing device is configured to output the one or more frames with improved intelligibility. In addition, the device includes a digital-to-analog converter (DAC) configured to convert the one or more frames with improved intelligibility into an analog form. Yet further, the device includes a speaker.
-
公开(公告)号:US11783846B2
公开(公告)日:2023-10-10
申请号:US17794230
申请日:2020-01-30
发明人: Yasufumi Uezu , Sadao Hiroya , Takemi Mochida
IPC分类号: G10L21/0232 , G10L25/60 , G10L21/0364 , G10L25/15 , G10L21/003 , G10L21/14 , G10L21/057
CPC分类号: G10L21/0232 , G10L21/003 , G10L21/0364 , G10L21/14 , G10L25/15 , G10L25/60 , G10L2021/03646 , G10L2021/0575
摘要: A training device changes feedback formant frequencies which are formant frequencies of a picked-up speech signal, applies a lowpass filter, converts the picked-up speech signal, adds high-pass noise to the converted speech signal, feeds back the converted speech signal with the high-pass noise added to a subject, calculates a compensatory response vector by using pickup formant frequencies which are formant frequencies of a speech signal acquired by picking up an utterance made by the subject while feeding back a speech signal that has been converted with change of the feedback formant frequencies to the subject, and pickup formant frequencies which are formant frequencies of a speech signal acquired by picking up an utterance made by the subject while feeding back a speech signal that has been converted without change of the feedback formant frequencies to the subject, and determines an evaluation based on the compensatory response vector and a correct compensatory response vector.
-
-
-
-
-
-
-
-
-