Patent search ipc:"G10L21/057" Page 1

1.

发明授权
Automated audio tuning and compensation procedure 有权

公开(公告)号：US12192737B2

公开(公告)日：2025-01-07

申请号：US17952191

申请日：2022-09-23

Applicant: Biamp Systems, LLC

Inventor： Zach Snook , Eugene F. Goff , Raymond J. Dippert , Matthew V. Kotvis , Samarth Behura

IPC: H04S7/00 , G10L21/057 , H04R3/12

Abstract: An example may include detecting, via a controller, one or more microphones and one or more speakers in an area, measuring, via the one or more microphones, an initial frequency response of an audio signal generated by the one or more speakers inside the area and generating an initial room performance rating, comparing the initial frequency response to a target frequency response, creating audio compensation values to apply to the one or more speakers based on the comparison, and applying the audio compensation values to the one or more speakers.

2.

发明申请
DIAGNOSING AND TREATMENT OF SPEECH PATHOLOGIES USING ANALYSIS BY SYNTHESIS TECHNOLOGY 有权

公开(公告)号：US20210158834A1

公开(公告)日：2021-05-27

申请号：US17046777

申请日：2019-04-17

Applicant: Ninispeech Ltd.

Inventor： Yoav MEDAN , Shai SHAPIRA

IPC: G10L21/057 , G10L17/04 , G10L17/26 , G10L15/06 , G10L13/033

Abstract: There are provided herein, a method and system for creating a speech/language pathologies classifier, the method comprising: producing a pathological speech repository of pathological speech samples of multiple impairments; computing speech qualities/pathologies, based on data receive from the pathological speech repository; producing a text repository, the text repository comprises multiple known text passages; converting each one of a selection of the text passages from the multiple known text passages, to a speech segment, while introducing to the speech segment one or more of the computed speech pathologies, thereby creating multiple synthetic impaired speech segments; and training a classifier with the multiple synthetic impaired speech segments thereby creating a speech/language pathologies classifier.

3.

发明授权
Sound enhancement through reverberation matching 有权

公开(公告)号：US10079028B2

公开(公告)日：2018-09-18

申请号：US14963175

申请日：2015-12-08

Applicant: ADOBE SYSTEMS INCORPORATED

Inventor： Ramin Anushiravani , Paris Smaragdis , Gautham Mysore

IPC: H04R1/40 , G10L21/057 , G10L25/48 , H04S7/00 , H03G5/00 , G10L21/0208

CPC classification number: G10L21/057 , G10L21/02 , G10L21/028 , G10L25/48 , G10L2021/02082 , H04S7/305 , H04S2400/15

Abstract: Embodiments of the present invention relate to enhancing sound through reverberation matching. In sonic implementations, a first sound recording recorded in a first environment is received. The first sound recording is decomposed to a first clean signal and a first reverb kernel. A second reverb kernel corresponding with a second sound recording recorded in a second environment is accessed, for example, based on a user indication to enhance the first sound recording to sound as though recorded in the second environment. An enhanced sound recording is generated based on the first clean signal and the second reverb kernel. The enhanced sound recording is a modification of the first sound recording to sound as though recorded in the second environment.

4.

发明授权
Speech processing device and method 有权

公开(公告)号：US09672809B2

公开(公告)日：2017-06-06

申请号：US14260449

申请日：2014-04-24

Applicant: FUJITSU LIMITED

Inventor： Taro Togawa , Chisato Shioda , Takeshi Otani

IPC: G10L13/027 , G10L15/02 , G10L15/04 , G10L15/08 , G10L21/0364 , G10L21/057

CPC classification number: G10L13/027 , G10L15/02 , G10L15/04 , G10L15/08 , G10L21/0364 , G10L21/057

Abstract: A speech processing device includes a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute: obtaining input speech, detecting a vowel segment contained in the input speech, estimating an accent segment contained in the input speech, calculating a first vowel segment length containing the accent segment and a second vowel segment length excluding the accent segment, and controlling at least one of the first vowel segment length and the second vowel segment length.

5.

发明授权
Devices that train voice patterns and methods thereof 有权
Title translation: 训练语音模式的装置及其方法

公开(公告)号：US09532897B2

公开(公告)日：2017-01-03

申请号：US14332679

申请日：2014-07-16

Applicant: Purdue Research Foundation

Inventor： Jessica E. Huber , Scott Kepner , Derek Tully , James Thomas Jones , Kirk Solon Foster

IPC: H04R25/02 , A61F5/58 , G09B19/04 , G09B21/00 , H04R3/00 , G10L21/0364 , G10L21/057 , H04R1/10

CPC classification number: A61F5/58 , G09B19/04 , G09B21/00 , G10L2021/03646 , G10L2021/0575 , H04R1/1016 , H04R1/1091 , H04R3/00 , H04R2420/07 , H04R2420/09 , H04R2460/13

Abstract: A voice enhancement device including an earpiece configured to be positioned in an ear canal of a user. A microcontroller is operatively coupled to the earpiece. The microcontroller is configured to selectively provide at least multitalker babble. An accelerometer is located within the earpiece and operatively coupled to the microcontroller. The accelerometer is configured to detect speech by the user and communicate with the microcontroller to provide the multitalker babble to the earpiece during the detected speech by the user. A method of making the voice enhancement device, and a method for increasing vocal loudness in a patient using the voice enhancement device are also disclosed.

Abstract translation: 一种语音增强设备，包括被配置为定位在用户的耳道中的耳机。微控制器可操作地耦合到耳机。微控制器被配置为选择性地提供至少多节点跳跃。加速度计位于听筒内并且可操作地耦合到微控制器。加速度计被配置为由用户检测语音并与微控制器通信，以在用户检测到的语音期间向听筒提供多声道跳跃。还公开了一种制作语音增强设备的方法以及使用该语音增强设备增加患者声乐响度的方法。

6.

发明授权
Improving voice communication over a network 有权

公开(公告)号：US09286889B2

公开(公告)日：2016-03-15

申请号：US13752503

申请日：2013-01-29

Applicant: International Business Machines Corporation

Inventor： Dimitri Kanevsky , Pamela A. Nesbitt , Tara N. Sainath , Elizabeth V. Woodward

IPC: G10L21/00 , G10L21/02 , G10L15/08 , G10L15/18 , G10L15/26 , G10L21/003 , G10L21/057 , G10L25/60 , H04M3/56

CPC classification number: G10L15/08 , G10L15/18 , G10L15/26 , G10L21/003 , G10L21/02 , G10L21/057 , G10L25/60 , H04M3/56

Abstract: Systems and methods for improving communication over a network are provided. A system for improving communication over a network, comprises a detection module capable of detecting data indicating a problem with a communication between at least two participants communicating via communication devices over the network, a management module capable of analyzing the data to determine whether a participant is dissatisfied with the communication, wherein the management module includes a determining module capable of determining that the participant is dissatisfied, and identifying an event causing the dissatisfaction, and a resolution module capable of providing a solution for eliminating the problem.

7.

发明授权
Filtering for detection of limited-duration distortion 有权
Title translation: 用于检测有限持续时间失真的滤波

公开(公告)号：US09129610B2

公开(公告)日：2015-09-08

申请号：US13590675

申请日：2012-08-21

Applicant: James Mulvey , Joseph Gaalaas

Inventor： James Mulvey , Joseph Gaalaas

IPC: H04B15/00 , G10L21/057

CPC classification number: G10L21/057

Abstract: Processing a signal includes: receiving data that includes an input signal; filtering the input signal to generate a filtered signal, such that if the input signal includes at least one instance of a nonlinear distortion of a desired signal then the filtered signal includes a signature signal corresponding to the nonlinear distortion, the nonlinear distortion characterized by a time duration that is within a predetermined range; and detecting whether or not the filtered signal includes the signature signal.

Abstract translation: 处理信号包括：接收包括输入信号的数据; 对输入信号进行滤波以产生经滤波的信号，使得如果输入信号包括期望信号的非线性失真的至少一个情况，则滤波后的信号包括对应于非线性失真的签名信号，以时间为特征的非线性失真持续时间在预定范围内; 以及检测滤波后的信号是否包括签名信号。

8.

发明申请
AUDIO INPUT DEVICE 有权
Title translation: 音频输入设备

公开(公告)号：US20150120310A1

公开(公告)日：2015-04-30

申请号：US14582871

申请日：2014-12-24

Applicant: Roger ROBERTS

Inventor： Roger ROBERTS

IPC: G10L21/057 , G10L21/043 , G10L25/48

CPC classification number: G10L21/057 , G10L21/02 , G10L21/0316 , G10L21/043 , G10L25/48 , H04R1/1008 , H04R1/1091 , H04R25/04 , H04R25/552 , H04R25/554 , H04R25/70 , H04R2225/41 , H04R2225/61 , H04R2420/07

Abstract: An audio input device is provided which can include a number of features. In some embodiments, the audio input device includes a housing, a microphone carried by the housing, and a processor carried by the housing and configured to modify an input sound signal so as to amplify frequencies corresponding to a target human voice and diminish frequencies not corresponding to the target human voice. In another embodiment, an audio input device is configured to treat an auditory gap condition of a user by extending gaps in continuous speech and outputting the modified speech to the user. In another embodiment, the audio input device is configured to treat a dichotic hearing condition of a user. Methods of use are also described.

Abstract translation: 提供了可以包括多个特征的音频输入设备。在一些实施例中，音频输入设备包括外壳，由外壳承载的麦克风和由外壳承载并被配置为修改输入声音信号的处理器，以便放大对应于目标人声的频率并减少不对应的频率到目标人的声音。在另一个实施例中，音频输入设备被配置为通过扩展连续语音中的间隙并将修改的语音输出给用户来处理用户的听觉间隙状况。在另一个实施例中，音频输入设备被配置为治疗用户的双耳听觉状况。还描述了使用方法。

9.

发明申请
ACOUSTIC SIGNAL PROCESSING DEVICE AND ACOUSTIC SIGNAL PROCESSING METHOD 有权
Title translation: 声音信号处理装置和声音信号处理方法

公开(公告)号：US20150030171A1

公开(公告)日：2015-01-29

申请号：US14381989

申请日：2013-01-23

Applicant: CLARION CO., LTD

Inventor： Takeshi Hashimoto , Tetsuo Watanabe

IPC: G10L21/057 , G10K11/175

CPC classification number: G10L21/057 , G10H1/0091 , G10H1/02 , G10H2210/281 , G10K11/175 , G10L19/025 , G10L21/02 , G10L21/0364 , H04R3/04 , H04R2227/007

Abstract: Provided is an acoustic signal processing device for producing an output sound meeting listener's preferences by adjusting attack sound, reverberation, and noise component. The device includes: an FFT section for transforming an input audio signal from a time-domain to a frequency-domain to calculate a frequency spectrum signal and for generating a first amplitude spectrum signal and a phase spectrum signal; an attack component controller (10) for controlling an attack component of the first amplitude spectrum signal to generate a second amplitude spectrum signal; a reverberation component controller (20) for controlling a reverberation component of the first amplitude spectrum signal to generate a third amplitude spectrum signal; a first adding section (40) for synthesizing the first amplitude spectrum signal, the second amplitude spectrum signal, and the third amplitude spectrum signal to generate a fourth amplitude spectrum signal; and an IFFT section for generating an audio signal transformed from a frequency domain to a time domain based on the fourth amplitude spectrum signal and the phase spectrum signal generated by the FFT section.

Abstract translation: 提供一种声音信号处理装置，用于通过调整攻击声，混响和噪声分量来产生会议听众的喜好的输出声音。该装置包括：FFT部分，用于将输入音频信号从时域变换到频域，以计算频谱信号，并产生第一振幅谱信号和相位谱信号; 攻击部件控制器（10），用于控制第一幅度频谱信号的攻击分量以产生第二幅度频谱信号; 混响分量控制器（20），用于控制第一幅度频谱信号的混响分量以产生第三幅度频谱信号; 用于合成第一幅度频谱信号的第一加法部分（40），第二幅度频谱信号和第三幅度频谱信号以产生第四幅度频谱信号; 以及IFFT部分，用于基于由FFT部分生成的第四幅度频谱信号和相位频谱信号，生成从频域变换到时域的音频信号。

10.

发明公开
Devices for Real-time Speech Output with Improved Intelligibility 审中-公开

公开(公告)号：US20240005944A1

公开(公告)日：2024-01-04

申请号：US17810172

申请日：2022-06-30

Applicant: David R. Baraff

Inventor： David R. Baraff , Gene Kang

IPC: G10L21/057 , G06N3/04 , H03M1/82

CPC classification number: G10L21/057 , G06N3/0454 , H03M1/82

Abstract: Real-time speech output with improved intelligibility are described. One example embodiment includes a device. The device includes a microphone configured to capture one or more frames of unintelligible speech from a user. The device also includes an analog-to-digital converter (ADC) configured to convert the one or more captured frames of unintelligible speech into a digital representation. Additionally, the device includes a computing device. The computing device is configured to receive the digital representation from the ADC. The computing device is also configured to apply a machine-learned model to the digital representation to generate one or more frames with improved intelligibility. Further, the computing device is configured to output the one or more frames with improved intelligibility. In addition, the device includes a digital-to-analog converter (DAC) configured to convert the one or more frames with improved intelligibility into an analog form. Yet further, the device includes a speaker.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification