专利检索 cpc:"G10L25/75" 第 1 页

1.

发明申请
Estimating Clean Speech Features Using Manifold Modeling 审中-公开

公开(公告)号：US20170316790A1

公开(公告)日：2017-11-02

申请号：US15140081

申请日：2016-04-27

申请人： KnuEdge Incorporated

发明人： Bengt Jonas Borgstrom

IPC分类号： G10L21/0232 , G10L15/14 , G10L13/07 , G10L15/02 , G10L25/84 , G10L15/20

CPC分类号： G10L21/0232 , G10L13/04 , G10L13/06 , G10L13/07 , G10L15/02 , G10L15/142 , G10L15/20 , G10L17/20 , G10L25/75 , G10L25/84

摘要： The technology described in this document can be embodied in a computer-implemented method that includes receiving, at one or more processing devices, a portion of an input signal representing noisy speech, and extracting, from the portion of the input signal, one or more frequency domain features of the noisy speech. The method also includes generating a set of projected features by projecting each of the one or more frequency domain features on a manifold that represents a model of frequency domain features for clean speech. The method further includes using the set of projected features for at least one of: a) generating synthesized speech that represents a noise-reduced version of the noisy speech, b) performing speaker recognition, or c) performing speech recognition.

2.

发明申请
ADAPTIVE VOICE AUTHENTICATION SYSTEM AND METHOD 有权

公开(公告)号：US20170140760A1

公开(公告)日：2017-05-18

申请号：US15064740

申请日：2016-03-09

申请人： Uniphore Software Systems

发明人： Umesh SACHDEV

IPC分类号： G10L17/04 , G10L17/06 , G10L17/02

CPC分类号： G10L17/04 , G10L17/02 , G10L17/06 , G10L25/15 , G10L25/24 , G10L25/75

摘要： An adaptive voice authentication system is provided. The adaptive voice authentication system includes an adaptive module configured to compare a feature quality index of the plurality of authentication features and the plurality of enrolment features and dynamically replace and store one or more enrolment features with one or more authentication features to form a plurality of updated enrolment features. The adaptive module is configured to generate an updated enrolment voice print model from the plurality of the updated enrolment features. The adaptive module is further configured to compare the updated enrolment voice print model with the previously stored enrolment voice print model and dynamically update the previously stored enrolment voice print model with the updated enrolment voice print model based on a model quality index.

3.

发明申请
System and Method for Evaluating Vocal Function Using an Impedance-Based Inverse Filtering of Neck Surface Acceleration 审中-公开
标题翻译：使用基于阻抗的颈部表面加速反过滤来评估声乐功能的系统和方法

公开(公告)号：US20170014082A1

公开(公告)日：2017-01-19

申请号：US15278007

申请日：2016-09-27

申请人： The General Hospital Corporation , Perdue Research Foundation

发明人： Matias Zanartu , Julio C. Ho , Daryush D Mehta , George R. Wodicka , Robert E. Hillman

IPC分类号： A61B5/00 , G10L25/75 , A61B5/087 , A61B5/107 , A61B5/11

CPC分类号： A61B5/725 , A61B5/087 , A61B5/1075 , A61B5/1107 , A61B5/4803 , A61B5/7278 , A61B2562/0219 , G10L25/27 , G10L25/75 , G10L2015/226

摘要： A system and method to assess vocal function of a subject. The system includes an accelerometer configured to acquire surface acceleration data associated with vocal functionality of the subject and a computer system configured to analyze the surface acceleration data and to estimate glottal airflow waveforms produced by the subject based on the surface acceleration data. The computer system performs the analysis and estimation by applying an inverse filter to the surface acceleration data based on a calibrated transmission line model and generates an indication of vocal functionality of the subject based on the estimated glottal airflow waveforms.

摘要翻译： 评估主体声乐功能的系统和方法。该系统包括加速度计，其被配置为获取与被摄体的声乐功能相关联的表面加速度数据，以及被配置为分析表面加速度数据并基于表面加速度数据估计由对象产生的声门气流波形的计算机系统。计算机系统通过基于经校准的传输线模型对表面加速度数据应用逆滤波器来执行分析和估计，并且基于估计的声门气流波形来生成对象的声乐功能的指示。

4.

发明申请
Methods and Systems for Voice Conversion 有权
标题翻译：语音转换方法与系统

公开(公告)号：US20160005403A1

公开(公告)日：2016-01-07

申请号：US14631464

申请日：2015-02-25

申请人： Google Inc.

发明人： Ioannis Agiomyrgiannakis , Zoi Roupakia

IPC分类号： G10L17/00

CPC分类号： G10L15/07 , G10L17/06 , G10L25/75 , G10L2021/0135

摘要： A device may receive data indicative of a plurality of speech sounds associated with first voice characteristics of a first voice. The device may receive an input indicative of speech associated with second voice characteristics of a second voice. The device may map at least one portion of the speech of the second voice to one or more speech sounds of the plurality of speech sounds of the first voice. The device may compare the first voice characteristics with the second voice characteristics based on the map. The comparison may include vocal tract characteristics, nasal cavity characteristics, and voicing characteristics. The device may determine a given representation configured to associate the first voice characteristics with the second voice characteristics. The device may provide an output indicative of pronunciations of the one or more speech sounds of the first voice according to the second voice characteristics based on the given representation.

摘要翻译： 设备可以接收指示与第一语音的第一语音特征相关联的多个语音的数据。设备可以接收指示与第二语音的第二语音特征相关联的语音的输入。设备可以将第二语音的语音的至少一部分映射到第一语音的多个语音的一个或多个语音。设备可以基于地图将第一语音特征与第二语音特征进行比较。比较可以包括声道特征，鼻腔特征和发音特征。设备可以确定被配置为将第一语音特征与第二语音特征相关联的给定表示。该装置可以基于给定的表示，根据第二语音特征提供指示第一语音的一个或多个语音的发音的输出。

5.

发明申请
AUDIO PROCESSING APPARATUS 审中-公开
标题翻译：音频处理设备

公开(公告)号：US20130226593A1

公开(公告)日：2013-08-29

申请号：US13883610

申请日：2010-11-12

申请人： Birgir Magnusson , Koray Ozcan

发明人： Birgir Magnusson , Koray Ozcan

IPC分类号： G10L25/75

CPC分类号： G10L25/75 , G11B27/038 , H04N5/765 , H04N9/806 , H04R1/406 , H04R3/005

摘要： An apparatus comprising: an audio source determiner configured to determine at least one audio source; a visualizer configured to generate a visual representation associated with the at least one audio source; and a controller configured to process an audio signal associated with the at least one audio source dependent on interaction with the visual representation.

摘要翻译： 一种装置，包括：音频源确定器，被配置为确定至少一个音频源; 被配置为生成与所述至少一个音频源相关联的视觉表示的可视化器; 以及控制器，被配置为根据与视觉表示的交互来处理与所述至少一个音频源相关联的音频信号。

6.

发明公开
GENERATING IMAGE DATA OF A VIRTUAL OBJECT BASED ON A FUSED AUDIO FEATURE 审中-公开

公开(公告)号：US20240282033A1

公开(公告)日：2024-08-22

申请号：US18649772

申请日：2024-04-29

申请人： Tencent Technology (Shenzhen) Company Limited

发明人： Guinan SU , Gao WU , Zhifeng LI , Wei LIU , Yushi CUI

IPC分类号： G06T13/20 , G06T13/40 , G10L15/18 , G10L25/24 , G10L25/75

CPC分类号： G06T13/205 , G06T13/40 , G10L15/1815 , G10L25/24 , G10L25/75

摘要： A video generation method includes obtaining audio data and initial image data of a virtual object, extracting an audio feature from the audio data, and performing predictive encoding on the audio data to obtain an encoded feature representing vocal channel characteristics of the audio data. The method further includes fusing the audio feature and the encoded feature to obtain a fused audio feature, and generating updated image data of the virtual object according to the fused audio feature and the initial image data. The method further includes generating video data including the updated image data and the audio data

7.

发明授权
Emotion estimation apparatus, emotion estimation method, and computer readable recording medium 有权

公开(公告)号：US11984136B2

公开(公告)日：2024-05-14

申请号：US17433694

申请日：2019-02-28

申请人： NEC Corporation

发明人： Takayuki Arakawa

IPC分类号： G10L25/63 , A61B5/00 , A61B5/16 , G10L25/18 , G10L25/75

CPC分类号： G10L25/63 , A61B5/165 , A61B5/4803 , G10L25/18 , G10L25/75

摘要： An emotion estimation apparatus 1 includes: a generation unit 2 configured to generate acoustic characteristic information indicating an acoustic characteristic using a first acoustic signal output to the ear canal and a second acoustic signal produced by the first acoustic signal echoing inside the body; and an estimation unit 3 configured to estimate emotion using the acoustic characteristic information.

8.

发明授权
Method and system for automatic detection and correction of sound caused by facial coverings 有权

公开(公告)号：US11967332B2

公开(公告)日：2024-04-23

申请号：US17477592

申请日：2021-09-17

申请人： International Business Machines Corporation

发明人： Girmaw Abebe Tadesse , Michael S. Gordon , Komminist Weldemariam

IPC分类号： G10L21/0232 , G10L25/60 , G10L25/75

CPC分类号： G10L21/0232 , G10L25/60 , G10L25/75

摘要： A computer-implemented method for correcting muffled speech caused by facial coverings is disclosed. The computer-implemented method includes monitoring a user's speech for speech distortion. The computer-implemented method further includes determining that the user's speech is distorted. The computer-implemented method further includes determining that a cause of the user's speech distortion is based, at least in part, on a presence of a particular type of facial covering. The computer-implemented method further includes automatically correcting the speech distortion of the user based, at least in part, on the particular type of facial covering causing the speech distortion.

9.

发明授权
Adaptive voice authentication system and method 有权

公开(公告)号：US09940934B2

公开(公告)日：2018-04-10

申请号：US15064740

申请日：2016-03-09

申请人： Uniphore Software Systems

发明人： Umesh Sachdev

IPC分类号： G10L15/00 , G10L17/04 , G10L17/02 , G10L17/06 , G10L25/24 , G10L25/75 , G10L25/15

CPC分类号： G10L17/04 , G10L17/02 , G10L17/06 , G10L25/15 , G10L25/24 , G10L25/75

摘要： An adaptive voice authentication system is provided. The adaptive voice authentication system includes an adaptive module configured to compare a feature quality index of the plurality of authentication features and the plurality of enrollment features and dynamically replace and store one or more enrollment features with one or more authentication features to form a plurality of updated enrollment features. The adaptive module is configured to generate an updated enrollment voice print model from the plurality of the updated enrollment features. The adaptive module is further configured to compare the updated enrollment voice print model with the previously stored enrollment voice print model and dynamically update the previously stored enrollment voice print model with the updated enrollment voice print model based on a model quality index.

10.

发明授权
Devices and methods for use of phase information in speech synthesis systems 有权

公开(公告)号：US09865247B2

公开(公告)日：2018-01-09

申请号：US14631583

申请日：2015-02-25

申请人： Google LLC

发明人： Ioannis Agiomyrgiannakis , Byung Ha Chun

IPC分类号： G10L13/08 , G10L25/75 , G10L13/02

CPC分类号： G10L13/02 , G10L13/08 , G10L25/75

摘要： A device may receive a speech signal. The device may determine acoustic feature parameters for the speech signal. The acoustic feature parameters may include phase data. The device may determine circular space representations for the phase data based on an alignment of the phase data with given axes of the circular space representations. The device may map the phase data to linguistic features based on the circular space representations. The linguistic features may be associated with linguistic content that includes phonemic content or text content. The device may provide a synthetic audio pronunciation of the linguistic content based on the mapping.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类