System and method for F0 transfer learning for improving F0 prediction with deep neural network models

Invention Grant

US11302307B2 System and method for F0 transfer learning for improving F0 prediction with deep neural network models 有权

Please log in to see more content

Patent Title: System and method for F0 transfer learning for improving F0 prediction with deep neural network models
Application No.: US16448384

Application Date: 2019-06-21
Publication No.: US11302307B2

Publication Date: 2022-04-12
Inventor: Elluru Veera Raghavendra , Aravind Ganapathiraju
Applicant: Genesys Telecommunications Laboratories, Inc.
Applicant Address: US CA Daly City
Assignee: Genesys Telecommunications Laboratories, Inc.
Current Assignee: Genesys Telecommunications Laboratories, Inc.
Current Assignee Address: US CA Daly City
Main IPC: G10L15/16
IPC: G10L15/16 ; G10L15/06 ; G06K9/62 ; G10L13/04 ; G10L15/18

System and method for F0 transfer learning for improving F0 prediction with deep neural network models

Abstract:

A system and method are presented for F0 transfer learning for improving F0 prediction with deep neural network models. Larger models are trained using long short-term memory (LSTM) and multi-layer perceptron (MLP) feed-forward hidden layer modeling. The fundamental frequency values for voiced and unvoiced segments are identified and extracted from the larger models. The values for voiced regions are transferred and applied to training a smaller model and the smaller model is applied in the text to speech system for real-time speech synthesis output.

Public/Granted literature

US20190392815A1 SYSTEM AND METHOD FOR F0 TRANSFER LEARNING FOR IMPROVING F0 PREDICTION WITH DEEP NEURAL NETWORK MODELS Public/Granted day:2019-12-26

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/08	.语音分类或检索
G10L15/16	..利用人工神经网络