System and method for F0 transfer learning for improving F0 prediction with deep neural network models
Abstract:
A system and method are presented for F0 transfer learning for improving F0 prediction with deep neural network models. Larger models are trained using long short-term memory (LSTM) and multi-layer perceptron (MLP) feed-forward hidden layer modeling. The fundamental frequency values for voiced and unvoiced segments are identified and extracted from the larger models. The values for voiced regions are transferred and applied to training a smaller model and the smaller model is applied in the text to speech system for real-time speech synthesis output.
Information query
Patent Agency Ranking
0/0