Voice conversion using deep neural network with intermediate voice training

发明授权

US10535336B1 Voice conversion using deep neural network with intermediate voice training 有权

请登陆查看更多内容

专利标题： Voice conversion using deep neural network with intermediate voice training
申请号： US16254194

申请日： 2019-01-22
公开(公告)号： US10535336B1

公开(公告)日： 2020-01-14
发明人: Seyed Hamidreza Mohammadi
申请人： Seyed Hamidreza Mohammadi
申请人地址： US CA Pasadena
专利权人： OBEN, INC.
当前专利权人： OBEN, INC.
当前专利权人地址： US CA Pasadena
代理商 Andrew S. Naglestad
主分类号： G10L13/047
IPC分类号： G10L13/047 ; G10L13/033 ; G10L15/26 ; G10L25/30

Voice conversion using deep neural network with intermediate voice training

摘要：

A system and method of converting source speech to target speech using intermediate speech data is disclosed. The method comprises identifying intermediate speech data that match target voice training data based on acoustic features; performing dynamic time warping to match the second set of acoustic features of intermediate speech data and the first set of acoustic features of target voice training data; training a neural network to convert the intermediate speech data to target voice training data; receiving source speech data; converting the source speech data to an intermediate speech; converting the intermediate speech to a target speech sequence using the neural network; and converting the target speech sequence to target speech using the pitch from the target voice training data.

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/02	.产生合成语音的方法；语音合成设备
G10L13/04	..语音合成系统的零部件，例如合成设备结构或存储器管理
G10L13/047	...语音合成设备的体系结构