Patent search: ap:("BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.") AND inv:"Zhipeng Chen" (page 1)

1.

Granted invention patent
Training method and apparatus for a speech synthesis model, and storage medium (in force)

Publication No.: US11488577B2

Publication date: 2022-11-01

Application No.: US16907006

Filing date: 2020-06-19

Applicant: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.

Inventors: Zhipeng Chen, Jinfeng Bai, Lei Jia

IPC classification: G10L13/047, G06N3/08, G10L13/06, G10L13/08

Abstract: The present application discloses a training method and an apparatus for a speech synthesis model, an electronic device, and a storage medium. The method includes: taking a syllable input sequence, a phoneme input sequence and a Chinese character input sequence of a current sample as inputs of an encoder of a model to be trained, to obtain encoded representations of these three sequences at an output end of the encoder; fusing the encoded representations of these three sequences, to obtain a weighted combination of these three sequences; taking the weighted combination as an input of an attention module, to obtain a weighted average of the weighted combination at each moment at an output end of the attention module; taking the weighted average as an input of a decoder of the model to be trained, to obtain a speech Mel spectrum of the current sample at an output end of the decoder.
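The data flow described in the abstract (encode three sequences, fuse them, attend, decode to Mel frames) can be illustrated with a minimal forward-pass sketch. This is not the patented implementation: the lookup-table encoder, the fixed fusion weights, the dot-product attention, the linear decoder, and every name below (`encode`, `fuse`, `attend`, `decode_step`, `synthesize`, `params`) are hypothetical stand-ins. It also assumes the three input sequences are already aligned to the same length, which the abstract does not state, and it omits the training loss entirely.

```python
import math
import random

def encode(ids, table):
    """Stand-in encoder (assumption): one vector per input token via table lookup."""
    return [table[i] for i in ids]

def fuse(encodings, weights):
    """Weighted combination of aligned encoded sequences, position by position."""
    length, dim = len(encodings[0]), len(encodings[0][0])
    return [
        [sum(w * enc[t][d] for w, enc in zip(weights, encodings)) for d in range(dim)]
        for t in range(length)
    ]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, memory):
    """Dot-product attention (assumption): weighted average of memory rows."""
    scores = [sum(q * k for q, k in zip(query, row)) for row in memory]
    alphas = softmax(scores)
    dim = len(memory[0])
    context = [sum(a * row[d] for a, row in zip(alphas, memory)) for d in range(dim)]
    return context, alphas

def decode_step(context, proj):
    """Stand-in decoder (assumption): linear projection of the context to one Mel frame."""
    return [sum(c * w for c, w in zip(context, row)) for row in proj]

def synthesize(syl_ids, pho_ids, chr_ids, params, n_frames):
    """Forward pass: encode three sequences, fuse, attend per step, decode."""
    encodings = [
        encode(syl_ids, params["syl_table"]),
        encode(pho_ids, params["pho_table"]),
        encode(chr_ids, params["chr_table"]),
    ]
    memory = fuse(encodings, params["fusion_weights"])
    frames = []
    for step in range(n_frames):
        # A real decoder would derive the query from its own state;
        # here the queries are fixed placeholder vectors.
        context, _ = attend(params["queries"][step], memory)
        frames.append(decode_step(context, params["proj"]))
    return frames

def random_matrix(rows, cols, rng):
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
DIM, N_MELS, VOCAB, N_FRAMES = 4, 3, 10, 5  # toy sizes, chosen arbitrarily
params = {
    "syl_table": random_matrix(VOCAB, DIM, rng),
    "pho_table": random_matrix(VOCAB, DIM, rng),
    "chr_table": random_matrix(VOCAB, DIM, rng),
    "fusion_weights": [0.5, 0.3, 0.2],  # would be learned in a trained model
    "queries": random_matrix(N_FRAMES, DIM, rng),
    "proj": random_matrix(N_MELS, DIM, rng),
}

mel = synthesize([1, 2, 3], [4, 5, 6], [7, 8, 9], params, N_FRAMES)
print(len(mel), len(mel[0]))  # 5 3
```

In a trained model the fusion weights, encoder, and decoder would all be learned, and the predicted Mel frames would be compared against the reference spectrum of the current sample; the sketch only fixes the order of the four stages named in the abstract.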