-
Publication No.: US20230169953A1
Publication date: 2023-06-01
Application No.: US17919982
Filing date: 2021-03-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Ran Zhang, Jian LUAN, Yahuan Cong
Abstract: The present disclosure provides methods and apparatuses for phrase-based end-to-end text-to-speech (TTS) synthesis. A text may be obtained. A target phrase in the text may be identified. A phrase context of the target phrase may be determined. An acoustic feature corresponding to the target phrase may be generated based at least on the target phrase and the phrase context. A speech waveform corresponding to the target phrase may be generated based on the acoustic feature.
-
Publication No.: US20230206899A1
Publication date: 2023-06-29
Application No.: US17926994
Filing date: 2021-04-22
Applicant: Microsoft Technology Licensing, LLC
Inventor: Ran Zhang, Jian LUAN, Yahuan Cong
CPC classification number: G10L13/10, G10L13/04, G10L2013/105
Abstract: The present disclosure provides methods and apparatuses for spontaneous text-to-speech (TTS) synthesis. A target text may be obtained. A fluency reference factor may be determined based at least on the target text. An acoustic feature corresponding to the target text may be generated with the fluency reference factor. A speech waveform corresponding to the target text may be generated based on the acoustic feature.
-