TEXT-TO-SPEECH SYNTHESIS METHOD, ELECTRONIC DEVICE, AND COMPUTER-READABLE STORAGE MEDIUM

    Publication number: US20230410791A1

    Publication date: 2023-12-21

    Application number: US18212140

    Application date: 2023-06-20

    CPC classification number: G10L13/10 G10L13/04

    Abstract: A text-to-speech synthesis method, an electronic device, and a computer-readable storage medium are provided. The method includes: performing prosodic pause prediction on an input text to obtain prosodic pause features, and dividing the input text into a plurality of prosodic phrases according to the prosodic pause features; performing streamed speech synthesis on each of the prosodic phrases asynchronously in a thread pool, so as to synthesize a short sentence audio for each prosodic phrase; and, in response to the short sentence audio corresponding to the first prosodic phrase of the input text having been synthesized, performing an audio playback operation of the input text starting from that short sentence audio.
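The three steps in the abstract (phrase splitting, thread-pool synthesis, playback that starts once the first phrase is ready) can be illustrated with a minimal Python sketch. It is not the patented implementation. The names split_prosodic_phrases, synthesize_phrase, and speak are hypothetical; splitting on punctuation stands in for the prosodic pause prediction model; and the synthesizer is a placeholder that returns the phrase as bytes instead of real audio.

```python
import re
from concurrent.futures import ThreadPoolExecutor


def split_prosodic_phrases(text):
    # Assumption: punctuation marks approximate predicted prosodic pauses.
    # A real system would use a prosodic pause prediction model here.
    chunks = re.findall(r"[^,.;!?]+[,.;!?]?", text)
    return [c.strip() for c in chunks if c.strip()]


def synthesize_phrase(phrase):
    # Placeholder synthesizer: returns bytes standing in for a short
    # sentence audio. Replace with a call to an actual TTS engine.
    return phrase.encode("utf-8")


def speak(text, synthesize=synthesize_phrase, play=None, max_workers=4):
    """Synthesize all phrases concurrently and play them in text order.

    Playback of the first phrase begins as soon as its audio is ready,
    while later phrases are still being synthesized in the pool.
    """
    played = []
    phrases = split_prosodic_phrases(text)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit every phrase up front; the futures keep the text order.
        futures = [pool.submit(synthesize, p) for p in phrases]
        for future in futures:
            audio = future.result()  # blocks only until this phrase is done
            if play is not None:
                play(audio)          # hypothetical playback callback
            played.append(audio)
    return played


if __name__ == "__main__":
    for chunk in speak("Hello, world. This is a test"):
        print(chunk)
```

Iterating over the futures in submission order keeps the played audio in text order even when later phrases finish synthesizing first; the wait before the first sound is bounded by the first phrase rather than by the whole text.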
