SPEECH SYNTHESIS METHOD, DEVICE AND COMPUTER-READABLE STORAGE MEDIUM

    Publication number: US20230206895A1

    Publication date: 2023-06-29

    Application number: US18089576

    Application date: 2022-12-28

    CPC classification number: G10L13/047 G10L13/10

    Abstract: A speech synthesis method includes: obtaining an acoustic feature sequence of a text to be processed; processing the acoustic feature sequence in parallel by using a non-autoregressive computing model to obtain first audio information of the text to be processed, wherein the first audio information comprises audio corresponding to each segment; processing the acoustic feature sequence and the first audio information by using an autoregressive computing model to obtain a residual value corresponding to each segment; and obtaining second audio information corresponding to an i-th segment based on the first audio information corresponding to the i-th segment and the residual values corresponding to the first through (i-1)-th segments, wherein synthesized audio of the text to be processed comprises the second audio information of every segment, i = 1, 2, ..., n, and n is the total number of segments.
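
The data flow in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the first audio and the earlier residuals are combined, so the additive `toy_combine` below is an assumption, and `synthesize`, `toy_nar`, and `toy_ar` are hypothetical stand-ins for the two models.

```python
from typing import Callable, List, Sequence

Segment = List[int]


def synthesize(
    features: Sequence[int],
    nar_model: Callable[[Sequence[int]], List[Segment]],
    ar_residual_model: Callable[[Sequence[int], List[Segment], List[int], int], int],
    combine: Callable[[Segment, List[int]], Segment],
) -> List[Segment]:
    """Return the second audio information for every segment."""
    # Non-autoregressive pass: all segments are produced in one call.
    first_audio = nar_model(features)

    residuals: List[int] = []
    second_audio: List[Segment] = []
    for i, segment in enumerate(first_audio):
        # Segment i is refined with the residuals of the segments before it.
        # For the first segment that list is empty.
        second_audio.append(combine(segment, residuals))
        # Autoregressive pass: the residual for segment i may depend on
        # the residuals already computed.
        residuals.append(ar_residual_model(features, first_audio, residuals, i))
    return second_audio


# Toy stand-ins (assumptions for illustration only).
def toy_nar(features: Sequence[int]) -> List[Segment]:
    return [[f, f] for f in features]


def toy_ar(features, first_audio, residuals, i) -> int:
    previous = residuals[-1] if residuals else 0
    return features[i] + previous


def toy_combine(segment: Segment, prior_residuals: List[int]) -> Segment:
    # Assumed combination rule: add the sum of the earlier residuals.
    correction = sum(prior_residuals)
    return [sample + correction for sample in segment]


result = synthesize([1, 2, 3], toy_nar, toy_ar, toy_combine)
```

With these toy functions the first segment passes through unchanged because no earlier residuals exist, while later segments are shifted by the accumulated residuals.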

    METHOD AND APPARATUS FOR VOICE CONVERSION AND STORAGE MEDIUM

    Publication number: US20210193160A1

    Publication date: 2021-06-24

    Application number: US17084672

    Application date: 2020-10-30

    Abstract: A voice conversion method is disclosed. The method includes: obtaining a to-be-converted voice, and extracting acoustic features of the to-be-converted voice; obtaining a source vector corresponding to the to-be-converted voice from a source vector pool, and selecting a target vector corresponding to a target voice from a target vector pool; inputting the acoustic features of the to-be-converted voice, the source vector corresponding to the to-be-converted voice, and the target vector corresponding to the target voice into a voice conversion model, and obtaining acoustic features of the target voice output by the voice conversion model; and obtaining the target voice by converting the acoustic features of the target voice using a vocoder. A voice conversion apparatus and a storage medium are also provided.
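
The four steps of this abstract can be sketched as a small Python pipeline. This is only an illustration under stated assumptions: the abstract does not say how the vector pools are indexed, so keying them by speaker ID is an assumption, and `convert_voice` and all of the toy functions are hypothetical names rather than the patent's components.

```python
from typing import Callable, Dict, List

Vector = List[int]


def convert_voice(
    voice: List[int],
    source_id: str,
    target_id: str,
    source_pool: Dict[str, Vector],
    target_pool: Dict[str, Vector],
    extract_features: Callable[[List[int]], Vector],
    conversion_model: Callable[[Vector, Vector, Vector], Vector],
    vocoder: Callable[[Vector], List[int]],
) -> List[int]:
    """Run the four steps of the abstract in order."""
    # 1. Extract acoustic features of the to-be-converted voice.
    features = extract_features(voice)
    # 2. Look up the source vector and select the target vector
    #    (pools keyed by speaker ID: an assumption).
    source_vec = source_pool[source_id]
    target_vec = target_pool[target_id]
    # 3. The conversion model maps the three inputs to target features.
    target_features = conversion_model(features, source_vec, target_vec)
    # 4. The vocoder turns the target features into the target voice.
    return vocoder(target_features)


# Toy stand-ins (assumptions for illustration only).
def toy_extract(voice: List[int]) -> Vector:
    return [2 * sample for sample in voice]


def toy_model(features: Vector, source_vec: Vector, target_vec: Vector) -> Vector:
    # Remove the source offset and apply the target offset.
    return [f - source_vec[0] + target_vec[0] for f in features]


def toy_vocoder(features: Vector) -> List[int]:
    return [10 * f for f in features]


converted = convert_voice(
    [1, 2], "alice", "bob",
    {"alice": [1]}, {"bob": [5]},
    toy_extract, toy_model, toy_vocoder,
)
```

Passing the feature extractor, model, and vocoder as callables keeps the sketch independent of any particular library.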
