METHOD OF TRAINING SPEECH SYNTHESIS MODEL AND METHOD OF SYNTHESIZING SPEECH

    公开(公告)号:US20230178067A1

    公开(公告)日:2023-06-08

    申请号:US18074023

    申请日:2022-12-02

    CPC classification number: G10L13/047 G10L25/30

    Abstract: A method of training a speech synthesis method, a method of synthesizing a speech, a device and a storage medium are provided, which relate to a field of artificial intelligence technology, in particular to a field of speech synthesis technology. The specific implementation scheme includes: processing training data by using the speech synthesis model, so as to determine a content encoding sequence, a style encoding sequence, a timbre encoding vector, a noise environment vector and a target Mel spectrum sequence corresponding to the training data; determine a total loss value according to the content encoding sequence, the style encoding sequence, the timbre encoding vector, the noise environment vector and the target Mel spectrum sequence; and adjusting a parameter of the speech synthesis model according to the total loss value.

Patent Agency Ranking