METHOD AND APPARATUS FOR CONVERTING VOICE TIMBRE, METHOD AND APPARATUS FOR TRAINING MODEL, DEVICE AND MEDIUM

    公开(公告)号:US20230127787A1

    公开(公告)日:2023-04-27

    申请号:US18145326

    申请日:2022-12-22

    Abstract: A method and an apparatus for converting a voice timbre, and a method for training a model. The solution includes: obtaining a target acoustic feature by encoding a sample audio using an encoding branch in a voice timbre conversion model; obtaining a target text feature by performing feature extraction on a real text sequence labeled by the sample audio; training the encoding branch based on a difference between the target acoustic feature and the target text feature; obtaining a first spectrum feature having an original timbre by decoding the target text feature using a decoding branch in the voice timbre conversion model based on the original timbre corresponding to the identification information carried in the sample audio; obtaining a second spectrum feature by performing spectrum feature extraction on the sample audio; and training the decoding branch based on a difference between the first spectrum feature and the second spectrum feature.

Patent Agency Ranking