-
公开(公告)号:US20230127787A1
公开(公告)日:2023-04-27
申请号:US18145326
申请日:2022-12-22
Inventor: Junchao Wang , Yixiang Chen , Tao Sun
Abstract: A method and an apparatus for converting a voice timbre, and a method for training a model. The solution includes: obtaining a target acoustic feature by encoding a sample audio using an encoding branch in a voice timbre conversion model; obtaining a target text feature by performing feature extraction on a real text sequence labeled by the sample audio; training the encoding branch based on a difference between the target acoustic feature and the target text feature; obtaining a first spectrum feature having an original timbre by decoding the target text feature using a decoding branch in the voice timbre conversion model based on the original timbre corresponding to the identification information carried in the sample audio; obtaining a second spectrum feature by performing spectrum feature extraction on the sample audio; and training the decoding branch based on a difference between the first spectrum feature and the second spectrum feature.