Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.") AND inv:"Yixiang Chen"

1.

Invention application
METHOD AND APPARATUS FOR CONVERTING VOICE TIMBRE, METHOD AND APPARATUS FOR TRAINING MODEL, DEVICE AND MEDIUM (in force)

Publication No.: US20230127787A1

Publication date: 2023-04-27

Application No.: US18145326

Filing date: 2022-12-22

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Junchao Wang, Yixiang Chen, Tao Sun

IPC: G10L15/02, G10L15/06

Abstract: A method and an apparatus for converting a voice timbre, and a method for training a model. The solution includes: obtaining a target acoustic feature by encoding a sample audio using an encoding branch in a voice timbre conversion model; obtaining a target text feature by performing feature extraction on a real text sequence labeled by the sample audio; training the encoding branch based on a difference between the target acoustic feature and the target text feature; obtaining a first spectrum feature having an original timbre by decoding the target text feature using a decoding branch in the voice timbre conversion model based on the original timbre corresponding to the identification information carried in the sample audio; obtaining a second spectrum feature by performing spectrum feature extraction on the sample audio; and training the decoding branch based on a difference between the first spectrum feature and the second spectrum feature.
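The two training objectives named in the abstract can be sketched roughly as follows. This is a toy illustration and not the patented implementation: the linear encoder and decoder, the use of mean squared error as the "difference", the feature sizes, the frame-aligned random text features, and every identifier (`params`, `train_step`, `speaker_id`, and so on) are assumptions introduced here. The sample audio is represented directly by its spectrum for simplicity.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: frames, spectrum bins, feature dimension, speakers.
T, N_MEL, D, N_SPK = 50, 80, 16, 4

# Toy stand-ins for one training sample (random data, for illustration only).
sample_spectrum = rng.normal(size=(T, N_MEL))  # "second spectrum feature"
text_feature = rng.normal(size=(T, D))         # "target text feature"
speaker_id = 2                                 # identification info -> original timbre

# Assumed linear branches; the abstract does not specify any architecture.
params = {
    "W_enc": rng.normal(scale=0.1, size=(N_MEL, D)),  # encoding branch
    "W_dec": rng.normal(scale=0.1, size=(D, N_MEL)),  # decoding branch
    "spk_emb": np.zeros((N_SPK, N_MEL)),              # per-speaker timbre offset
}

def train_step(lr=0.1):
    """One gradient-descent step on both branches; returns the two losses."""
    # Encoding branch: target acoustic feature vs. target text feature.
    enc_err = sample_spectrum @ params["W_enc"] - text_feature
    enc_loss = float(np.mean(enc_err ** 2))
    params["W_enc"] -= lr * sample_spectrum.T @ (2 * enc_err / enc_err.size)

    # Decoding branch: decode the text feature with the original timbre
    # ("first spectrum feature") and compare with the sample's own spectrum.
    first_spectrum = text_feature @ params["W_dec"] + params["spk_emb"][speaker_id]
    dec_err = first_spectrum - sample_spectrum
    dec_loss = float(np.mean(dec_err ** 2))
    grad = 2 * dec_err / dec_err.size
    params["W_dec"] -= lr * text_feature.T @ grad
    params["spk_emb"][speaker_id] -= lr * grad.sum(axis=0)

    return enc_loss, dec_loss

losses = [train_step() for _ in range(20)]
print("first step:", losses[0])
print("last step: ", losses[-1])
```

In this sketch the two branches are updated independently within each step, mirroring the abstract's separate "training the encoding branch" and "training the decoding branch" clauses; whether the actual method trains them jointly or in stages is not stated in the abstract.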
