-
公开(公告)号:US20240274120A1
公开(公告)日:2024-08-15
申请号:US18568261
申请日:2022-09-16
Applicant: Lemon Inc.
Inventor: Dongyang Dai , Yuanzhe Chen , Li Chen , Yuping Wang , Qiao Tian , Ming Tu , Rui Xia , Yuxuan Wang
IPC: G10L13/027 , G10L25/18
CPC classification number: G10L13/027 , G10L25/18
Abstract: Provided are an audio synthesis method and apparatus, an electronic device, and a readable storage medium. In the present solution, conversion from a text to an audio having a target timbre is achieved by means of a pre-trained voice synthesis model, the voice synthesis model comprising a first feature extraction sub-model and a second feature extraction sub-model, wherein the first feature extraction sub-model outputs, according to an inputted text to be processed, an acoustic feature comprising a bottleneck feature; the second feature extraction sub-model outputs, according to the inputted first acoustic features, a Mel spectrum feature corresponding to the text to be processed; according to the Mel spectrum feature corresponding to the text to be processed, the target audio corresponding to the text to be processed is obtained, and the target audio has the target timbre.