Invention Grant
- Patent Title: Global prosody style transfer without text transcriptions
- Application No.: US17337518
- Application Date: 2021-06-03
- Publication No.: US11996083B2
- Publication Date: 2024-05-28
- Inventor: Kaizhi Qian, Yang Zhang, Shiyu Chang, Jinjun Xiong, Chuang Gan, David Cox
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Tutunjian & Bitetto, P.C.
- Agent: Stosch Sabo
- Main IPC: G10L13/10
- IPC: G10L13/10; G06N20/00; G10L17/04; G10L21/013; G10L25/63

Abstract:
A computer-implemented method is provided for using a machine learning model to disentangle prosody in spoken natural language. The method includes encoding, by a computing device, the spoken natural language to produce content code. The method further includes resampling, by the computing device without text transcriptions, the content code to obscure the prosody by applying an unsupervised technique to the machine learning model to generate prosody-obscured content code. The method additionally includes decoding, by the computing device, the prosody-obscured content code to synthesize speech indirectly based upon the content code.
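The encode, resample, decode flow in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the resampling is done, so the segment-wise random time-stretching below is an assumption chosen only to show how timing cues might be obscured without any text transcription. The names encode, random_resample, and decode are hypothetical, and the encoder and decoder are identity placeholders rather than learned models.

```python
import numpy as np


def encode(frames):
    """Placeholder encoder (assumption): returns the input frames as a (T, D) float array."""
    return np.asarray(frames, dtype=float)


def random_resample(code, rng, min_seg=2, max_seg=4, min_rate=0.5, max_rate=1.5):
    """Stretch or compress random-length segments of a (T, D) code sequence.

    Each segment gets its own random rate, so the original timing of the
    sequence is scrambled. No labels or transcriptions are used.
    """
    pieces = []
    start = 0
    total = code.shape[0]
    while start < total:
        seg_len = int(rng.integers(min_seg, max_seg + 1))
        seg = code[start:start + seg_len]
        rate = rng.uniform(min_rate, max_rate)
        new_len = max(1, int(round(seg.shape[0] * rate)))
        # Linearly interpolate every feature dimension onto the new time grid.
        src_pos = np.linspace(0, seg.shape[0] - 1, new_len)
        idx = np.arange(seg.shape[0])
        resampled = np.stack(
            [np.interp(src_pos, idx, seg[:, d]) for d in range(seg.shape[1])],
            axis=1,
        )
        pieces.append(resampled)
        start += seg_len
    return np.concatenate(pieces, axis=0)


def decode(code):
    """Placeholder decoder (assumption): a real system would synthesize speech here."""
    return code


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frames = rng.normal(size=(20, 4))  # stand-in for 20 frames of 4-dim speech features
    content_code = encode(frames)
    obscured = random_resample(content_code, rng)
    speech = decode(obscured)
    print(content_code.shape, speech.shape)
```

Because each segment is stretched by an independent random rate, the output length generally differs from the input length, while the feature dimension is unchanged.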
Public/Granted literature
- US20220392429A1 GLOBAL PROSODY STYLE TRANSFER WITHOUT TEXT TRANSCRIPTIONS Public/Granted day: 2022-12-08