-
Publication (Announcement) No.: US12046227B2
Publication (Announcement) Date: 2024-07-23
Application No.: US17659840
Application Date: 2022-04-19
Applicant: Google LLC
Inventor: Tom Marius Kenter, Tobias Alexander Hawker, Robert Clark
IPC: G10L13/08, G10L15/02, G10L15/06, G10L15/187
CPC classification number: G10L13/08, G10L15/02, G10L15/063, G10L15/187, G10L2015/025
Abstract: A method for generating frame values using a key frame network includes receiving a text utterance having at least one phoneme, and for each respective phoneme of the at least one phoneme, predicting, using a predictive model, a fixed quantity of key frames. Each respective key frame of the fixed quantity of key frames includes a representation of a component of the respective phoneme. The method also includes generating, using the fixed quantity of key frames, a plurality of frame values. Here, each respective frame value of the plurality of frame values is representative of a fixed duration of audio.
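Illustrative sketch (not taken from the patent): the flow described in the abstract could look roughly like the Python below. Everything specific here is an assumption made for illustration: the names `predict_key_frames`, `interpolate`, and `generate_frame_values` are hypothetical, the "predictive model" is a deterministic stand-in rather than a trained network, the fixed quantity is set to 3 key frames per phoneme, each frame is taken to cover 5 ms, per-phoneme durations are supplied by the caller, and linear interpolation is one possible way to turn key frames into frame values.

```python
from typing import List, Sequence

KEY_FRAMES_PER_PHONEME = 3  # assumed "fixed quantity" of key frames
FRAME_MS = 5                # assumed fixed duration of audio per frame value


def predict_key_frames(phoneme: str, k: int = KEY_FRAMES_PER_PHONEME) -> List[float]:
    """Hypothetical stand-in for the predictive model: always returns k values."""
    base = float(sum(ord(c) for c in phoneme) % 50 + 100)
    return [base + 10.0 * i for i in range(k)]


def interpolate(key_values: Sequence[float], n_frames: int) -> List[float]:
    """Linearly interpolate key frame values onto n_frames evenly spaced frames."""
    if n_frames <= 0:
        return []
    if len(key_values) == 1 or n_frames == 1:
        return [key_values[0]] * n_frames
    last = len(key_values) - 1
    out = []
    for i in range(n_frames):
        pos = i * last / (n_frames - 1)   # position on the key frame axis
        lo = min(int(pos), last - 1)      # index of the left neighbouring key frame
        frac = pos - lo
        out.append(key_values[lo] * (1 - frac) + key_values[lo + 1] * frac)
    return out


def generate_frame_values(phonemes: Sequence[str],
                          durations_ms: Sequence[int]) -> List[float]:
    """Predict key frames per phoneme, then expand them to fixed-duration frames."""
    frames: List[float] = []
    for phoneme, duration in zip(phonemes, durations_ms):
        keys = predict_key_frames(phoneme)
        frames.extend(interpolate(keys, duration // FRAME_MS))
    return frames
```

Under these assumptions, the number of key frames per phoneme stays constant while the number of output frame values scales with each phoneme's duration.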
-
Publication (Announcement) No.: US20230335110A1
Publication (Announcement) Date: 2023-10-19
Application No.: US17659840
Application Date: 2022-04-19
Applicant: Google LLC
Inventor: Tom Marius Kenter, Tobias Alexander Hawker, Robert Clark
IPC: G10L13/08, G10L15/02, G10L15/06, G10L15/187
CPC classification number: G10L13/08, G10L15/02, G10L15/063, G10L15/187, G10L2015/025
Abstract: A method for generating frame values using a key frame network includes receiving a text utterance having at least one phoneme, and for each respective phoneme of the at least one phoneme, predicting, using a predictive model, a fixed quantity of key frames. Each respective key frame of the fixed quantity of key frames includes a representation of a component of the respective phoneme. The method also includes generating, using the fixed quantity of key frames, a plurality of frame values. Here, each respective frame value of the plurality of frame values is representative of a fixed duration of audio.
-