-
Publication No.: US11335325B2
Publication date: 2022-05-17
Application No.: US16749257
Filing date: 2020-01-22
Inventors: Hosang Sung, Seonho Hwang, Doohwa Hong, Eunmi Oh, Kyoungbo Min, Jonghoon Jeong, Kihyun Choo
IPC classification: G10L13/08, G10L15/22, G10L15/18, G10L13/047, G10L13/033, G10L15/02, G10L13/00
Abstract: An electronic device and a controlling method of the electronic device are provided. The electronic device acquires text for responding to a received user's speech, acquires a plurality of pieces of parameter information for determining a style of an output speech corresponding to the text based on information on a type of a plurality of text-to-speech (TTS) databases and the received user's speech, identifies a TTS database corresponding to the plurality of pieces of parameter information among the plurality of TTS databases, identifies a weight set corresponding to the plurality of pieces of parameter information among a plurality of weight sets acquired through a trained artificial intelligence model, adjusts information on the output speech stored in the TTS database based on the weight set, synthesizes the output speech based on the adjusted information on the output speech, and outputs the output speech corresponding to the text.
-
Publication No.: US11942077B2
Publication date: 2024-03-26
Application No.: US17949741
Filing date: 2022-09-21
Inventors: Kyoungbo Min, Seungdo Choi, Doohwa Hong
CPC classification: G10L15/063, G10L13/00, G10L15/16
Abstract: An electronic device for providing a text-to-speech (TTS) service and an operating method therefor are provided. The operating method of the electronic device includes obtaining target voice data based on an utterance input of a specific speaker, determining a number of learning steps for the target voice data based on data features including a data amount of the target voice data, generating a target model by training a model pre-trained to convert text into an audio signal, using the target voice data as training data and the determined number of learning steps, generating output data by converting input text into an audio signal using the generated target model, and outputting the generated output data.
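As an illustration of the "number of learning steps based on data amount" idea in the abstract above, here is a minimal sketch. It is not the patented method: the function name `determine_learning_steps`, the linear scaling rule, and all constants (`steps_per_minute`, `min_steps`, `max_steps`) are hypothetical assumptions chosen only to show how a step count might be derived from the amount of target voice data.

```python
def determine_learning_steps(audio_seconds: float,
                             steps_per_minute: int = 200,
                             min_steps: int = 500,
                             max_steps: int = 10_000) -> int:
    """Map the amount of target voice data to a fine-tuning step count.

    Hypothetical rule: scale linearly with minutes of audio, then clamp
    so that very small or very large datasets stay within fixed bounds.
    """
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    raw_steps = int(audio_seconds / 60.0 * steps_per_minute)
    return max(min_steps, min(max_steps, raw_steps))


if __name__ == "__main__":
    for seconds in (60, 600, 36_000):
        print(seconds, determine_learning_steps(seconds))
```

The clamp reflects one plausible design concern: too few steps may under-adapt the pre-trained model to the speaker, while too many steps on a small dataset may overfit.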
-
Publication No.: US20200279551A1
Publication date: 2020-09-03
Application No.: US16788418
Filing date: 2020-02-12
Inventors: Hosang Sung, Kyoungbo Min, Seonho Hwang, Doohwa Hong, Eunmi Oh, Jonghoon Jeong, Kihyun Choo
IPC classification: G10L13/08, G10L25/63, G10L17/00, G10L13/04, G10L13/047
Abstract: An electronic apparatus which acquires input data to be input into a TTS module for outputting a voice through the TTS module, acquires a voice signal corresponding to the input data through the TTS module, detects an error in the acquired voice signal based on the input data, corrects the input data based on the detection result, and acquires a corrected voice signal corresponding to the corrected input data through the TTS module.
-
Publication No.: US11475878B2
Publication date: 2022-10-18
Application No.: US17081251
Filing date: 2020-10-27
Inventors: Kyoungbo Min, Seungdo Choi, Doohwa Hong
Abstract: An electronic device for providing a text-to-speech (TTS) service and an operating method therefor are provided. The operating method of the electronic device includes obtaining target voice data based on an utterance input of a specific speaker, determining a number of learning steps for the target voice data based on data features including a data amount of the target voice data, generating a target model by training a model pre-trained to convert text into an audio signal, using the target voice data as training data and the determined number of learning steps, generating output data by converting input text into an audio signal using the generated target model, and outputting the generated output data.
-
Publication No.: US20220148562A1
Publication date: 2022-05-12
Application No.: US17554547
Filing date: 2021-12-17
Inventors: Sangjun Park, Kyoungbo Min, Kihyun Choo, Seungdo Choi
IPC classification: G10L13/047, G10L13/10
Abstract: An electronic apparatus and a controlling method thereof are provided. The electronic apparatus includes a microphone; a memory configured to store a text-to-speech (TTS) model and a plurality of evaluation texts; and a processor configured to: obtain a first reference vector of a user speech spoken by a user based on the user speech being received through the microphone, generate a plurality of candidate reference vectors based on the first reference vector, obtain a plurality of synthesized sounds by inputting the plurality of candidate reference vectors and the plurality of evaluation texts to the TTS model, identify at least one synthesized sound of the plurality of synthesized sounds based on a similarity between characteristics of the plurality of synthesized sounds and the user speech, and store a second reference vector of the at least one synthesized sound in the memory as a reference vector corresponding to the user for the TTS model.
-
Publication No.: US11763799B2
Publication date: 2023-09-19
Application No.: US17554547
Filing date: 2021-12-17
Inventors: Sangjun Park, Kyoungbo Min, Kihyun Choo, Seungdo Choi
IPC classification: G10L13/08, G10L15/14, G10L15/06, G10L13/047, G10L13/10
CPC classification: G10L13/047, G10L13/10
Abstract: An electronic apparatus and a controlling method thereof are provided. The electronic apparatus includes a microphone; a memory configured to store a text-to-speech (TTS) model and a plurality of evaluation texts; and a processor configured to: obtain a first reference vector of a user speech spoken by a user based on the user speech being received through the microphone, generate a plurality of candidate reference vectors based on the first reference vector, obtain a plurality of synthesized sounds by inputting the plurality of candidate reference vectors and the plurality of evaluation texts to the TTS model, identify at least one synthesized sound of the plurality of synthesized sounds based on a similarity between characteristics of the plurality of synthesized sounds and the user speech, and store a second reference vector of the at least one synthesized sound in the memory as a reference vector corresponding to the user for the TTS model.
-
Publication No.: US20230206897A1
Publication date: 2023-06-29
Application No.: US18171079
Filing date: 2023-02-17
Inventors: Hosang Sung, Kyoungbo Min, Seonho Hwang, Doohwa Hong, Eunmi Oh, Jonghoon Jeong, Kihyun Choo
IPC classification: G10L13/08, G10L25/63, G10L13/047, G10L13/00, G10L17/00
CPC classification: G10L13/08, G10L25/63, G10L13/047, G10L13/00, G10L17/00
Abstract: An electronic apparatus which acquires input data to be input into a TTS module for outputting a voice through the TTS module, acquires a voice signal corresponding to the input data through the TTS module, detects an error in the acquired voice signal based on the input data, corrects the input data based on the detection result, and acquires a corrected voice signal corresponding to the corrected input data through the TTS module.
-
Publication No.: US11587547B2
Publication date: 2023-02-21
Application No.: US16788418
Filing date: 2020-02-12
Inventors: Hosang Sung, Kyoungbo Min, Seonho Hwang, Doohwa Hong, Eunmi Oh, Jonghoon Jeong, Kihyun Choo
Abstract: An electronic apparatus which acquires input data to be input into a TTS module for outputting a voice through the TTS module, acquires a voice signal corresponding to the input data through the TTS module, detects an error in the acquired voice signal based on the input data, corrects the input data based on the detection result, and acquires a corrected voice signal corresponding to the corrected input data through the TTS module.
-
Publication No.: US11289083B2
Publication date: 2022-03-29
Application No.: US16683342
Filing date: 2019-11-14
Inventors: Jonghoon Jeong, Hosang Sung, Doohwa Hong, Kyoungbo Min, Eunmi Oh, Kihyun Choo
Abstract: An electronic apparatus, based on a text sentence being input, obtains prosody information of the text sentence, segments the text sentence into a plurality of sentence elements, obtains, in parallel, speech in which the prosody information is reflected for each of the plurality of sentence elements by inputting the plurality of sentence elements and the prosody information of the text sentence to a text-to-speech (TTS) module, and merges the speech obtained in parallel for the plurality of sentence elements to output speech for the text sentence.
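The segment, synthesize-in-parallel, and merge flow described in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: `segment`, `synthesize_element`, and `synthesize_sentence` are hypothetical names, the punctuation-based segmentation rule is invented for the example, and the "TTS module" is a placeholder that returns dummy samples scaled by a hypothetical `gain` prosody value.

```python
import re
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List


def segment(sentence: str) -> List[str]:
    """Split a sentence into elements at commas/semicolons (assumed rule)."""
    parts = re.split(r"[,;]\s*", sentence.strip())
    return [p for p in parts if p]


def synthesize_element(element: str, prosody: Dict[str, float]) -> List[float]:
    """Placeholder TTS: one dummy sample per character, scaled by 'gain'.

    A real TTS module would return an audio waveform conditioned on the
    sentence-level prosody information.
    """
    return [prosody["gain"]] * len(element)


def synthesize_sentence(sentence: str, prosody: Dict[str, float]) -> List[float]:
    """Synthesize each element in parallel, then merge in original order."""
    elements = segment(sentence)
    with ThreadPoolExecutor() as pool:
        # pool.map preserves input order, so chunks line up with elements.
        chunks = list(pool.map(lambda e: synthesize_element(e, prosody), elements))
    merged: List[float] = []
    for chunk in chunks:
        merged.extend(chunk)
    return merged


if __name__ == "__main__":
    audio = synthesize_sentence("Hello there, how are you; fine", {"gain": 0.5})
    print(len(audio))
```

Passing the same sentence-level prosody to every element is the point of the design: each element can be synthesized independently without the merged output losing a consistent intonation.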