-
Publication No.: US20200152194A1
Publication date: 2020-05-14
Application No.: US16683342
Application date: 2019-11-14
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jonghoon JEONG, Hosang SUNG, Doohwa HONG, Kyoungbo MIN, Eunmi OH, Kihyun CHOO
Abstract: An electronic apparatus, based on a text sentence being input, obtains prosody information of the text sentence, segments the text sentence into a plurality of sentence elements, obtains a speech in which prosody information is reflected to each of the plurality of sentence elements in parallel by inputting the plurality of sentence elements and the prosody information of the text sentence to a text to speech (TTS) module, and merges the speech for the plurality of sentence elements that are obtained in parallel to output speech for the text sentence.
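Illustration (not part of the patent record): the flow in the abstract above, sentence-level prosody first, then segmentation, parallel synthesis, and merging, can be sketched roughly as follows. Every function name here (`get_prosody`, `segment`, `synthesize`, `text_to_speech`) is hypothetical, and the "speech" is a list of placeholder strings rather than audio; the abstract does not specify how any of these steps is implemented.

```python
import re
from concurrent.futures import ThreadPoolExecutor


def get_prosody(sentence: str) -> dict:
    """Hypothetical stand-in: derive sentence-level prosody from the whole sentence."""
    kind = "question" if sentence.rstrip().endswith("?") else "statement"
    return {"sentence_type": kind}


def segment(sentence: str) -> list:
    """Assumed segmentation rule: split after commas and semicolons."""
    parts = [p.strip() for p in re.split(r"(?<=[,;])\s+", sentence)]
    return [p for p in parts if p]


def synthesize(element: str, prosody: dict) -> list:
    """Placeholder TTS call: a real module would return audio samples."""
    return [f"{prosody['sentence_type']}:{element}"]


def text_to_speech(sentence: str, max_workers: int = 4) -> list:
    prosody = get_prosody(sentence)   # computed once, for the full sentence
    elements = segment(sentence)
    # Each element is synthesized in parallel but receives the shared
    # sentence-level prosody; pool.map returns results in input order.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        chunks = list(pool.map(lambda e: synthesize(e, prosody), elements))
    merged = []
    for chunk in chunks:              # merge per-element output in order
        merged.extend(chunk)
    return merged
```

The point of the sketch is only that each element is synthesized with prosody taken from the whole sentence, so the merged output need not sound like independently generated fragments.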
-
Publication No.: US20210134269A1
Publication date: 2021-05-06
Application No.: US17081251
Application date: 2020-10-27
Applicant: Samsung Electronics Co., Ltd.
Inventor: Kyoungbo MIN, Seungdo CHOI, Doohwa HONG
Abstract: An electronic device for providing a text-to-speech (TTS) service and an operating method therefor are provided. The operating method of the electronic device includes obtaining target voice data based on an utterance input of a specific speaker, determining a number of learning steps of the target voice data, based on data features including a data amount of the target voice data, generating a target model by training a pre-trained model pre-trained to convert text into an audio signal, by using the target voice data as training data, based on the determined number of learning steps, generating output data obtained by converting input text into an audio signal, by using the generated target model, and outputting the generated output data.
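Illustration (not part of the patent record): one simple way to tie the number of learning steps to the amount of target voice data is a clamped linear rule, sketched below. The function name, the linear relationship, and all constants are assumptions made for this example; the abstract states only that the step count is determined from data features including the data amount.

```python
def determine_learning_steps(
    duration_sec: float,
    steps_per_sec: float = 10.0,   # assumed scaling factor, not from the patent
    min_steps: int = 100,          # assumed lower bound
    max_steps: int = 5000,         # assumed upper bound
) -> int:
    """Return a fine-tuning step count that grows with the amount of voice data."""
    if duration_sec < 0:
        raise ValueError("duration_sec must be non-negative")
    steps = int(round(duration_sec * steps_per_sec))
    # Clamp so very small datasets still train and large ones stay bounded.
    return max(min_steps, min(max_steps, steps))
```

Under these assumed constants, one minute of recorded speech yields 600 steps, while very short or very long recordings fall back to the lower or upper bound.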
-
Publication No.: US20220180872A1
Publication date: 2022-06-09
Application No.: US17679446
Application date: 2022-02-24
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jonghoon JEONG, Hosang SUNG, Doohwa HONG, Kyoungbo MIN, Eunmi OH, Kihyun CHOO
Abstract: An electronic apparatus, based on a text sentence being input, obtains prosody information of the text sentence, segments the text sentence into a plurality of sentence elements, obtains a speech in which prosody information is reflected to each of the plurality of sentence elements in parallel by inputting the plurality of sentence elements and the prosody information of the text sentence to a text to speech (TTS) module, and merges the speech for the plurality of sentence elements that are obtained in parallel to output speech for the text sentence.
-
Publication No.: US20200234693A1
Publication date: 2020-07-23
Application No.: US16749257
Application date: 2020-01-22
Applicant: Samsung Electronics Co., Ltd.
Inventor: Hosang SUNG, Seonho HWANG, Doohwa HONG, Eunmi OH, Kyoungbo MIN, Jonghoon JEONG, Kihyun CHOO
IPC: G10L13/08, G10L15/22, G10L15/18, G10L15/02, G10L13/033, G10L13/04, G10L13/047
Abstract: An electronic device and a controlling method of the electronic device are provided. The electronic device acquires text to respond on a received user's speech, acquires a plurality of pieces of parameter information for determining a style of an output speech corresponding to the text based on information on a type of a plurality of text-to-speech (TTS) databases and the received user's speech, identifies a TTS database corresponding to the plurality of pieces of parameter information among the plurality of TTS databases, identifies a weight set corresponding to the plurality of pieces of parameter information among a plurality of weight sets acquired through a trained artificial intelligence model, adjusts information on the output speech stored in the TTS database based on the weight set, synthesizes the output speech based on the adjusted information on the output speech, and outputs the output speech corresponding to the text.
-
Publication No.: US20230017302A1
Publication date: 2023-01-19
Application No.: US17949741
Application date: 2022-09-21
Applicant: Samsung Electronics Co., Ltd.
Inventor: Kyoungbo MIN, Seungdo CHOI, Doohwa HONG
Abstract: An electronic device for providing a text-to-speech (TTS) service and an operating method therefor are provided. The operating method of the electronic device includes obtaining target voice data based on an utterance input of a specific speaker, determining a number of learning steps of the target voice data, based on data features including a data amount of the target voice data, generating a target model by training a pre-trained model pre-trained to convert text into an audio signal, by using the target voice data as training data, based on the determined number of learning steps, generating output data obtained by converting input text into an audio signal, by using the generated target model, and outputting the generated output data.
-
Publication No.: US20190180747A1
Publication date: 2019-06-13
Application No.: US16211973
Application date: 2018-12-06
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Seohyun BACK, Doohwa HONG, Jongyoub RYU, Jiyeon HONG, Eunkyoung KIM, Sungja CHOI, Jaewon LEE, Sathish INDURTHI
Abstract: The disclosure relates to a voice recognition apparatus for analyzing a user input based on content and generating and outputting an answer and an operation method thereof, the operation method including receiving an audio signal and performing voice recognition on the audio signal; acquiring content information of content being executed; analyzing a user input based on the content information from a voice recognized by performing the voice recognition; generating an answer based on the analyzed user input and the content information; and outputting the answer.