-
公开(公告)号:US20210350788A1
公开(公告)日:2021-11-11
申请号:US17198727
申请日:2021-03-11
Applicant: Samsung Electronics Co., Ltd.
Inventor: Kihyun CHOO , Sangjun PARK , Nicholas LANE , Ravichander VIPPERLA , Sourav BHATTACHARYA , Syed Samin ISHTIAQ , Taehwa KANG , Jonghoon JEONG
Abstract: A method, performed by an electronic device, of generating a speech signal corresponding to at least one text is provided. The method includes obtaining feature information with respect to a first sample included in the speech signal, based on the at least one text, obtaining condition information related to a condition under which a bunching operation, in which one or more sample values included in the speech signal are obtained, is performed, based on the feature information, configuring one or more bunching blocks for performing the bunching operation, based on the condition information, obtaining the one or more sample values based on the feature information with respect to the first sample by using the one or more bunching blocks, and generating the speech signal based on the obtained one or more sample values.
-
公开(公告)号:US20220246129A1
公开(公告)日:2022-08-04
申请号:US17578164
申请日:2022-01-18
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Hosang SUNG , Lei YANG , Jonguk YOO , Jonghoon JEONG , Kihyun CHOO
IPC: G10K11/178 , G10L25/78
Abstract: A controlling method of a wearable electronic apparatus includes: receiving, by an IMU sensor, a bone conduction signal corresponding to vibration in the user's face, while the wearable electronic apparatus is operated in an ANC mode; identifying a presence or an absence of the user's voice based on the bone conduction signal, based on the identifying the presence of the user's voice, controlling an operation mode of the wearable electronic apparatus to be a different operation mode from the ANC mode; while the wearable electronic apparatus is operated in the different operation mode, identifying presence or absence of the user's voice based on the bone conduction signal, and based on the absence of the user's voice being identified for a predetermined time while the wearable electronic apparatus is operated in the different operation mode, controlling the different operation mode to return to the ANC mode.
-
公开(公告)号:US20200152194A1
公开(公告)日:2020-05-14
申请号:US16683342
申请日:2019-11-14
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jonghoon JEONG , Hosang SUNG , Doohwa HONG , Kyoungbo MIN , Eunmi OH , Kihyun CHOO
Abstract: An electronic apparatus, based on a text sentence being input, obtains prosody information of the text sentence, segments the text sentence into a plurality of sentence elements, obtains a speech in which prosody information is reflected to each of the plurality of sentence elements in parallel by inputting the plurality of sentence elements and the prosody information of the text sentence to a text to speech (TTS) module, and merges the speech for the plurality of sentence elements that are obtained in parallel to output speech for the text sentence.
-
公开(公告)号:US20220262377A1
公开(公告)日:2022-08-18
申请号:US17712417
申请日:2022-04-04
Applicant: SAMSUNG ELECTRONICS CO, LTD.
Inventor: Sangjun PARK , Kihyun CHOO , Taehwa KANG , Hosang SUNG , Jonghoon JEONG
Abstract: The disclosure relates to an electronic device and a control method thereof. The electronic device includes a memory, and a processor configured to: obtain first feature data for estimating a waveform by inputting acoustic data of a first quality to a first encoder model; and obtain waveform data of a second quality that is a higher quality than the first quality by inputting the first feature data to a decoder model to.
-
公开(公告)号:US20220180872A1
公开(公告)日:2022-06-09
申请号:US17679446
申请日:2022-02-24
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jonghoon JEONG , Hosang SUNG , Doohwa HONG , Kyoungbo MIN , Eunmi OH , Kihyun CHOO
Abstract: An electronic apparatus, based on a text sentence being input, obtains prosody information of the text sentence, segments the text sentence into a plurality of sentence elements, obtains a speech in which prosody information is reflected to each of the plurality of sentence elements in parallel by inputting the plurality of sentence elements and the prosody information of the text sentence to a text to speech (TTS) module, and merges the speech for the plurality of sentence elements that are obtained in parallel to output speech for the text sentence.
-
公开(公告)号:US20200234693A1
公开(公告)日:2020-07-23
申请号:US16749257
申请日:2020-01-22
Applicant: Samsung Electronics Co., Ltd.
Inventor: Hosang SUNG , Seonho HWANG , Doohwa HONG , Eunmi OH , Kyoungbo MIN , Jonghoon JEONG , Kihyun CHOO
IPC: G10L13/08 , G10L15/22 , G10L15/18 , G10L15/02 , G10L13/033 , G10L13/04 , G10L13/047
Abstract: An electronic device and a controlling method of the electronic device are provided. The electronic device acquires text to respond on a received user's speech, acquires a plurality of pieces of parameter information for determining a style of an output speech corresponding to the text based on information on a type of a plurality of text-to-speech (TTS) databases and the received user's speech, identifies a TTS database corresponding to the plurality of pieces of parameter information among the plurality of TTS databases, identifies a weight set corresponding to the plurality of pieces of parameter information among a plurality of weight sets acquired through a trained artificial intelligence model, adjusts information on the output speech stored in the TTS database based on the weight set, synthesizes the output speech based on the adjusted information on the output speech, and outputs the output speech corresponding to the text.
-
-
-
-
-