-
公开(公告)号:US20200152194A1
公开(公告)日:2020-05-14
申请号:US16683342
申请日:2019-11-14
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jonghoon JEONG , Hosang SUNG , Doohwa HONG , Kyoungbo MIN , Eunmi OH , Kihyun CHOO
Abstract: An electronic apparatus, based on a text sentence being input, obtains prosody information of the text sentence, segments the text sentence into a plurality of sentence elements, obtains a speech in which prosody information is reflected to each of the plurality of sentence elements in parallel by inputting the plurality of sentence elements and the prosody information of the text sentence to a text to speech (TTS) module, and merges the speech for the plurality of sentence elements that are obtained in parallel to output speech for the text sentence.
-
公开(公告)号:US20240038215A1
公开(公告)日:2024-02-01
申请号:US18223265
申请日:2023-07-18
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Qiuyue MA , Yuxing ZHENG , Hosang SUNG , Lizhong WANG , Xiaoyan LOU
Abstract: The present disclosure provides methods, devices, and computer-readable mediums for audio signal processing. In some embodiments, a method executed by an electronic device includes obtaining guidance features corresponding to an audio signal to be processed, the guidance features indicating distinguishable features of at least one signal type of at least one signal category. The method further includes extracting, according to the guidance features, target audio features corresponding to the audio signal. The method further includes determining, according to the target audio features, a target signal type of the audio signal from among the at least one signal type of the at least one signal category. The method further includes performing corresponding processing according to the target signal type of the audio signal.
-
公开(公告)号:US20170337925A1
公开(公告)日:2017-11-23
申请号:US15670653
申请日:2017-08-07
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Steven Craig GREER , Hosang SUNG
IPC: G10L19/005 , G10L19/24 , G10L19/002
Abstract: An audio coding terminal and method is provided. The terminal includes a coding mode setting unit to set an operation mode, from plural operation modes, for input audio coding by a codec, configured to code the input audio based on the set operation mode such that when the set operation mode is a high frame erasure rate (FER) mode the codec codes a current frame of the input audio according to a select frame erasure concealment (FEC) mode of one or more FEC modes. Upon the setting of the operation mode to be the High FER mode, the one FEC mode is selected, from the one or more FEC modes predetermined for the High FER mode, to control the codec by incorporating of redundancy within a coding of the input audio or as separate redundancy information separate from the coded input audio according to the selected one FEC mode.
-
公开(公告)号:US20220246129A1
公开(公告)日:2022-08-04
申请号:US17578164
申请日:2022-01-18
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Hosang SUNG , Lei YANG , Jonguk YOO , Jonghoon JEONG , Kihyun CHOO
IPC: G10K11/178 , G10L25/78
Abstract: A controlling method of a wearable electronic apparatus includes: receiving, by an IMU sensor, a bone conduction signal corresponding to vibration in the user's face, while the wearable electronic apparatus is operated in an ANC mode; identifying a presence or an absence of the user's voice based on the bone conduction signal, based on the identifying the presence of the user's voice, controlling an operation mode of the wearable electronic apparatus to be a different operation mode from the ANC mode; while the wearable electronic apparatus is operated in the different operation mode, identifying presence or absence of the user's voice based on the bone conduction signal, and based on the absence of the user's voice being identified for a predetermined time while the wearable electronic apparatus is operated in the different operation mode, controlling the different operation mode to return to the ANC mode.
-
公开(公告)号:US20160196827A1
公开(公告)日:2016-07-07
申请号:US15069473
申请日:2016-03-14
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Steven Craig GREER , Hosang SUNG
IPC: G10L19/005 , G10L19/24 , G10L19/002
CPC classification number: G10L19/005 , G10L19/002 , G10L19/24
Abstract: An audio coding terminal and method is provided. The terminal includes a coding mode setting unit to set an operation mode, from plural operation modes, for input audio coding by a codec, configured to code the input audio based on the set operation mode such that when the set operation mode is a high frame erasure rate (FER) mode the codec codes a current frame of the input audio according to a select frame erasure concealment (FEC) mode of one or more FEC modes. Upon the setting of the operation mode to be the High FER mode, the one FEC mode is selected, from the one or more FEC modes predetermined for the High FER mode, to control the codec by incorporating of redundancy within a coding of the input audio or as separate redundancy information separate from the coded input audio according to the selected one FEC mode.
-
公开(公告)号:US20230298614A1
公开(公告)日:2023-09-21
申请号:US18107185
申请日:2023-02-08
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jubum HAN , Hosang SUNG , Yeaseul SONG , Jeonghoon LEE
CPC classification number: G10L25/51 , G10L25/30 , G10L25/18 , G10L15/063 , G10L21/12 , G10L21/14 , G10L15/22
Abstract: An example sound recognition method may include sampling input sound based on a preset sampling rate; performing Fast Fourier Transform (FFT) on the sampled input sound based on at least one of random FFT numbers or random hop lengths, and generating a two-dimensional (2D) feature map with a time axis and a frequency axis from the sampled input sound on which FFT is performed; training a neural network model, which recognizes sound, with a plurality of 2D feature maps including the first 2D feature map and an nth 2D feature map as training data.
-
7.
公开(公告)号:US20160111101A1
公开(公告)日:2016-04-21
申请号:US14980927
申请日:2015-12-28
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Hosang SUNG , Kangeun LEE , Seungho CHOI
IPC: G10L19/00 , G10L19/07 , G10L19/083 , G10L19/005 , G10L19/12
CPC classification number: G10L19/0017 , G10L19/005 , G10L19/06 , G10L19/07 , G10L19/083 , G10L19/12 , G10L2019/0002 , G10L2019/0016
Abstract: An apparatus and method for concealing frame erasure and a voice decoding apparatus and method using the same. The frame erasure concealment apparatus includes: a parameter extraction unit determining whether there is an erased frame in a voice packet, and extracting an excitement signal parameter and a line spectrum pair parameter of a previous good frame; and an erasure frame concealment unit, if there is an erased frame, restoring the excitement signal and line spectrum pair parameter of the erased frame by using a regression analysis from the excitement signal and line spectrum pair parameter of the previous good frame. According to the method and apparatus, by predicting and restoring the parameter of the erased frame through the regression analysis, the quality of the restored voice signal can be enhanced and the algorithm can be simplified.
-
公开(公告)号:US20140006023A1
公开(公告)日:2014-01-02
申请号:US14020139
申请日:2013-09-06
Applicant: SAMSUNG Electronics Co., Ltd.
Inventor: Hosang SUNG , Kangeun LEE , Sang-won KANG , Thomas R. FISCHER , Ja-kyoung JUN
IPC: G10L15/08
CPC classification number: G10L19/0212 , G10L15/08 , G10L19/10 , G10L19/107 , G10L19/12 , G10L2019/0013
Abstract: A method and apparatus to search a codebook including pulses that model a predetermined component of a speech signal. The method includes the operations of selecting a predetermined number of paths corresponding to a predetermined number of pulse locations that are most consistent with the predetermined component, from among paths corresponding to pulse locations of a predetermined pulse location set allocated to at least one branch that connects one state of a predetermined Trellis structure to another state, performing the path selecting operation on each of states other than the one state, and selecting a path corresponding to pulse locations that are most consistent with the predetermined component, from among paths including the selected paths. Accordingly, the number of calculations required during a codebook search is reduced.
Abstract translation: 一种搜索包括对语音信号的预定分量进行建模的脉冲的码本的方法和装置。 该方法包括从对应于分配给连接至少一个分支的预定脉冲位置集的脉冲位置的路径中选择对应于与预定分量最一致的预定数量的脉冲位置的预定数量的路径的操作 将预定网格结构的一个状态转换到另一状态,对除了一个状态以外的状态执行路径选择操作,并且从包括所选择的路径的路径中选择对应于与预定分量最一致的脉冲位置的路径 。 因此,减少码本搜索期间所需的计算次数。
-
公开(公告)号:US20220262377A1
公开(公告)日:2022-08-18
申请号:US17712417
申请日:2022-04-04
Applicant: SAMSUNG ELECTRONICS CO, LTD.
Inventor: Sangjun PARK , Kihyun CHOO , Taehwa KANG , Hosang SUNG , Jonghoon JEONG
Abstract: The disclosure relates to an electronic device and a control method thereof. The electronic device includes a memory, and a processor configured to: obtain first feature data for estimating a waveform by inputting acoustic data of a first quality to a first encoder model; and obtain waveform data of a second quality that is a higher quality than the first quality by inputting the first feature data to a decoder model to.
-
公开(公告)号:US20220180872A1
公开(公告)日:2022-06-09
申请号:US17679446
申请日:2022-02-24
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jonghoon JEONG , Hosang SUNG , Doohwa HONG , Kyoungbo MIN , Eunmi OH , Kihyun CHOO
Abstract: An electronic apparatus, based on a text sentence being input, obtains prosody information of the text sentence, segments the text sentence into a plurality of sentence elements, obtains a speech in which prosody information is reflected to each of the plurality of sentence elements in parallel by inputting the plurality of sentence elements and the prosody information of the text sentence to a text to speech (TTS) module, and merges the speech for the plurality of sentence elements that are obtained in parallel to output speech for the text sentence.
-
-
-
-
-
-
-
-
-