-
Publication No.: US11335325B2
Publication date: 2022-05-17
Application No.: US16749257
Filing date: 2020-01-22
Inventors: Hosang Sung, Seonho Hwang, Doohwa Hong, Eunmi Oh, Kyoungbo Min, Jonghoon Jeong, Kihyun Choo
IPC classification: G10L13/08, G10L15/22, G10L15/18, G10L13/047, G10L13/033, G10L15/02, G10L13/00
Abstract: An electronic device and a controlling method of the electronic device are provided. The electronic device acquires text to respond to a received user's speech, acquires a plurality of pieces of parameter information for determining a style of an output speech corresponding to the text based on information on a type of a plurality of text-to-speech (TTS) databases and the received user's speech, identifies a TTS database corresponding to the plurality of pieces of parameter information among the plurality of TTS databases, identifies a weight set corresponding to the plurality of pieces of parameter information among a plurality of weight sets acquired through a trained artificial intelligence model, adjusts information on the output speech stored in the TTS database based on the weight set, synthesizes the output speech based on the adjusted information on the output speech, and outputs the output speech corresponding to the text.
-
Publication No.: US09479871B2
Publication date: 2016-10-25
Application No.: US14134508
Filing date: 2013-12-19
Inventors: Junghoe Kim, Eunmi Oh, Kihyun Choo, Miao Lei
CPC classification: H04R5/02, G10L19/008, H04R5/033, H04S1/002, H04S3/00, H04S3/002, H04S3/02, H04S2420/01, H04S2420/07
Abstract: A method, medium, and system for generating a 3-dimensional (3D) stereo signal in a decoder by using a surround data stream. According to such a method, medium, and system, a head related transfer function (HRTF) is applied in a quadrature mirror filter (QMF) domain, thereby generating a 3D stereo signal by using a surround data stream.
-
Publication No.: US09848180B2
Publication date: 2017-12-19
Application No.: US14794517
Filing date: 2015-07-08
Inventors: Junghoe Kim, Eunmi Oh, Kihyun Choo, Miao Lei
CPC classification: H04N13/161, G10L19/008, H04S1/007, H04S3/008, H04S7/308, H04S2420/03
Abstract: Surround audio decoding for selectively generating an audio signal from a multi-channel signal. In the surround audio decoding, a down-mixed signal, e.g., as down-mixed by an encoding terminal, is selectively up-mixed to a stereo signal or a multi-channel signal, by generating spatial information for generating the stereo signal, using spatial information for up-mixing the down-mixed signal to the multi-channel signal.
-
Publication No.: US09667270B2
Publication date: 2017-05-30
Application No.: US14804939
Filing date: 2015-07-21
Inventors: Junghoe Kim, Miao Lei, Eunmi Oh
IPC classification: G10L19/008, G10L19/16, H04S3/00, H03M7/30, H04H20/80
CPC classification: H03M7/30, G10L19/008, G10L19/167, H04H20/80, H04S3/002, H04S3/008, H04S2420/03
Abstract: A system and method of encoding/decoding a multi-channel audio signal, including a decoding level generation unit producing decoding-level information that helps a bitstream including a number of audio channel signals and space information to be decoded into a number of audio channel signals, wherein the space information includes information about magnitude differences and/or similarities between channels, and an audio decoder decoding the bitstream according to the decoding-level information. Accordingly, even a single input bitstream can be decoded into a suitable number of channels depending on the type of speaker configuration used. Scalable channel decoding can be achieved by partially decoding an input bitstream. In the scalable channel decoding, a decoder may set decoding levels and output audio channel signals according to the decoding levels, thereby reducing decoding complexity.
-
Publication No.: USRE46082E1
Publication date: 2016-07-26
Application No.: US13678413
Filing date: 2012-11-15
Inventors: Junghoe Kim, Eunmi Oh, Boris Kudryashov, Konstantin Osipov
IPC classification: G10L25/00, G10L19/00, G10L19/02, G10L21/00, G10L19/028
CPC classification: G10L19/028, G10L19/0017, G10L19/02
Abstract: An apparatus and method of low bit rate encoding and reproducing. The method includes transforming input audio signals in a time domain into spectral signals in a frequency domain; extracting important-spectrum components from the spectral signals in the frequency domain and quantizing the important-spectrum components; extracting residual-spectrum components other than the important-spectrum components from the spectral signals in the frequency domain and calculating and quantizing a noise level of the residual-spectrum components; and losslessly encoding the quantized important-spectrum components and the quantized noise level and outputting encoded bitstreams.
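Illustrative sketch: the encoding steps named in this abstract can be outlined in a few lines of Python. This is a minimal sketch under stated assumptions, not the patented method: the abstract does not say which transform is used, how "important" components are chosen, or how quantization works. Here a plain DCT-II stands in for the transform, the largest-magnitude bins are treated as important, both quantizers are uniform, and the noise level is the RMS of the residual bins. The names `dct`, `encode_frame`, `num_important`, `step`, and `noise_step` are hypothetical, and the final lossless coding stage is omitted.

```python
import math


def dct(frame):
    """Naive DCT-II: one possible time-to-frequency transform (an assumption)."""
    n = len(frame)
    return [
        sum(frame[t] * math.cos(math.pi / n * (t + 0.5) * k) for t in range(n))
        for k in range(n)
    ]


def encode_frame(spectrum, num_important=4, step=0.5, noise_step=0.25):
    """Quantize the important bins and summarize the rest as one noise level."""
    # Assumption: "important" means largest magnitude; the abstract does not say.
    ranked = sorted(range(len(spectrum)), key=lambda i: abs(spectrum[i]), reverse=True)
    important = sorted(ranked[:num_important])

    # Uniform scalar quantization of the important-spectrum components.
    quantized = {i: round(spectrum[i] / step) for i in important}

    # Residual bins are everything not selected above; keep only their RMS level.
    residual = [x for i, x in enumerate(spectrum) if i not in quantized]
    rms = math.sqrt(sum(x * x for x in residual) / len(residual)) if residual else 0.0
    noise_index = round(rms / noise_step)

    # A real encoder would now entropy-code both outputs losslessly (omitted here).
    return quantized, noise_index
```

A caller would run `encode_frame(dct(frame))` once per frame; a decoder could then restore the important bins from `quantized` and fill the remaining bins with noise scaled by `noise_index * noise_step`.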
-
Publication No.: US20150199972A1
Publication date: 2015-07-16
Application No.: US14623431
Filing date: 2015-02-16
Inventors: Jung Hoe Kim, Eunmi Oh, Mi Young Kim, Ki Hyun Choo
IPC classification: G10L19/008
CPC classification: G10L19/008
Abstract: An apparatus and method for encoding/decoding a multi-channel signal may be provided. The apparatus for encoding a multi-channel signal may insert information about whether to encode a phase parameter indicating phase information of a plurality of channels, included in the multi-channel signal, in a bitstream of the multi-channel signal. The apparatus for decoding a multi-channel signal may determine whether to up-mix a mono signal using the phase parameter based on the information about whether to encode.
-
Publication No.: US20200279551A1
Publication date: 2020-09-03
Application No.: US16788418
Filing date: 2020-02-12
Inventors: Hosang Sung, Kyoungbo Min, Seonho Hwang, Doohwa Hong, Eunmi Oh, Jonghoon Jeong, Kihyun Choo
IPC classification: G10L13/08, G10L25/63, G10L17/00, G10L13/04, G10L13/047
Abstract: An electronic apparatus which acquires input data to be input into a TTS module for outputting a voice through the TTS module, acquires a voice signal corresponding to the input data through the TTS module, detects an error in the acquired voice signal based on the input data, corrects the input data based on the detection result, and acquires a corrected voice signal corresponding to the corrected input data through the TTS module.
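Illustrative sketch: the synthesize, detect, correct, and re-synthesize flow in this abstract can be pictured as a small control loop. The Python below is only a sketch under stated assumptions. The abstract does not describe the TTS module, the error detector, or the correction rule, so `toy_tts`, `toy_detect_error`, and `toy_correct_input` are hypothetical toy stand-ins, and repeating the cycle up to `max_rounds` times is also an assumption.

```python
def synthesize_with_correction(text, tts, detect_error, correct_input, max_rounds=3):
    """Synthesize, check the result against the input, and retry with corrected input."""
    voice = tts(text)
    for _ in range(max_rounds):
        error = detect_error(text, voice)
        if error is None:
            break  # The output matches the input data; nothing to correct.
        text = correct_input(text, error)
        voice = tts(text)
    return text, voice


def toy_tts(text):
    # Stand-in for a TTS module: "speaks" only alphabetic tokens and skips the rest.
    return [word for word in text.split() if word.isalpha()]


def toy_detect_error(text, voice):
    # Report the first input token that is missing from the synthesized output.
    for word in text.split():
        if word not in voice:
            return word
    return None


def toy_correct_input(text, missing):
    # Rewrite only the token that failed, using a hypothetical replacement table.
    replacements = {"&": "and"}
    return " ".join(
        replacements.get(word, word) if word == missing else word
        for word in text.split()
    )
```

Calling `synthesize_with_correction("salt & pepper", toy_tts, toy_detect_error, toy_correct_input)` runs one correction round: the toy synthesizer drops `&`, the detector reports it, and the corrected input is synthesized again.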
-
Publication No.: US09706325B2
Publication date: 2017-07-11
Application No.: US15180930
Filing date: 2016-06-13
Inventors: Junghoe Kim, Eunmi Oh, Kihyun Choo, Miao Lei
CPC classification: H04S5/005, G10L19/008, H04B1/1646, H04S3/00, H04S3/02, H04S2420/01
Abstract: A method, medium, and system for decoding and/or encoding multiple channels. Accordingly, down-mixed multiple channels can be decoded/up-mixed to a left channel and a right channel during a first stage, thereby enabling a high quality sound output even in scalable channel decoding.
-
Publication No.: US20230206897A1
Publication date: 2023-06-29
Application No.: US18171079
Filing date: 2023-02-17
Inventors: Hosang Sung, Kyoungbo Min, Seonho Hwang, Doohwa Hong, Eunmi Oh, Jonghoon Jeong, Kihyun Choo
IPC classification: G10L13/08, G10L25/63, G10L13/047, G10L13/00, G10L17/00
CPC classification: G10L13/08, G10L25/63, G10L13/047, G10L13/00, G10L17/00
Abstract: An electronic apparatus which acquires input data to be input into a TTS module for outputting a voice through the TTS module, acquires a voice signal corresponding to the input data through the TTS module, detects an error in the acquired voice signal based on the input data, corrects the input data based on the detection result, and acquires a corrected voice signal corresponding to the corrected input data through the TTS module.
-
Publication No.: US11587547B2
Publication date: 2023-02-21
Application No.: US16788418
Filing date: 2020-02-12
Inventors: Hosang Sung, Kyoungbo Min, Seonho Hwang, Doohwa Hong, Eunmi Oh, Jonghoon Jeong, Kihyun Choo
Abstract: An electronic apparatus which acquires input data to be input into a TTS module for outputting a voice through the TTS module, acquires a voice signal corresponding to the input data through the TTS module, detects an error in the acquired voice signal based on the input data, corrects the input data based on the detection result, and acquires a corrected voice signal corresponding to the corrected input data through the TTS module.
-