专利检索 ap:("Jae-hoon Jeong" OR "So-young Jeong" OR "Jeong-su Kim" OR "Jung-eun Park" OR "Woo-jung Lee") AND inv:"Jeong-su Kim" 第 1 页

1.

发明授权
Audio signal processing method, audio apparatus therefor, and electronic apparatus therefor 有权
标题翻译：音频信号处理方法，音频装置及其电子设备

公开(公告)号：US09047862B2

公开(公告)日：2015-06-02

申请号：US13483571

申请日：2012-05-30

申请人： Jae-hoon Jeong , So-young Jeong , Jeong-su Kim , Jung-eun Park , Woo-jung Lee

发明人： Jae-hoon Jeong , So-young Jeong , Jeong-su Kim , Jung-eun Park , Woo-jung Lee

IPC分类号： G10L21/02 , H04B3/20 , G10L19/008 , G10L15/20 , G10L21/0208

CPC分类号： G10L19/008 , G10L15/20 , G10L2021/02082

摘要： An audio apparatus including a decorrelator for generating decorrelated signals by applying a phase shifting value adjusted based on a correlation difference between audio signals included in a multi-channel signal to the audio signals; and a speaker set including at least two speakers for outputting acoustic signals corresponding to the decorrelated signals.

摘要翻译： 一种音频装置，包括去相关器，用于通过将基于多声道信号中包括的音频信号之间的相关差调整的相移值应用于音频信号来产生去相关信号; 以及包括至少两个扬声器的扬声器组，用于输出对应于解相关信号的声信号。

2.

发明申请
Method of and apparatus for transforming speech feature vector 有权
标题翻译：用于转换语音特征向量的方法和装置

公开(公告)号：US20080147391A1

公开(公告)日：2008-06-19

申请号：US11896450

申请日：2007-08-31

申请人： So-young Jeong , Kwang-cheol Oh , Jae-hoon Jeong , Jeong-su Kim

发明人： So-young Jeong , Kwang-cheol Oh , Jae-hoon Jeong , Jeong-su Kim

IPC分类号： G10L15/16

CPC分类号： G10L15/02 , G10L15/16 , G10L15/20 , G10L25/30

摘要： Provided is a method and apparatus for transforming a speech feature vector. The method includes extracting a feature vector required for speech recognition from a speech signal and transforming the extracted feature vector using an auto-associative neural network (AANN).

摘要翻译： 提供了一种用于变换语音特征向量的方法和装置。该方法包括从语音信号中提取语音识别所需的特征向量，并使用自相关神经网络（AANN）对所提取的特征向量进行变换。

3.

发明授权
Method and apparatus of transforming speech feature vectors using an auto-associative neural network 有权
标题翻译：使用自动关联神经网络来转换语音特征向量的方法和装置

公开(公告)号：US08838446B2

公开(公告)日：2014-09-16

申请号：US11896450

申请日：2007-08-31

申请人： So-young Jeong , Kwang-cheol Oh , Jae-hoon Jeong , Jeong-su Kim

发明人： So-young Jeong , Kwang-cheol Oh , Jae-hoon Jeong , Jeong-su Kim

IPC分类号： G10L15/02 , G10L15/16 , G10L15/20 , G10L25/30

CPC分类号： G10L15/02 , G10L15/16 , G10L15/20 , G10L25/30

摘要： Provided is a method and apparatus for transforming a speech feature vector. The method includes extracting a feature vector required for speech recognition from a speech signal and transforming the extracted feature vector using an auto-associative neural network (AANN).

摘要翻译： 提供了一种用于变换语音特征向量的方法和装置。该方法包括从语音信号中提取语音识别所需的特征向量，并使用自相关神经网络（AANN）对所提取的特征向量进行变换。

4.

发明授权
Multi-stage speech recognition apparatus and method 有权
标题翻译：多级语音识别装置及方法

公开(公告)号：US08762142B2

公开(公告)日：2014-06-24

申请号：US11889665

申请日：2007-08-15

申请人： So-young Jeong , Kwang-cheol Oh , Jae-hoon Jeong , Jeong-su Kim

发明人： So-young Jeong , Kwang-cheol Oh , Jae-hoon Jeong , Jeong-su Kim

IPC分类号： G10L15/02 , G10L15/16 , G10L15/32

CPC分类号： G10L15/32 , G10L15/02 , G10L15/16

摘要： Provided are a multi-stage speech recognition apparatus and method. The multi-stage speech recognition apparatus includes a first speech recognition unit performing initial speech recognition on a feature vector, which is extracted from an input speech signal, and generating a plurality of candidate words; and a second speech recognition unit rescoring the candidate words, which are provided by the first speech recognition unit, using a temporal posterior feature vector extracted from the speech signal.

摘要翻译： 提供了一种多级语音识别装置和方法。多级语音识别装置包括：第一语音识别单元，对从输入语音信号提取的特征向量进行初始语音识别，生成多个候选词; 以及第二语音识别单元，使用从所述语音信号提取的时间后向特征向量，对由所述第一语音识别单元提供的候选词进行重新排序。

5.

发明申请
Multi-stage speech recognition apparatus and method 有权
标题翻译：多级语音识别装置及方法

公开(公告)号：US20080208577A1

公开(公告)日：2008-08-28

申请号：US11889665

申请日：2007-08-15

申请人： So-young Jeong , Kwang-cheol Oh , Jae-hoon Jeong , Jeong-su Kim

发明人： So-young Jeong , Kwang-cheol Oh , Jae-hoon Jeong , Jeong-su Kim

IPC分类号： G10L15/00

CPC分类号： G10L15/32 , G10L15/02 , G10L15/16

摘要： Provided are a multi-stage speech recognition apparatus and method. The multi-stage speech recognition apparatus includes a first speech recognition unit performing initial speech recognition on a feature vector, which is extracted from an input speech signal, and generating a plurality of candidate words; and a second speech recognition unit rescoring the candidate words, which are provided by the first speech recognition unit, using a temporal posterior feature vector extracted from the speech signal.

摘要翻译： 提供了一种多级语音识别装置和方法。多级语音识别装置包括：第一语音识别单元，对从输入语音信号提取的特征向量进行初始语音识别，生成多个候选词; 以及第二语音识别单元，使用从所述语音信号提取的时间后向特征向量，对由所述第一语音识别单元提供的候选词进行重新排序。

6.

发明授权
Apparatus for positioning screen sound source, method of generating loudspeaker set information, and method of reproducing positioned screen sound source 有权
标题翻译：用于定位屏幕声源的装置，产生扬声器组信息的方法，以及再现定位的屏幕声源的方法

公开(公告)号：US08208663B2

公开(公告)日：2012-06-26

申请号：US12482883

申请日：2009-06-11

申请人： So-young Jeong , Jung-ho Kim , Jeong-su Kim

发明人： So-young Jeong , Jung-ho Kim , Jeong-su Kim

IPC分类号： H04R5/02

CPC分类号： H04R5/04

摘要： An apparatus for positioning a screen sound source, a method of generating loudspeaker set information for screen sound source positioning, and a method of reproducing a positioned screen sound source are provided. The apparatus and methods relate to a screen sound source positioning technique. A plurality of loudspeakers, each configured to have approximately the same gain, are each disposed proximate to the edge of a display, and a loudspeaker set including at least two of the loudspeakers is selected to position a virtual sound source substantially synchronized with a visual object displayed at a position on the screen of the display. Accordingly, a virtual sound source may be positioned at a certain specific position on the screen of a display without sound source distortion.

摘要翻译： 提供一种用于定位屏幕声源的装置，一种产生用于屏幕声源定位的扬声器组信息的方法以及再现定位的屏幕声源的方法。该装置和方法涉及屏幕声源定位技术。每个配置成具有近似相同增益的多个扬声器各自设置在显示器的边缘附近，并且选择包括至少两个扬声器的扬声器组，以将基本上与视觉对象同步的虚拟声源定位显示在显示屏的屏幕上的位置。因此，虚拟声源可以位于显示器的屏幕上的某个特定位置，而没有声源失真。

7.

发明申请
APPARATUS FOR POSITIONING SCREEN SOUND SOURCE, METHOD OF GENERATING LOUDSPEAKER SET INFORMATION, AND METHOD OF REPRODUCING POSITIONED SCREEN SOUND SOURCE 有权
标题翻译：用于定位屏幕声源的装置，产生扬声器组信息的方法和再现定位屏幕声源的方法

公开(公告)号：US20100111336A1

公开(公告)日：2010-05-06

申请号：US12482883

申请日：2009-06-11

申请人： So-young JEONG , Jung-ho Kim , Jeong-su Kim

发明人： So-young JEONG , Jung-ho Kim , Jeong-su Kim

IPC分类号： H04R5/02

CPC分类号： H04R5/04

摘要： An apparatus for positioning a screen sound source, a method of generating loudspeaker set information for screen sound source positioning, and a method of reproducing a positioned screen sound source are provided. The apparatus and methods relate to a screen sound source positioning technique. A plurality of loudspeakers, each configured to have approximately the same gain, are each disposed proximate to the edge of a display, and a loudspeaker set including at least two of the loudspeakers is selected to position a virtual sound source substantially synchronized with a visual object displayed at a position on the screen of the display. Accordingly, a virtual sound source may be positioned at a certain specific position on the screen of a display without sound source distortion.

摘要翻译： 提供一种用于定位屏幕声源的装置，一种产生用于屏幕声源定位的扬声器组信息的方法以及再现定位的屏幕声源的方法。该装置和方法涉及屏幕声源定位技术。每个配置成具有近似相同增益的多个扬声器各自设置在显示器的边缘附近，并且选择包括至少两个扬声器的扬声器组，以将基本上与视觉对象同步的虚拟声源定位显示在显示屏的屏幕上的位置。因此，虚拟声源可以位于显示器的屏幕上的某个特定位置，而没有声源失真。

8.

发明申请
Apparatus and method for speech recognition using a plurality of confidence score estimation algorithms 有权
标题翻译：使用多个置信分数估计算法进行语音识别的装置和方法

公开(公告)号：US20070136058A1

公开(公告)日：2007-06-14

申请号：US11517369

申请日：2006-09-08

申请人： Jae-hoon Jeong , Sang-bae Jeong , Jeong-su Kim , Nam-hoon Kim

发明人： Jae-hoon Jeong , Sang-bae Jeong , Jeong-su Kim , Nam-hoon Kim

IPC分类号： G10L15/00

CPC分类号： G10L15/08 , G10L2015/088

摘要： An apparatus for speech recognition includes: a first confidence score calculator calculating a first confidence score using a ratio between a likelihood of a keyword model for feature vectors per frame of a speech signal and a likelihood of a Filler model for the feature vectors; a second confidence score calculator calculating a second confidence score by comparing a Gaussian distribution trace of the keyword model per frame of the speech signal with a Gaussian distribution trace sample of a stored corresponding keyword of the keyword model; and a determination module determining a confidence of a result using the keyword model in accordance with a position determined by the first and second confidence scores on a confidence coordinate system.

摘要翻译： 一种用于语音识别的装置包括：第一置信度分数计算器，使用针对每个语音信号的每个特征向量的关键字模型的似然率与特征向量的填充模型的似然率之间的比率来计算第一置信度分数; 第二置信度计算器通过将所述语音信号的每帧的关键字模型的高斯分布轨迹与所述关键字模型的存储的对应关键字的高斯分布轨迹样本进行比较来计算第二置信度分数; 以及确定模块，其根据由置信坐标系上的第一和第二置信度得分确定的位置，使用关键字模型确定结果的置信度。

9.

发明授权
Positioning and reproducing screen sound source with high resolution 有权
标题翻译：定位和再现高分辨率的屏幕声源

公开(公告)号：US09036842B2

公开(公告)日：2015-05-19

申请号：US12483693

申请日：2009-06-12

申请人： Jung-ho Kim , So-young Jeong , Jeong-su Kim

发明人： Jung-ho Kim , So-young Jeong , Jeong-su Kim

IPC分类号： H04R5/02 , H04N5/60 , H04R3/12 , H04S7/00

CPC分类号： H04R3/12 , H04R2400/11 , H04R2420/07 , H04R2430/03 , H04R2499/15 , H04S3/008 , H04S7/30 , H04S2400/11 , H04S2420/07

摘要： A virtual screen sound source is spatially synchronized with a visual object displayed on a display. A plurality of loudspeaker sets, which each include at least three of a plurality of loudspeakers installed at the periphery of a display, are selected, individual sound sources corresponding to the respective selected loudspeaker sets are generated, and a multi-sound source is generated by overlapping the generated individual sound sources and output through loudspeakers included in the loudspeaker sets.

摘要翻译： 虚拟屏幕声源与显示器上显示的视觉对象在空间上同步。选择多个扬声器组，每个扬声器组包括安装在显示器周边的多个扬声器中的至少三个扬声器组，产生对应于各个选定扬声器组的各个声源，并且通过以下方式产生多声源：通过扬声器组中包含的扬声器重叠生成的各个声源并输出。

10.

发明授权
Apparatus and method for speech recognition using a plurality of confidence score estimation algorithms 有权
标题翻译：使用多个置信分数估计算法进行语音识别的装置和方法

公开(公告)号：US08543399B2

公开(公告)日：2013-09-24

申请号：US11517369

申请日：2006-09-08

申请人： Jae-hoon Jeong , Sang-bae Jeong , Jeong-su Kim , Nam-hoon Kim

发明人： Jae-hoon Jeong , Sang-bae Jeong , Jeong-su Kim , Nam-hoon Kim

IPC分类号： G10L15/00

CPC分类号： G10L15/08 , G10L2015/088

摘要： An apparatus for speech recognition includes: a first confidence score calculator calculating a first confidence score using a ratio between a likelihood of a keyword model for feature vectors per frame of a speech signal and a likelihood of a Filler model for the feature vectors; a second confidence score calculator calculating a second confidence score by comparing a Gaussian distribution trace of the keyword model per frame of the speech signal with a Gaussian distribution trace sample of a stored corresponding keyword of the keyword model; and a determination module determining a confidence of a result using the keyword model in accordance with a position determined by the first and second confidence scores on a confidence coordinate system.

摘要翻译： 一种用于语音识别的装置包括：第一置信度分数计算器，使用针对每个语音信号的每个特征向量的关键字模型的似然率与特征向量的填充模型的似然率之间的比率来计算第一置信度分数; 第二置信度计算器通过将所述语音信号的每帧的关键字模型的高斯分布轨迹与所述关键字模型的存储的对应关键字的高斯分布轨迹样本进行比较来计算第二置信度分数; 以及确定模块，其根据由置信坐标系上的第一和第二置信度得分确定的位置，使用关键字模型确定结果的置信度。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类