专利检索 ap:"Byung-Kwan Kwak" 第 1 页

1.

发明授权
Apparatus and method for recognizing voice command 有权
标题翻译：用于识别语音命令的装置和方法

公开(公告)号：US09142212B2

公开(公告)日：2015-09-22

申请号：US13093919

申请日：2011-04-26

申请人： Chi-Youn Park , Byung-Kwan Kwak , Jeong-Su Kim , Jeong-Mi Cho

发明人： Chi-Youn Park , Byung-Kwan Kwak , Jeong-Su Kim , Jeong-Mi Cho

IPC分类号： G10L21/00 , G10L15/22 , G10L15/00 , H04M1/00 , H04M1/64

CPC分类号： G10L15/22

摘要： An apparatus and method for recognizing a voice command for use in an interactive voice user interface are provided. The apparatus includes a command intention belief generation unit that is configured to recognize a first voice command and that may generate one or more command intention beliefs for the first voice command. The apparatus also includes a command intention belief update unit that is configured to update each of the command intention beliefs based on a system response to the first voice command and a second voice commands. The apparatus also includes a command intention belief selection unit that is configured to select one of the updated command intention beliefs for the first voice command. The apparatus also includes an operation signal output unit that is configured to select a final command intention from the selected updated command intention belief and to output an operation signal based on the selected final command intention.

摘要翻译： 提供了一种用于识别在交互式语音用户界面中使用的语音命令的装置和方法。该装置包括命令意图信念生成单元，其被配置为识别第一语音命令并且可以生成用于第一语音命令的一个或多个命令意图信念。该装置还包括命令意图置信更新单元，其被配置为基于对第一语音命令的系统响应和第二语音命令来更新每个命令意图信念。该装置还包括命令意图置信选择单元，其被配置为选择第一语音命令的更新的命令意图信念之一。该装置还包括操作信号输出单元，其被配置为从所选择的更新命令意图置信度中选择最终命令意图，并且基于所选择的最终命令意图来输出操作信号。

2.

发明申请
APPARATUS AND METHOD FOR VOICE COMMAND RECOGNITION BASED ON A COMBINATION OF DIALOG MODELS 有权
标题翻译：基于对话模型组合的语音命令识别的装置和方法

公开(公告)号：US20120173244A1

公开(公告)日：2012-07-05

申请号：US13245032

申请日：2011-09-26

申请人： Byung-Kwan Kwak , Chi-Youn Park , Jeong-Su Kim , Jeong-Mi Cho

发明人： Byung-Kwan Kwak , Chi-Youn Park , Jeong-Su Kim , Jeong-Mi Cho

IPC分类号： G10L21/00

CPC分类号： G10L15/22 , G10L2015/228

摘要： Provided are a voice command recognition apparatus and method capable of figuring out the intention of a voice command input through a voice dialog interface, by combining a rule based dialog model and a statistical dialog model rule. The voice command recognition apparatus includes a command intention determining unit configured to correct an error in recognizing a voice command of a user, and an application processing unit configured to check whether the final command intention determined in the command intention determining unit comprises the input factors for execution of an application.

摘要翻译： 提供了一种能够通过组合基于规则的对话模型和统计对话模型规则来通过语音对话界面来识别语音命令输入的意图的语音命令识别装置和方法。语音指令识别装置包括：命令意图确定单元，被配置为校正用户识别语音命令时的错误;以及应用处理单元，被配置为检查在命令意图确定单元中确定的最终命令意图是否包括用于执行应用程序。

3.

发明授权
Spoken dialogue interface apparatus and method 失效
标题翻译：口语对话界面设备和方法

公开(公告)号：US07725322B2

公开(公告)日：2010-05-25

申请号：US11348301

申请日：2006-02-07

申请人： Byung-kwan Kwak , Jae-won Lee

发明人： Byung-kwan Kwak , Jae-won Lee

IPC分类号： G10L21/00 , G10L13/00

CPC分类号： G10L15/1822

摘要： The spoken dialogue interface apparatus according to an embodiment of the present invention includes a speech recognition module for recognizing a human's speech from a sound signal; a user intention interpretation module for extracting a sentence from the recognized speech and interpreting a user's intention based on the sentence; a user intention selection module for determining user intention using the interpreted user's intention and a predetermined domain action frame; and a system response generation module for generating a system response sentence corresponding to the selected user intention, wherein the domain action frame includes service information which the user requests, and parameter information which is used to perform a service, and the domain action frame is constructed to have a hierarchical tree structure.

摘要翻译： 根据本发明的实施例的口语对话界面装置包括：语音识别模块，用于从声音信号识别人的语音; 用户意图解释模块，用于从所识别的语音中提取句子，并且基于所述句子解释用户的意图; 用户意图选择模块，用于使用解释的用户的意图和预定的域动作帧来确定用户意图; 以及系统响应生成模块，用于生成与所选择的用户意图相对应的系统响应语句，其中所述域动作帧包括用户请求的服务信息，以及用于执行服务的参数信息，并且构造域动作帧具有层次树结构。

4.

发明申请
VOICE QUERY EXTENSION METHOD AND SYSTEM 有权
标题翻译：语音查询扩展方法和系统

公开(公告)号：US20090157383A1

公开(公告)日：2009-06-18

申请号：US12045138

申请日：2008-03-10

申请人： Jeong Mi Cho , Byung-Kwan Kwak , Nam Hoon Kim , Ick Sang Han

发明人： Jeong Mi Cho , Byung-Kwan Kwak , Nam Hoon Kim , Ick Sang Han

IPC分类号： G10L15/22

CPC分类号： G10L15/08 , G06F17/30637 , G06F17/30669 , G06F17/30681 , G10L15/005 , G10L2015/025 , G10L2015/088

摘要： A voice query extension method and system. The voice query extension method includes: detecting voice activity of a user from an input signal and extracting a feature vector from the voice activity; converting the feature vector into at least one phoneme sequence and generating the at least one phoneme sequence; matching the at least one phoneme sequence with words registered in a dictionary, extracting a string of the matched words with a linguistic meaning, and selecting the string of the matched words as a query; determining whether the query is in a predetermined first language, and when the query is not in the first language as a result of the determining, converting the query using a phoneme to grapheme rule, and generating a query in the first language; and searching using the query in the first language.

摘要翻译： 语音查询扩展方法和系统。语音查询扩展方法包括：从输入信号检测用户的语音活动，并从语音活动中提取特征向量; 将所述特征向量转换成至少一个音素序列并生成所述至少一个音素序列; 将至少一个音素序列与在字典中注册的单词匹配，提取具有语言含义的匹配词的字符串，并且将匹配词的字符串选择为查询; 确定所述查询是否处于预定的第一语言，以及当所述查询作为所述确定的结果不在所述第一语言中时，使用音素将所述查询转换为字母规则，并且以所述第一语言生成查询; 并使用第一语言的查询进行搜索。

5.

发明申请
Device, method, and medium for establishing language model 有权
标题翻译：用于建立语言模型的设备，方法和介质

公开(公告)号：US20070118353A1

公开(公告)日：2007-05-24

申请号：US11545484

申请日：2006-10-11

申请人： Jeong-mi Cho , Byung-kwan Kwak

发明人： Jeong-mi Cho , Byung-kwan Kwak

IPC分类号： G06F17/28

CPC分类号： G06F17/2775 , G10L15/193

摘要： A device, a method, and a medium for establishing a language model for speech recognition are disclosed. The language-model-establishing device includes: a schema expander for expanding a state schema which is composed of at least one state defined by a finite state grammar using a general grammar database; a grammatical-structure-expander for expanding grammatical structures which can be expressed by each state of the expanded state schema using the general grammar database; and a grammatical-structure-filter for filtering out any incorrect grammatical structure from the expanded grammatical structures using the general grammar database. Since the state schema is expanded using the general grammar database, it is possible to improve recognition of unlearned grammatical structures.

摘要翻译： 公开了一种用于建立语音识别语言模型的装置，方法和介质。所述语言建模设备包括：用于扩展状态模式的模式扩展器，所述状态模式由使用一般语法数据库的由有限状态语法定义的至少一个状态组成; 用于扩展语法结构的语法结构扩展器，其可以由使用一般语法数据库的扩展状态模式的每个状态表示; 以及一个语法结构过滤器，用于使用一般语法数据库从扩展的语法结构中过滤任何不正确的语法结构。由于状态模式使用通用语法数据库进行扩展，因此可以提高对未学习的语法结构的识别。

6.

发明授权
Method, medium and apparatus for providing mobile voice web service 有权

公开(公告)号：US09251786B2

公开(公告)日：2016-02-02

申请号：US12007797

申请日：2008-01-15

申请人： Jeong-mi Cho , Ji-yeun Kim , Yoon-kyung Song , Byung-kwan Kwak , Nam-hoon Kim , Ick-sang Han

发明人： Jeong-mi Cho , Ji-yeun Kim , Yoon-kyung Song , Byung-kwan Kwak , Nam-hoon Kim , Ick-sang Han

IPC分类号： G10L15/00 , G10L15/193

CPC分类号： G10L15/193

摘要： Provided are a method and apparatus for providing a mobile voice web service in a mobile terminal. The method includes analyzing a web history of a user from web search logs of the user and generating a voice access list based on the analysis results, and performing voice recognition by dynamically generating a voice recognition syntax according to the generated voice access list. Accordingly, by limiting syntax required for voice recognition by generating a syntax suitable for a web context of the user, efficient voice recognition, which can be performed in a terminal not a server, can be implemented.

7.

发明申请
APPARATUS AND METHOD FOR RECOGNIZING VOICE COMMAND 有权
标题翻译：用于识别语音命令的装置和方法

公开(公告)号：US20120035935A1

公开(公告)日：2012-02-09

申请号：US13093919

申请日：2011-04-26

申请人： Chi-Youn Park , Byung-Kwan Kwak , Jeong-Su Kim , Jeong-Mi Cho

发明人： Chi-Youn Park , Byung-Kwan Kwak , Jeong-Su Kim , Jeong-Mi Cho

IPC分类号： G10L21/00

CPC分类号： G10L15/22

摘要： An apparatus and method for recognizing a voice command for use in an interactive voice user interface are provided. The apparatus includes a command intention belief generation unit that is configured to recognize a first voice command and that may generate one or more command intention beliefs for the first voice command. The apparatus also includes a command intention belief update unit that is configured to update each of the command intention beliefs based on a system response to the first voice command and a second voice commands. The apparatus also includes a command intention belief selection unit that is configured to select one of the updated command intention beliefs for the first voice command. The apparatus also includes an operation signal output unit that is configured to select a final command intention from the selected updated command intention belief and to output an operation signal based on the selected final command intention.

摘要翻译： 提供了一种用于识别在交互式语音用户界面中使用的语音命令的装置和方法。该装置包括命令意图信念生成单元，其被配置为识别第一语音命令并且可以生成用于第一语音命令的一个或多个命令意图信念。该装置还包括命令意图置信更新单元，其被配置为基于对第一语音命令的系统响应和第二语音命令来更新每个命令意图信念。该装置还包括命令意图置信选择单元，其被配置为选择第一语音命令的更新的命令意图信念之一。该装置还包括操作信号输出单元，其被配置为从所选择的更新命令意图置信度中选择最终命令意图，并且基于所选择的最终命令意图来输出操作信号。

8.

发明申请
Apparatus for providing voice dialogue service and method of operating the same 失效
标题翻译：用于提供语音对话服务的装置及其操作方法

公开(公告)号：US20070208556A1

公开(公告)日：2007-09-06

申请号：US11510728

申请日：2006-08-28

申请人： Byung Kwan Kwak , Jeong Mi Cho , In Ho Kang

发明人： Byung Kwan Kwak , Jeong Mi Cho , In Ho Kang

IPC分类号： G06F17/27

CPC分类号： G06F17/2785 , G06F17/2705

摘要： A speech dialogue service apparatus including: a language analysis module tagging a part of speech (POS) of each respective word included in a sentence recorded in a predetermined text, syntactically analyzing the sentence by classifying a meaning of each respective word, and generating at least one semantic frame corresponding to the sentence according to a result of the syntactical analysis; and a dialogue management module analyzing an intention of the sentence corresponding to the at least one respective semantic frame, and generating a system response corresponding to the sentence intention by selecting a predetermined sentence intention according to whether an action corresponding to the intention of the respective sentence can be performed.

摘要翻译： 一种语音对话服务设备，包括：语言分析模块，标记在预定文本中记录的句子中包含的每个单词的一部分语音（POS），通过对每个单词的含义进行分类来语法分析句子，并至少产生根据语法分析的结果，对应于句子的一个语义框架; 以及对话管理模块，其分析与所述至少一个相应语义帧相对应的句子的意图，并且根据与所述句子的意图相对应的动作来选择预定句子意图来生成与所述句子意图相对应的系统响应可以执行。

9.

发明授权
Apparatus and method for voice command recognition based on a combination of dialog models 有权
标题翻译：基于对话模型组合的语音指令识别装置和方法

公开(公告)号：US08954326B2

公开(公告)日：2015-02-10

申请号：US13245032

申请日：2011-09-26

申请人： Byung-Kwan Kwak , Chi-Youn Park , Jeong-Su Kim , Jeong-Mi Cho

发明人： Byung-Kwan Kwak , Chi-Youn Park , Jeong-Su Kim , Jeong-Mi Cho

IPC分类号： G10L15/26 , G10L17/00 , G10L21/00 , G10L15/22

CPC分类号： G10L15/22 , G10L2015/228

摘要： Provided are a voice command recognition apparatus and method capable of figuring out the intention of a voice command input through a voice dialog interface, by combining a rule based dialog model and a statistical dialog model rule. The voice command recognition apparatus includes a command intention determining unit configured to correct an error in recognizing a voice command of a user, and an application processing unit configured to check whether the final command intention determined in the command intention determining unit comprises the input factors for execution of an application.

摘要翻译： 提供了一种能够通过组合基于规则的对话模型和统计对话模型规则来通过语音对话界面来识别语音命令输入的意图的语音命令识别装置和方法。语音指令识别装置包括：命令意图确定单元，被配置为校正用户识别语音命令时的错误;以及应用处理单元，被配置为检查在命令意图确定单元中确定的最终命令意图是否包括用于执行应用程序。

10.

发明授权
Device, method, and medium for establishing language model for expanding finite state grammar using a general grammar database 有权
标题翻译：使用一般语法数据库建立扩展有限状态语法的语言模型的设备，方法和介质

公开(公告)号：US08255220B2

公开(公告)日：2012-08-28

申请号：US11545484

申请日：2006-10-11

申请人： Jeong-mi Cho , Byung-kwan Kwak

发明人： Jeong-mi Cho , Byung-kwan Kwak

IPC分类号： G10L15/00 , G10L15/18

CPC分类号： G06F17/2775 , G10L15/193

摘要： A device, a method, and a medium for establishing a language model for speech recognition are disclosed. The language-model-establishing device includes: a schema expander for expanding a state schema which is composed of at least one state defined by a finite state grammar using a general grammar database; a grammatical-structure-expander for expanding grammatical structures which can be expressed by each state of the expanded state schema using the general grammar database; and a grammatical-structure-filter for filtering out any incorrect grammatical structure from the expanded grammatical structures using the general grammar database. Since the state schema is expanded using the general grammar database, it is possible to improve recognition of unlearned grammatical structures.

摘要翻译： 公开了一种用于建立语音识别语言模型的装置，方法和介质。所述语言建模设备包括：用于扩展状态模式的模式扩展器，所述状态模式由使用一般语法数据库的由有限状态语法定义的至少一个状态组成; 用于扩展语法结构的语法结构扩展器，其可以由使用一般语法数据库的扩展状态模式的每个状态表示; 以及一个语法结构过滤器，用于使用一般语法数据库从扩展的语法结构中过滤任何不正确的语法结构。由于状态模式使用通用语法数据库进行扩展，因此可以提高对未学习的语法结构的识别。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类