专利检索 ap:("Dimitri Kanevsky" OR "Joseph Reisinger" OR "Robert Sicconi" OR "Mahesh Viswanathan") AND inv:"Mahesh Viswanathan" 第 1 页

1.

发明申请
Hierarchical methods and apparatus for extracting user intent from spoken utterances 有权
标题翻译：用于从口语中提取用户意图的分层方法和装置

公开(公告)号：US20070055529A1

公开(公告)日：2007-03-08

申请号：US11216483

申请日：2005-08-31

申请人： Dimitri Kanevsky , Joseph Reisinger , Robert Sicconi , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Joseph Reisinger , Robert Sicconi , Mahesh Viswanathan

IPC分类号： G10L21/00

CPC分类号： G10L15/1815 , G10L15/1822 , G10L2015/226

摘要： Improved techniques are disclosed for permitting a user to employ more human-based grammar (i.e., free form or conversational input) while addressing a target system via a voice system. For example, a technique for determining intent associated with a spoken utterance of a user comprises the following steps/operations. Decoded speech uttered by the user is obtained. An intent is then extracted from the decoded speech uttered by the user. The intent is extracted in an iterative manner such that a first class is determined after a first iteration and a sub-class of the first class is determined after a second iteration. The first class and the sub-class of the first class are hierarchically indicative of the intent of the user, e.g., a target and data that may be associated with the target. The multi-stage intent extraction approach may have more than two iterations. By way of example only, the user intent extracting step may further determine a sub-class of the sub-class of the first class after a third iteration, such that the first class, the sub-class of the first class, and the sub-class of the sub-class of the first class are hierarchically indicative of the intent of the user.

摘要翻译： 公开了改进的技术，以允许用户在经由语音系统寻址目标系统的同时采用更多的基于人的语法（即，自由形式或会话输入）。例如，用于确定与用户的口语发音相关联的意图的技术包括以下步骤/操作。获得用户发出的解码的语音。然后从用户发出的解码语音中提取意图。以迭代方式提取意图，使得在第一次迭代之后确定第一类并且在第二次迭代之后确定第一类的子类。第一类的第一类和子类分层地指示用户的意图，例如可以与目标相关联的目标和数据。多级意图提取方法可能有两次以上的迭代。仅作为示例，用户意图提取步骤还可以在第三次迭代之后确定第一类的子类的子类，使得第一类，第一类的子类和子类第一类的子类的类被分层地指示用户的意图。

2.

发明授权
Hierarchical methods and apparatus for extracting user intent from spoken utterances 有权
标题翻译：用于从口语中提取用户意图的分层方法和装置

公开(公告)号：US08265939B2

公开(公告)日：2012-09-11

申请号：US11216483

申请日：2005-08-31

申请人： Dimitri Kanevsky , Joseph Simon Reisinger , Robert Sicconi , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Joseph Simon Reisinger , Robert Sicconi , Mahesh Viswanathan

IPC分类号： G10L21/00

CPC分类号： G10L15/1815 , G10L15/1822 , G10L2015/226

摘要： A technique for determining intent associated with a spoken utterance of a user comprises the following steps/operations. Decoded speech uttered by the user is obtained. An intent is then extracted from the decoded speech uttered by the user. The intent is extracted in an iterative manner such that a class is determined after a first iteration and a sub-class of the class is determined after a second iteration. The class and the sub-class of the class are hierarchically indicative of the intent of the user, e.g., a target and data that may be associated with the target. The user intent extracting step may further determine a sub-class of the sub-class of the class after a third iteration, such that the class, the sub-class of the class, and the sub-class of the sub-class of the class are hierarchically indicative of the intent of the user.

摘要翻译： 用于确定与用户的口语发音相关联的意图的技术包括以下步骤/操作。获得用户发出的解码的语音。然后从用户发出的解码语音中提取意图。以迭代方式提取意图，使得在第一次迭代之后确定类并且在第二次迭代之后确定该类的子类。类的类和子类分层地指示用户的意图，例如目标和可以与目标相关联的数据。用户意图提取步骤还可以在第三次迭代之后确定类的子类的子类，使得类，类的子类和子类的子类类分层表示用户的意图。

3.

发明申请
HIERARCHICAL METHODS AND APPARATUS FOR EXTRACTING USER INTENT FROM SPOKEN UTTERANCES 有权
标题翻译：提取用户信息的分类方法和设备

公开(公告)号：US20130006637A1

公开(公告)日：2013-01-03

申请号：US13564596

申请日：2012-08-01

申请人： Dimitri Kanevsky , Joseph Simon Reisinger , Robert Sicconi , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Joseph Simon Reisinger , Robert Sicconi , Mahesh Viswanathan

IPC分类号： G10L15/04

CPC分类号： G10L15/1815 , G10L15/1822 , G10L2015/226

摘要： Improved techniques are disclosed for permitting a user to employ more human-based grammar (i.e., free form or conversational input) while addressing a target system via a voice system. For example, a technique for determining intent associated with a spoken utterance of a user comprises the following steps/operations. Decoded speech uttered by the user is obtained. An intent is then extracted from the decoded speech uttered by the user. The intent is extracted in an iterative manner such that a first class is determined after a first iteration and a sub-class of the first class is determined after a second iteration. The first class and the sub-class of the first class are hierarchically indicative of the intent of the user, e.g., a target and data that may be associated with the target.

摘要翻译： 公开了改进的技术，以允许用户在经由语音系统寻址目标系统的同时采用更多的基于人的语法（即，自由形式或会话输入）。例如，用于确定与用户的口语发音相关联的意图的技术包括以下步骤/操作。获得用户发出的解码的语音。然后从用户发出的解码语音中提取意图。以迭代方式提取意图，使得在第一次迭代之后确定第一类并且在第二次迭代之后确定第一类的子类。第一类的第一类和子类分层地指示用户的意图，例如可以与目标相关联的目标和数据。

4.

发明申请
Hierarchical Methods and Apparatus for Extracting User Intent from Spoken Utterances 审中-公开
标题翻译：用于从口语中提取用户意图的分层方法和装置

公开(公告)号：US20080221903A1

公开(公告)日：2008-09-11

申请号：US12125441

申请日：2008-05-22

申请人： Dimitri Kanevsky , Joseph Simon Reisinger , Robert Sicconi , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Joseph Simon Reisinger , Robert Sicconi , Mahesh Viswanathan

IPC分类号： G10L21/00

CPC分类号： G10L15/1815 , G10L15/1822 , G10L2015/226

摘要： Improved techniques are disclosed for permitting a user to employ more human-based grammar (i.e., free form or conversational input) while addressing a target system via a voice system. For example, a technique for determining intent associated with a spoken utterance of a user comprises the following steps/operations. Decoded speech uttered by the user is obtained. An intent is then extracted from the decoded speech uttered by the user. The intent is extracted in an iterative manner such that a first class is determined after a first iteration and a sub-class of the first class is determined after a second iteration. The first class and the sub-class of the first class are hierarchically indicative of the intent of the user, e.g., a target and data that may be associated with the target. The multi-stage intent extraction approach may have more than two iterations. By way of example only, the user intent extracting step may further determine a sub-class of the sub-class of the first class after a third iteration, such that the first class, the sub-class of the first class, and the sub-class of the sub-class of the first class are hierarchically indicative of the intent of the user.

摘要翻译： 公开了改进的技术，以允许用户在经由语音系统寻址目标系统的同时采用更多的基于人的语法（即，自由形式或会话输入）。例如，用于确定与用户的口语发音相关联的意图的技术包括以下步骤/操作。获得用户发出的解码的语音。然后从用户发出的解码语音中提取意图。以迭代方式提取意图，使得在第一次迭代之后确定第一类并且在第二次迭代之后确定第一类的子类。第一类的第一类和子类分层地指示用户的意图，例如可以与目标相关联的目标和数据。多级意图提取方法可能有两次以上的迭代。仅作为示例，用户意图提取步骤还可以在第三次迭代之后确定第一类的子类的子类，使得第一类，第一类的子类和子类第一类的子类的类被分层地指示用户的意图。

5.

发明授权
Methods and apparatus for use in speech recognition systems for identifying unknown words and for adding previously unknown words to vocabularies and grammars of speech recognition systems 有权

公开(公告)号：US09754586B2

公开(公告)日：2017-09-05

申请号：US12133762

申请日：2008-06-05

申请人： Sabine Deligne , Ramesh A. Gopinath , Dimitri Kanevsky , Mahesh Viswanathan

发明人： Sabine Deligne , Ramesh A. Gopinath , Dimitri Kanevsky , Mahesh Viswanathan

IPC分类号： G10L15/19 , G10L15/06 , G10L15/183

CPC分类号： G10L15/19 , G10L15/063 , G10L15/183 , G10L2015/0631

摘要： The present invention concerns methods and apparatus for identifying and assigning meaning to words not recognized by a vocabulary or grammar of a speech recognition system. In an embodiment of the invention, the word may be in an acoustic vocabulary of the speech recognition system, but may be unrecognized by an embedded grammar of a language model of the speech recognition system. In another embodiment of the invention, the word may not be recognized by any vocabulary associated with the speech recognition system. In embodiments of the invention, at least one hypothesis is generated for an utterance not recognized by the speech recognition system. If the at least one hypothesis meets at least one predetermined criterion, a sword or more corresponding to the at least one hypothesis is added to the vocabulary of the speech recognition system. In other embodiments of the invention, before adding the word to the vocabulary of the speech recognition system, the at least one hypothesis may be presented to the user of the speech recognition system to determine if that is what the used intended when the user spoke.

6.

发明授权
Systems and methods for predicting consequences of misinterpretation of user commands in automated systems 失效
标题翻译：用于预测自动系统中用户命令误解后果的系统和方法

公开(公告)号：US07729920B2

公开(公告)日：2010-06-01

申请号：US12103560

申请日：2008-04-15

申请人： Jarir Kamel Chaar , Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

发明人： Jarir Kamel Chaar , Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

IPC分类号： G10L15/22

CPC分类号： G10L15/22

摘要： Systems and methods are provided for processing and executing commands in automated systems. For example, command processing systems and methods are provided which can automatically determine, evaluate or otherwise predict consequences of execution of misrecognized or misinterpreted user commands in automated systems and thus prevent undesirable or dangerous consequences that can result from execution of misrecognized/misinterpreted commands.

摘要翻译： 提供了系统和方法来处理和执行自动化系统中的命令。例如，提供了命令处理系统和方法，其可以自动地确定，评估或以其他方式预测在自动化系统中执行错误识别或误解的用户命令的后果，并且因此防止由于执行错误识别/误解的命令而导致的不期望或危险的后果。

7.

发明授权
Systems and methods for predicting consequences of misinterpretation of user commands in automated systems 有权
标题翻译：用于预测自动系统中用户命令误解后果的系统和方法

公开(公告)号：US07437297B2

公开(公告)日：2008-10-14

申请号：US11044316

申请日：2005-01-27

申请人： Jarir Kamel Chaar , Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

发明人： Jarir Kamel Chaar , Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

IPC分类号： G10L15/22

CPC分类号： G10L15/22

摘要： Systems and methods are provided for processing and executing commands in automated systems. For example, command processing systems and methods are provided which can automatically determine, evaluate or otherwise predict consequences of execution of misrecognized or misinterpreted user commands in automated systems and thus prevent undesirable or dangerous consequences that can result from execution of misrecognized/misinterpreted commands.

摘要翻译： 提供了系统和方法来处理和执行自动化系统中的命令。例如，提供了命令处理系统和方法，其可以自动地确定，评估或以其他方式预测在自动化系统中执行错误识别或误解的用户命令的后果，并且因此防止由于执行错误识别/误解的命令而导致的不期望或危险的后果。

8.

发明申请
Biometric Vehicular Emergency Management System 失效
标题翻译：生物识别车辆应急管理系统

公开(公告)号：US20080167757A1

公开(公告)日：2008-07-10

申请号：US11621382

申请日：2007-01-09

申请人： Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

IPC分类号： G06F17/00

CPC分类号： G07C5/085 , G07C5/0816

摘要： Techniques for managing vehicular emergencies are disclosed. For example, a method of managing a vehicular emergency includes the steps of collecting biometric data regarding at least one occupant of a vehicle, collecting data regarding at least one operational characteristic of the vehicle, and detecting vehicular emergencies through analysis of at least a portion of the biometric data and the operational characteristic data. This method may also include communicating at least one message relating to the data, wherein the content of the message is determined by the processing device based at least in part on the data and/or controlling a function of the vehicle in response to the data. The method may also include collecting data regarding at least one operational characteristic of at least one proximate vehicle.

摘要翻译： 公开了用于管理车辆紧急情况的技术。例如，管理车辆紧急情况的方法包括以下步骤：收集关于车辆的至少一个乘员的生物特征数据，收集关于车辆的至少一个操作特征的数据，以及通过分析至少一部分生物特征数据和操作特征数据。该方法还可以包括传达与数据有关的至少一个消息，其中消息的内容至少部分地基于数据和/或响应于数据控制车辆的功能由处理设备确定。该方法还可以包括收集关于至少一个邻近车辆的至少一个操作特征的数据。

9.

发明申请
Methods and Apparatus for Use in Speech Recognition Systems for Identifying Unknown Words and for Adding Previously Unknown Words to Vocabularies and Grammars of Speech Recognition Systems 有权
标题翻译：用于识别未知词语的语音识别系统中使用的方法和装置，以及将以前未知的词添加到语音识别系统的语义和语法

公开(公告)号：US20080270136A1

公开(公告)日：2008-10-30

申请号：US12133762

申请日：2008-06-05

申请人： Sabine Deligne , Ramesh A. Gopinath , Dimitri Kanevsky , Mahesh Viswanathan

发明人： Sabine Deligne , Ramesh A. Gopinath , Dimitri Kanevsky , Mahesh Viswanathan

IPC分类号： G10L15/18

CPC分类号： G10L15/19 , G10L15/063 , G10L15/183 , G10L2015/0631

摘要： The present invention concerns methods and apparatus for identifying and assigning meaning to words not recognized by a vocabulary or grammar of a speech recognition system. In an embodiment of the invention, the word may be in an acoustic vocabulary of the speech recognition system, but may be unrecognized by an embedded grammar of a language model of the speech recognition system. In another embodiment of the invention, the word may not be recognized by any vocabulary associated with the speech recognition system. In embodiments of the invention, at least one hypothesis is generated for an utterance not recognized by the speech recognition system. If the at least one hypothesis meets at least one predetermined criterion, a sword or more corresponding to the at least one hypothesis is added to the vocabulary of the speech recognition system. In other embodiments of the invention, before adding the word to the vocabulary of the speech recognition system, the at least one hypothesis may be presented to the user of the speech recognition system to determine if that is what the used intended when the user spoke.

摘要翻译： 本发明涉及用于识别和分配对语音识别系统的词汇或语法不被识别的词语的含义的方法和装置。在本发明的一个实施例中，该词可以在语音识别系统的声学词汇中，但是可能由语音识别系统的语言模型的嵌入语法无法识别。在本发明的另一个实施例中，该词可能不被与语音识别系统相关联的任何词汇识别。在本发明的实施例中，为语音识别系统未识别的话语生成至少一个假设。如果所述至少一个假设满足至少一个预定标准，则将与所述至少一个假设相对应的剑或更多的剑添加到所述语音识别系统的词汇表。在本发明的其他实施例中，在将单词添加到语音识别系统的词汇表之前，可以将该至少一个假设呈现给语音识别系统的用户，以确定当用户说话时所使用的意图是什么。

10.

发明授权
Processing user input in accordance with input types accepted by an application 有权
标题翻译：根据应用程序接受的输入类型处理用户输入

公开(公告)号：US08370163B2

公开(公告)日：2013-02-05

申请号：US13242874

申请日：2011-09-23

申请人： Alexander Faisman , Dimitri Kanevsky , David Nahamoo , Roberto Sicconi , Mahesh Viswanathan

发明人： Alexander Faisman , Dimitri Kanevsky , David Nahamoo , Roberto Sicconi , Mahesh Viswanathan

IPC分类号： G10L21/00 , G10L15/00 , G10L15/04

CPC分类号： G10L15/24 , G06F3/167 , G10L15/22

摘要： In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.

摘要翻译： 在语音处理系统中，从多个模态输入设备接收多模态请求，并且运行所请求的应用以向用户提供多模态请求的反馈。在语音处理系统中，提供了多模聚合单元，其接收来自多个模态输入设备的多模式输入，并且基于在时间约束内的多模式输入的交互人体工程学的解释来将聚合结果提供给应用控制的多模态输入。因此，在时间窗口内识别来自用户的多模式输入。对多模式输入的相互作用人体工程学的解释包括交互生物特征和交互机制度量的解释，其中至少一种模态的交互输入可以用于给另一种模态的至少一个其他输入带来意义。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类