专利检索 ap:("Ellen Eide" OR "Raul Fernandez" OR "John Pitrelli" OR "Mahesh Viswanathan") AND inv:"Mahesh Viswanathan" 第 3 页

21.

发明申请
HIERARCHICAL METHODS AND APPARATUS FOR EXTRACTING USER INTENT FROM SPOKEN UTTERANCES 有权
标题翻译：提取用户信息的分类方法和设备

公开(公告)号：US20130006637A1

公开(公告)日：2013-01-03

申请号：US13564596

申请日：2012-08-01

申请人： Dimitri Kanevsky , Joseph Simon Reisinger , Robert Sicconi , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Joseph Simon Reisinger , Robert Sicconi , Mahesh Viswanathan

IPC分类号： G10L15/04

CPC分类号： G10L15/1815 , G10L15/1822 , G10L2015/226

摘要： Improved techniques are disclosed for permitting a user to employ more human-based grammar (i.e., free form or conversational input) while addressing a target system via a voice system. For example, a technique for determining intent associated with a spoken utterance of a user comprises the following steps/operations. Decoded speech uttered by the user is obtained. An intent is then extracted from the decoded speech uttered by the user. The intent is extracted in an iterative manner such that a first class is determined after a first iteration and a sub-class of the first class is determined after a second iteration. The first class and the sub-class of the first class are hierarchically indicative of the intent of the user, e.g., a target and data that may be associated with the target.

摘要翻译： 公开了改进的技术，以允许用户在经由语音系统寻址目标系统的同时采用更多的基于人的语法（即，自由形式或会话输入）。例如，用于确定与用户的口语发音相关联的意图的技术包括以下步骤/操作。获得用户发出的解码的语音。然后从用户发出的解码语音中提取意图。以迭代方式提取意图，使得在第一次迭代之后确定第一类并且在第二次迭代之后确定第一类的子类。第一类的第一类和子类分层地指示用户的意图，例如可以与目标相关联的目标和数据。

22.

发明授权
Systems and methods for intelligent control of microphones for speech recognition applications 失效
标题翻译：用于语音识别应用的麦克风智能控制的系统和方法

公开(公告)号：US08140325B2

公开(公告)日：2012-03-20

申请号：US11619793

申请日：2007-01-04

申请人： Dimitri Kanevsky , Mahesh Viswanathan , David Nahamoo , Roberto Sicconi

发明人： Dimitri Kanevsky , Mahesh Viswanathan , David Nahamoo , Roberto Sicconi

IPC分类号： G10L21/02

CPC分类号： G10L15/24 , G10L15/20 , G10L15/28

摘要： Systems and methods for intelligent control of microphones in speech processing applications, which allows the capturing, recording and preprocessing of speech data in the captured audio in a way that optimizes speech decoding accuracy.

摘要翻译： 用于智能控制语音处理应用中的麦克风的系统和方法，其允许以优化语音解码精度的方式在所捕获的音频中捕获，记录和预处理语音数据。

23.

发明授权
Masking related sensitive data in groups 失效
标题翻译：分组掩蔽相关敏感数据

公开(公告)号：US07877398B2

公开(公告)日：2011-01-25

申请号：US11942081

申请日：2007-11-19

申请人： William Kroeschel , Arvind Sathi , Mahesh Viswanathan

发明人： William Kroeschel , Arvind Sathi , Mahesh Viswanathan

IPC分类号： G06F17/30

CPC分类号： G06F21/6254

摘要： A method and system of masking a group of related data values. A record in an unmasked data file of n records is read. The record includes a first set of data values of data elements included in a related data group (RDG) and one or more data values of one or more data elements external to the RDG. A random number k is received. A second set of data values is retrieved from a lookup table that associates n key values with n sets of data values. Retrieving the second set of data values includes identifying that the second set of data values is associated with a key value of k. The n sets of data values are included in the umnasked data file's n records. The record is masked by replacing the first set of data values with the retrieved second set of data values.

摘要翻译： 一组屏蔽相关数据值的方法和系统。读取n个记录的未屏蔽数据文件中的记录。记录包括包括在相关数据组（RDG）中的数据元素的第一组数据值和RDG外部的一个或多个数据元素的一个或多个数据值。接收随机数k。从查找表中检索第二组数据值，该查找表将n个键值与n组数据值相关联。检索第二组数据值包括识别第二组数据值与k的键值相关联。 n个数据值包含在已记录的数据文件的n个记录中。通过用检索到的第二组数据值替换第一组数据值来掩蔽该记录。

24.

发明申请
MACHINE, SYSTEM AND METHOD FOR USER-GUIDED TEACHING AND MODIFYING OF VOICE COMMANDS AND ACTIONS EXECUTED BY A CONVERSATIONAL LEARNING SYSTEM 有权
标题翻译：用户指导教学的机器，系统和方法以及对话学习系统执行的语音命令和行为的改进

公开(公告)号：US20100185445A1

公开(公告)日：2010-07-22

申请号：US12357342

申请日：2009-01-21

申请人： Liam D. Comerford , Mahesh Viswanathan

发明人： Liam D. Comerford , Mahesh Viswanathan

IPC分类号： G10L15/04 , G10L21/00

CPC分类号： G10L15/06 , G10L15/22 , G10L2015/0631 , G10L2015/223

摘要： A machine, system and method for user-guided teaching and modifications of voice commands and actions to be executed by a conversational learning system. The machine includes a system bus for communicating data and control signals received from the conversational learning system to a computer system, a vehicle data and control bus for connecting devices and sensors in the machine, a bridge module for connecting the vehicle data and control bus to the system bus, machine subsystems coupled to the vehicle data and control bus having a respective user interface for receiving a voice command or input signal from a user, a memory coupled to the system bus for storing action command sequences learned for a new voice command and a processing unit coupled to the system bus for automatically executing the action command sequences learned when the new voice command is spoken.

摘要翻译： 用于用户指导教学和修改由会话学习系统执行的语音命令和动作的机器，系统和方法。该机器包括用于将从对话学习系统接收的数据和控制信号传送到计算机系统的系统总线，用于连接机器中的设备和传感器的车辆数据和控制总线，用于将车辆数据和控制总线连接到系统总线，耦合到车辆数据和控制总线的机器子系统具有用于从用户接收语音命令或输入信号的相应用户界面;耦合到系统总线的存储器，用于存储为新语音命令学习的动作命令序列;以及耦合到系统总线的处理单元，用于当说出新的语音命令时自动执行学习的动作命令序列。

25.

发明授权
Methods and apparatus for tracking speakers in an audio stream 失效
标题翻译：用于跟踪音频流中的扬声器的方法和装置

公开(公告)号：US07739114B1

公开(公告)日：2010-06-15

申请号：US09345238

申请日：1999-06-30

申请人： Scott Shaobing Chen , Alain Charles Louis Tritschler , Mahesh Viswanathan

发明人： Scott Shaobing Chen , Alain Charles Louis Tritschler , Mahesh Viswanathan

IPC分类号： G10L21/00

CPC分类号： G10L21/0272 , G10L17/00

摘要： Speakers are automatically identified in an audio (or video) source. The audio information is processed to identify potential segment boundaries. Homogeneous segments are clustered substantially concurrently with the segmentation routine, and a cluster identifier is assigned to each identified segment. A segmentation subroutine identifies potential segment boundaries using the BIC model selection criterion. A clustering subroutine uses a BIC model selection criterion to assign a cluster identifier to each of the identified segments. If the difference of BIC values for each model is positive, the two clusters are merged.

摘要翻译： 音频（或视频）源自动识别扬声器。处理音频信息以识别潜在的分段边界。均匀片段与分割程序基本上同时聚类，并且将簇标识符分配给每个识别的片段。分割子程序使用BIC模型选择标准来识别潜在的分段边界。聚类子程序使用BIC模型选择标准来将簇标识符分配给每个识别的段。如果每个模型的BIC值的差异为正，则将两个聚类合并。

26.

发明申请
MULTI-HOP NATURAL LANGUAGE TRANSLATION 审中-公开
标题翻译：多语言自然语言翻译

公开(公告)号：US20090132230A1

公开(公告)日：2009-05-21

申请号：US11940748

申请日：2007-11-15

申请人： Dimitri Kanevsky , Bhuvana Ramabhadran , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Bhuvana Ramabhadran , Mahesh Viswanathan

IPC分类号： G06F17/28

CPC分类号： G06F17/289 , G06Q10/06 , G06Q10/10

摘要： Illustrative embodiments provide a computer implemented method and apparatus, in the form of a data processing system, and a computer program product for optimizing a natural language translation. In one illustrative embodiment, the computer implemented method comprises receiving a request from a requester, wherein the request comprises source language data, an indication of a source language and a destination language, and determining whether a translation between the source language and the destination language is needed. Identifying a mapping between the source language and the destination language includes a set of hops, the method, responsive to a determination that the translation is needed, translates the source language data into a destination language data associated with each successive hop in the set of hops in the mapping and returns the destination language data to the requester at a destination hop.

摘要翻译： 说明性实施例提供了数据处理系统形式的计算机实现的方法和装置，以及用于优化自然语言翻译的计算机程序产品。在一个说明性实施例中，计算机实现的方法包括从请求者接收请求，其中所述请求包括源语言数据，源语言和目的地语言的指示，以及确定源语言和目的地语言之间的翻译是否是需要的识别源语言和目的地语言之间的映射包括一组跳，该方法响应于需要转换的确定，将源语言数据转换为与该组跳数中的每个连续跳跃相关联的目的地语言数据在映射并将目的地语言数据返回给请求者在目的地跳。

27.

发明申请
SYSTEMS AND METHODS FOR INTELLIGENT CONTROL OF MICROPHONES FOR SPEECH RECOGNITION APPLICATIONS 失效
标题翻译：用于语音识别应用的智能智能控制的系统和方法

公开(公告)号：US20080167868A1

公开(公告)日：2008-07-10

申请号：US11619793

申请日：2007-01-04

申请人： Dimitri Kanevsky , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Mahesh Viswanathan

IPC分类号： G10L15/20 , H03G3/00

CPC分类号： G10L15/24 , G10L15/20 , G10L15/28

摘要： Systems and methods for intelligent control of microphones in speech processing applications, which allows the capturing, recording and preprocessing of speech data in the captured audio in a way that optimizes speech decoding accuracy.

摘要翻译： 用于智能控制语音处理应用中的麦克风的系统和方法，其允许以优化语音解码精度的方式在所捕获的音频中捕获，记录和预处理语音数据。

28.

发明授权
Touch gesture based interface for motor vehicle 有权
标题翻译：触摸手势界面的汽车

公开(公告)号：US07295904B2

公开(公告)日：2007-11-13

申请号：US10930225

申请日：2004-08-31

申请人： Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

IPC分类号： G09G5/00 , G06K9/00 , G06F3/00

CPC分类号： G06F3/016 , B60K35/00 , B60K37/06 , B60K2350/1024 , B60K2350/1028 , B60K2350/2013 , G06F3/038 , G06F3/04883 , G06F2203/04809

摘要： An improved apparatus and method is provided for operating devices and systems in a motor vehicle, while at the same time reducing vehicle operator distractions. One or more touch sensitive pads are mounted on the steering wheel of the motor vehicle, and the vehicle operator touches the pads in a pre-specified synchronized pattern, to perform functions such as controlling operation of the radio or adjusting a window. At least some of the touch patterns used to generate different commands may be selected by the vehicle operator. Usefully, the system of touch pad sensors and the signals generated thereby are integrated with speech recognition and/or facial gesture recognition systems, so that commands may be generated by synchronized multi-mode inputs.

摘要翻译： 提供了一种用于在机动车辆中操作装置和系统的改进的装置和方法，同时减少了车辆操作者的干扰。一个或多个触敏垫安装在机动车辆的方向盘上，并且车辆操作者以预先指定的同步模式接触垫，以执行诸如控制无线电操作或调整窗口的功能。用于产生不同命令的至少一些触摸图案可以由车辆操作者选择。有趣的是，触摸板传感器的系统和由此产生的信号与语音识别和/或面部手势识别系统集成，使得命令可以由同步的多模式输入生成。

29.

发明申请
Multimodal aggregating unit 审中-公开
标题翻译：多模聚集单元

公开(公告)号：US20050197843A1

公开(公告)日：2005-09-08

申请号：US10795495

申请日：2004-03-07

申请人： Alexander Faisman , Dimitri Kanevsky , David Nahamoo , Roberto Sicconi , Mahesh Viswanathan

发明人： Alexander Faisman , Dimitri Kanevsky , David Nahamoo , Roberto Sicconi , Mahesh Viswanathan

IPC分类号： G10L11/00

CPC分类号： G10L15/24 , G06F3/167 , G10L15/22

摘要： In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.

摘要翻译： 在语音处理系统中，从多个模态输入设备接收多模态请求，并且运行所请求的应用以向用户提供多模态请求的反馈。在语音处理系统中，提供了多模聚合单元，其接收来自多个模态输入设备的多模式输入，并且基于在时间约束内的多模式输入的交互人体工程学的解释来将聚合结果提供给应用控制的多模态输入。因此，在时间窗口内识别来自用户的多模式输入。对多模式输入的相互作用人体工程学的解释包括交互生物特征和交互机制度量的解释，其中至少一种模态的交互输入可以用于给另一种模态的至少一个其他输入带来意义。

30.

发明申请
Interactive debugging and tuning method for CTTS voice building 有权
标题翻译： CTTS语音建立的交互式调试和调优方法

公开(公告)号：US20050086060A1

公开(公告)日：2005-04-21

申请号：US10688041

申请日：2003-10-17

申请人： Philip Gleason , Maria Smith , Mahesh Viswanathan , Jie Zeng

发明人： Philip Gleason , Maria Smith , Mahesh Viswanathan , Jie Zeng

IPC分类号： G10L11/00 , G10L13/02

CPC分类号： G10L13/033

摘要： A method, a system, and an apparatus for identifying and correcting sources of problems in synthesized speech which is generated using a concatenative text-to-speech (CTTS) technique. The method can include the step of displaying a waveform corresponding to synthesized speech generated from concatenated phonetic units. The synthesized speech can be generated from text input received from a user. The method further can include the step of displaying parameters corresponding to at least one of the phonetic units. The method can include the step of displaying the original recordings containing selected phonetic units. An editing input can be received from the user and the parameters can be adjusted in accordance with the editing input.

摘要翻译： 一种用于识别和校正使用连续文本到语音（CTTS）技术产生的合成语音中的问题源的方法，系统和装置。该方法可以包括显示对应于从拼接语音单元产生的合成语音的波形的步骤。可以从从用户接收的文本输入生成合成语音。该方法还可以包括显示与至少一个语音单元对应的参数的步骤。该方法可以包括显示包含所选语音单元的原始记录的步骤。可以从用户接收编辑输入，并且可以根据编辑输入来调整参数。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类