专利检索 ap:("Sasha P. Caskey" OR "Dimitri Kanevsky" OR "Brian Kingsbury" OR "Tara N. Sainath" OR "George Saon") AND inv:"Dimitri Kanevsky" 第 6 页

51.

发明申请
SPARSE REPRESENTATION FEATURES FOR SPEECH RECOGNITION 有权
标题翻译：用于语音识别的小数代表特征

公开(公告)号：US20120078621A1

公开(公告)日：2012-03-29

申请号：US12889845

申请日：2010-09-24

申请人： Dimitri Kanevsky , David Nahamoo , Bhuvana Ramabhadran , Tara N. Sainath

发明人： Dimitri Kanevsky , David Nahamoo , Bhuvana Ramabhadran , Tara N. Sainath

IPC分类号： G10L15/00

CPC分类号： G10L15/02

摘要： Techniques are disclosed for generating and using sparse representation features to improve speech recognition performance. In particular, principles of the invention provide sparse representation exemplar-based recognition techniques. For example, a method comprises the following steps. A test vector and a training data set associated with a speech recognition system are obtained. A subset of the training data set is selected. The test vector is mapped with the selected subset of the training data set as a linear combination that is weighted by a sparseness constraint such that a new test feature set is formed wherein the training data set is moved more closely to the test vector subject to the sparseness constraint. An acoustic model is trained on the new test feature set.The acoustic model trained on the new test feature set may be used to decode user speech input to the speech recognition system.

摘要翻译： 公开了用于生成和使用稀疏表示特征以改善语音识别性能的技术。特别地，本发明的原理提供了基于示例的稀疏表示识别技术。例如，一种方法包括以下步骤。获得与语音识别系统相关联的测试向量和训练数据集。选择训练数据集的子集。将测试向量与所选择的训练数据集的子集映射为由稀疏约束加权的线性组合，使得形成新的测试特征集合，其中训练数据集更接近地移动到受测对象的测试向量稀疏约束在新的测试功能集上训练声学模型。在新测试特征集上训练的声学模型可以用于解码输入到语音识别系统的用户语音。

52.

发明申请
MANAGING ENCOUNTERS WITH PERSONS 有权
标题翻译：管理人员与人

公开(公告)号：US20120053980A1

公开(公告)日：2012-03-01

申请号：US12872042

申请日：2010-08-31

申请人： Sarah H. Basson , Dimitri Kanevsky , Clifford Alan Pickover , Tara N. Sainath

发明人： Sarah H. Basson , Dimitri Kanevsky , Clifford Alan Pickover , Tara N. Sainath

IPC分类号： G06Q10/00

CPC分类号： G06Q10/06316

摘要： Techniques are disclosed for facilitating coordination of user activities in accordance with information processing systems and, more particularly, to techniques for managing encounters with persons using such information processing systems. For example, a method for facilitating user coordination of one or more activities comprises the following steps. User personal preference input for managing an encounter with at least one other person is accepted. Input of at least one user schedule entry is received. Schedule entries of the at least one other person are evaluated and it is automatically determined whether there is an overlap between the at least one user schedule entry and the schedule entries of the at least one other person. A response to a determined overlap is automatically determined. The user personal preference input may comprise an indication of whether the user wishes to avoid an encounter with the at least one other person or coordinate an encounter with the at least one other person.

摘要翻译： 公开了用于促进根据信息处理系统的用户活动的协调的技术，更具体地，涉及用于管理与使用这种信息处理系统的人的遭遇的技术。例如，用于促进一个或多个活动的用户协调的方法包括以下步骤。用于管理与至少一个其他人的遭遇的用户个人偏好输入被接受。接收至少一个用户日程表条目的输入。对至少一个其他人的计划条目进行评估，并且自动确定该至少一个用户日程表条目和该至少一个其他人的日程表条目之间是否存在重叠。自动确定对确定的重叠的响应。用户个人偏好输入可以包括用户是否希望避免与至少一个其他人的遭遇或协调与至少一个其他人的遭遇的指示。

53.

发明申请
WINDOW DISPLAY MANAGEMENT IN A GRAPHICAL USER INTERFACE 审中-公开
标题翻译：窗口显示管理在图形用户界面

公开(公告)号：US20110283226A1

公开(公告)日：2011-11-17

申请号：US12780899

申请日：2010-05-15

申请人： Sara H. Basson , Dimitri Kanevsky , Clifford A. Pickover , Tara N. Sainath

发明人： Sara H. Basson , Dimitri Kanevsky , Clifford A. Pickover , Tara N. Sainath

IPC分类号： G06F3/048

CPC分类号： G06F3/0481 , G06F9/453

摘要： The present invention provides a computer implemented method and data processing system for effectively presenting popup and related windows on a computer GUI. An example system may include a computer processor coupled to the computer readable memory. The computer processor is configured to receive content of a new window for display in the display screen, perform a text analysis on the content of the new window to determine a relevance of the new window to the user, and determine a display position of the new window on the display screen based on the relevance of the new window to the user and a cursor position in the GUI displaying keyboard input such that the new window is displayed on the display screen at the determined display position.

摘要翻译： 本发明提供了一种用于在计算机GUI上有效地呈现弹出窗口和相关窗口的计算机实现的方法和数据处理系统。示例系统可以包括耦合到计算机可读存储器的计算机处理器。计算机处理器被配置为接收用于在显示屏幕中显示的新窗口的内容，对新窗口的内容执行文本分析以确定新窗口与用户的相关性，并且确定新窗口的显示位置基于新窗口与用户的相关性的显示屏幕上的窗口和GUI中显示键盘输入的光标位置，使得新窗口在所确定的显示位置显示在显示屏幕上。

54.

发明授权
Methods and apparatus for use in speech recognition systems for identifying unknown words and for adding previously unknown words to vocabularies and grammars of speech recognition systems 有权

公开(公告)号：US09754586B2

公开(公告)日：2017-09-05

申请号：US12133762

申请日：2008-06-05

申请人： Sabine Deligne , Ramesh A. Gopinath , Dimitri Kanevsky , Mahesh Viswanathan

发明人： Sabine Deligne , Ramesh A. Gopinath , Dimitri Kanevsky , Mahesh Viswanathan

IPC分类号： G10L15/19 , G10L15/06 , G10L15/183

CPC分类号： G10L15/19 , G10L15/063 , G10L15/183 , G10L2015/0631

摘要： The present invention concerns methods and apparatus for identifying and assigning meaning to words not recognized by a vocabulary or grammar of a speech recognition system. In an embodiment of the invention, the word may be in an acoustic vocabulary of the speech recognition system, but may be unrecognized by an embedded grammar of a language model of the speech recognition system. In another embodiment of the invention, the word may not be recognized by any vocabulary associated with the speech recognition system. In embodiments of the invention, at least one hypothesis is generated for an utterance not recognized by the speech recognition system. If the at least one hypothesis meets at least one predetermined criterion, a sword or more corresponding to the at least one hypothesis is added to the vocabulary of the speech recognition system. In other embodiments of the invention, before adding the word to the vocabulary of the speech recognition system, the at least one hypothesis may be presented to the user of the speech recognition system to determine if that is what the used intended when the user spoke.

55.

发明授权
Forced/predictable adaptation for speech recognition 有权
标题翻译：强制/可预测的语音识别适应

公开(公告)号：US08838448B2

公开(公告)日：2014-09-16

申请号：US13440176

申请日：2012-04-05

申请人： Dan Ning Jiang , Vaibhava Goel , Dimitri Kanevsky , Yong Qin

发明人： Dan Ning Jiang , Vaibhava Goel , Dimitri Kanevsky , Yong Qin

IPC分类号： G10L15/06 , G10L15/00

CPC分类号： G10L15/07

摘要： A method is described for use with automatic speech recognition using discriminative criteria for speaker adaptation. An adaptation evaluation is performed of speech recognition performance data for speech recognition system users. Adaptation candidate users are identified based on the adaptation evaluation for whom an adaptation process is likely to improve system performance.

摘要翻译： 描述了一种使用自动语音识别的方法，使用用于说话者适应的歧视性标准。对语音识别系统用户的语音识别性能数据进行适应评估。适应候选用户是根据对自适应过程有可能提高系统性能的适应性评估来确定的。

56.

发明授权
Method and computer program for securely storing data 有权
标题翻译：用于安全存储数据的方法和计算机程序

公开(公告)号：US08694801B2

公开(公告)日：2014-04-08

申请号：US13557690

申请日：2012-07-25

申请人： Ossama Emam , Genady Grabarnik , Dimitri Kanevsky , Alexander Zlatsin

发明人： Ossama Emam , Genady Grabarnik , Dimitri Kanevsky , Alexander Zlatsin

IPC分类号： G06F11/30 , G06F12/14 , H04L29/06 , G06F15/173

CPC分类号： H04L63/08 , G06F11/1076 , G06F17/30194 , G06F21/6227 , H04L63/0428 , H04L67/288 , H04L67/325

摘要： A method of securely storing data comprising the steps of: dividing the data into a plurality of secure components; encrypting the secure components; moving each secure component to a different location which is substantially inaccessible to an unauthorized request; storing the secure components at the different locations for a period of time; repeating the moving and storing steps; moving all of the secure components to a single location in response to an authorized request; decrypting each of the secure components; and assembling the plurality of secure components to reconstruct the original data.

摘要翻译： 一种安全地存储数据的方法，包括以下步骤：将数据分成多个安全组件; 加密安全组件; 将每个安全组件移动到对未授权请求基本不可访问的不同位置; 将安全组件存储在不同位置一段时间; 重复移动和存储步骤; 响应于授权请求将所有安全组件移动到单个位置; 解密每个安全组件; 以及组装所述多个安全组件以重建所述原始数据。

57.

发明申请
MULTIPLE AUDIO/VIDEO DATA STREAM SIMULATION 失效
标题翻译：多个音频/视频数据流模拟

公开(公告)号：US20120246669A1

公开(公告)日：2012-09-27

申请号：US13484320

申请日：2012-05-31

申请人： Sara H. Basson , Dimitri Kanevsky , Edward Emile Kelley , Bhuvana Ramabhadran

发明人： Sara H. Basson , Dimitri Kanevsky , Edward Emile Kelley , Bhuvana Ramabhadran

IPC分类号： H04N21/24

CPC分类号： G10L17/26 , G10L19/167

摘要： A multiple audio/video data stream simulation method and system. A computing system receives first audio and/or video data streams. The first audio and/or video data streams include data associated with a first person and a second person. The computing system monitors the first audio and/or video data streams. The computing system identifies emotional attributes comprised by the first audio and/or video data streams. The computing system generates second audio and/or video data streams associated with the first audio and/or video data streams. The second audio and/or video data streams include the first audio and/or video data streams data without the emotional attributes. The computing system stores the second audio and/or video data streams.

摘要翻译： 多音频/视频数据流仿真方法和系统。计算系统接收第一音频和/或视频数据流。第一音频和/或视频数据流包括与第一人和第二人有关的数据。计算系统监视第一音频和/或视频数据流。计算系统识别由第一音频和/或视频数据流组成的情感属性。计算系统产生与第一音频和/或视频数据流相关联的第二音频和/或视频数据流。第二音频和/或视频数据流包括没有情绪属性的第一音频和/或视频数据流数据。计算系统存储第二音频和/或视频数据流。

58.

发明申请
MULTIPLE AUDIO/VIDEO DATA STREAM SIMULATION 失效

公开(公告)号：US20120239393A1

公开(公告)日：2012-09-20

申请号：US13484323

申请日：2012-05-31

申请人： Sara H. Basson , Dimitri Kanevsky , Edward Emile Kelley , Bhuvana Ramabhadran

发明人： Sara H. Basson , Dimitri Kanevsky , Edward Emile Kelley , Bhuvana Ramabhadran

IPC分类号： G10L15/00

CPC分类号： G10L17/26 , G10L19/167

摘要： A multiple audio/video data stream simulation method and system. A computing system receives first audio and/or video data streams. The first audio and/or video data streams include data associated with a first person and a second person. The computing system monitors the first audio and/or video data streams. The computing system identifies emotional attributes comprised by the first audio and/or video data streams. The computing system generates second audio and/or video data streams associated with the first audio and/or video data streams. The second audio and/or video data streams include the first audio and/or video data streams data without the emotional attributes. The computing system stores the second audio and/or video data streams.

59.

发明申请
SHARING GPS NAVIGATION INFORMATION 审中-公开
标题翻译：共享GPS导航信息

公开(公告)号：US20120221243A1

公开(公告)日：2012-08-30

申请号：US13466309

申请日：2012-05-08

申请人： Sara H. Basson , Peter Gustav Fairweather , Dimitri Kanevsky , Edward Emile Kelley

发明人： Sara H. Basson , Peter Gustav Fairweather , Dimitri Kanevsky , Edward Emile Kelley

IPC分类号： G01C21/00

CPC分类号： G01C21/00 , G01C21/34 , G01C21/3626 , G01C21/3679

摘要： A host application on a host computer system receives annotations made by drivers of respective navigation information displayed to the drivers by GPS devices in vehicles of the respective drivers. The host application saves the annotated navigation information for the respective drivers on a computer readable memory accessible by the host application. The host application receives a request from a first one of the drivers for annotated navigation information made by one or more of the other drivers. Responsive to the request, the host application selects one or more items of the saved annotated navigation information. The host application sends the selected one or more items of the saved annotated navigation information to the GPS device of the first one of the drivers.

摘要翻译： 主机计算机系统上的主机应用程序接收由相应驱动器的车辆中的GPS设备向驾驶员显示的各个导航信息的驾驶员制作的注释。主机应用程序将相应驱动程序的注释导航信息保存在主机应用程序可访问的计算机可读存储器上。主机应用程序从第一个驱动程序接收由一个或多个其他驱动程序制作的注释导航信息的请求。响应于该请求，主机应用程序选择一个或多个保存的注释导航信息项。主机应用程序将所选择的一个或多个保存的注释导航信息项发送到第一个驱动程序的GPS设备。

60.

发明申请
VERBAL DESCRIPTION 有权
标题翻译： VERBAL说明

公开(公告)号：US20120188446A1

公开(公告)日：2012-07-26

申请号：US13433702

申请日：2012-03-29

申请人： Sara H. Basson , Brian Reginald Heasman , Dimitri Kanevsky , Edward Emile Kelley , Bhuvana Ramabhadran

发明人： Sara H. Basson , Brian Reginald Heasman , Dimitri Kanevsky , Edward Emile Kelley , Bhuvana Ramabhadran

IPC分类号： H04N9/475

CPC分类号： H04N21/435 , H04N21/235 , H04N21/2368 , H04N21/4305 , H04N21/8133 , H04N21/8146

摘要： A verbal description method and system. A computing system broadcasts first audio data and video data associated with the first audio data. The computing system determines that the video data comprises a graphic without a description in the first audio data. The computing system receives audible description data associated with the graphic. The computing system generates second audio data comprising the first audio data and the audible description data. The computing system synchronizes portions of the second audio data with associated portions of the video data. The computing system generates synchronized audio/video data comprising the portions of said second audio data aligned with the associated portions of said video data. The computing system broadcasts the synchronized audio/video data.

摘要翻译： 口头描述方法和系统。计算系统广播与第一音频数据相关联的第一音频数据和视频数据。计算系统确定视频数据包括在第一音频数据中没有描述的图形。计算系统接收与该图形相关联的声音描述数据。计算系统产生包括第一音频数据和可听说明数据的第二音频数据。计算系统将第二音频数据的部分与视频数据的相关部分同步。计算系统产生同步的音频/视频数据，该数据包括与所述视频数据的相关部分对准的所述第二音频数据的部分。计算系统广播同步的音频/视频数据。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类