专利检索 ap:("Genady Grabarmik" OR "Dimitri Kanevsky" OR "Debanjan Saha" OR "Larisa Shwartz") AND inv:"Dimitri Kanevsky" 第 9 页

81.

发明申请
Phonetic Features for Speech Recognition 有权
标题翻译：语音识别的语音特征

公开(公告)号：US20120221333A1

公开(公告)日：2012-08-30

申请号：US13034293

申请日：2011-02-24

申请人： Dimitri Kanevsky , David Nahamoo , Bhuvana Ramabhadran , Tara N. Sainath

发明人： Dimitri Kanevsky , David Nahamoo , Bhuvana Ramabhadran , Tara N. Sainath

IPC分类号： G10L15/06

CPC分类号： G10L15/063 , G10L15/02 , G10L15/083 , G10L2015/025 , G10L2015/0631

摘要： Techniques are disclosed for using phonetic features for speech recognition. For example, a method comprises the steps of obtaining a first dictionary and a training data set associated with a speech recognition system, computing one or more support parameters from the training data set, transforming the first dictionary into a second dictionary, wherein the second dictionary is a function of one or more phonetic labels of the first dictionary, and using the one or more support parameters to select one or more samples from the second dictionary to create a set of one or more exemplar-based class identification features for a pattern recognition task.

摘要翻译： 公开了使用语音特征进行语音识别的技术。例如，一种方法包括以下步骤：获得与语音识别系统相关联的第一字典和训练数据集，从训练数据集计算一个或多个支持参数，将第一字典变换为第二字典，其中第二字典是第一字典的一个或多个语音标签的功能，并且使用一个或多个支持参数从第二字典中选择一个或多个样本，以创建用于模式识别的一个或多个基于样本的类识别特征的集合任务。

82.

发明申请
Virtual Communication Techniques 有权
标题翻译：虚拟通信技术

公开(公告)号：US20120215843A1

公开(公告)日：2012-08-23

申请号：US13030268

申请日：2011-02-18

申请人： Sameer Maskey , Sara H. Basson , Dimitri Kanevsky , Tara N. Sainath

发明人： Sameer Maskey , Sara H. Basson , Dimitri Kanevsky , Tara N. Sainath

IPC分类号： G06F15/16 , G06F17/27

CPC分类号： G06Q10/10

摘要： Techniques for facilitating communication are provided. The techniques include using a machine-to-machine communication to facilitate communication between one or more human users of a communicator device and a compatible communicator device, wherein using the machine-to-machine communication to facilitate communication between one or more human users comprises initiating a machine-to-machine communication with a compatible communicator device if the device is within the geographic proximity, wherein the machine-to-machine communication incorporates one or more related items from a user profile of each device automatically extracted by the device initiating the machine-to-machine communication, and conducting the machine-to-machine communication in a manner in which the communication can be monitored by the one or more human users.

摘要翻译： 提供了促进通信的技术。这些技术包括使用机器对机器通信来促进通信器设备的一个或多个人类用户和兼容通信器设备之间的通信，其中使用机器到机器通信来促进一个或多个人类用户之间的通信包括启动如果所述设备在所述地理邻近范围内，则与所述兼容通信器设备进行机对机通信，其中所述机器对机器通信包括来自启动所述机器的设备自动提取的每个设备的用户简档中的一个或多个相关项目机器间通信，并且以可以由一个或多个人类用户监视通信的方式进行机器到机器通信。

83.

发明授权
Simulation method and system 失效
标题翻译：仿真方法和系统

公开(公告)号：US08237742B2

公开(公告)日：2012-08-07

申请号：US12137606

申请日：2008-06-12

申请人： Sara H. Basson , Dimitri Kanevsky , Edward Emile Kelley , Bhuvana Ramabhadran

发明人： Sara H. Basson , Dimitri Kanevsky , Edward Emile Kelley , Bhuvana Ramabhadran

IPC分类号： G09G5/00 , G09G5/36

CPC分类号： G10L25/63 , G06F19/00 , G10L15/25 , G10L21/00 , G10L21/06 , G10L2021/0135 , G10L2021/065 , G11B27/028 , G11B27/034 , G11B27/036 , G11B27/28 , G16H50/50 , H04N21/44213

摘要： A simulation method and system. A computing system receives a first audio and/or video data stream. The first audio and/or video data stream includes data associated with a first person. The computing system monitors the first audio and/or video data stream. The computing system identifies emotional attributes comprised by the first audio and/or video data stream. The computing system generates a second audio and/or video data stream associated with the first audio and/or video data stream. The second audio and/or video data stream includes the data without the emotional attributes. The computing system stores the second audio and/or video data stream.

摘要翻译： 一种模拟方法和系统。计算系统接收第一音频和/或视频数据流。第一音频和/或视频数据流包括与第一人相关联的数据。计算系统监视第一音频和/或视频数据流。计算系统识别由第一音频和/或视频数据流组成的情感属性。计算系统生成与第一音频和/或视频数据流相关联的第二音频和/或视频数据流。第二音频和/或视频数据流包括没有情感属性的数据。计算系统存储第二音频和/或视频数据流。

84.

发明申请
MULTIMODAL AGGREGATING UNIT 有权

公开(公告)号：US20120046945A1

公开(公告)日：2012-02-23

申请号：US13242538

申请日：2011-09-23

申请人： Alexander Faisman , Dimitri Kanevsky , David Nahamoo , Roberto Sicconi , Mahesh Viswanathan

发明人： Alexander Faisman , Dimitri Kanevsky , David Nahamoo , Roberto Sicconi , Mahesh Viswanathan

IPC分类号： G10L21/00

CPC分类号： G10L15/24 , G06F3/167 , G10L15/22

摘要： In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.

85.

发明申请
MULTIMODAL AGGREGATING UNIT 有权
标题翻译：多模式聚集单元

公开(公告)号：US20120044183A1

公开(公告)日：2012-02-23

申请号：US13242874

申请日：2011-09-23

申请人： Alexander Faisman , Dimitri Kanevsky , David Nahamoo , Roberto Sicconi , Mahesh Viswanathan

发明人： Alexander Faisman , Dimitri Kanevsky , David Nahamoo , Roberto Sicconi , Mahesh Viswanathan

IPC分类号： G09G5/00 , G06F3/041

CPC分类号： G10L15/24 , G06F3/167 , G10L15/22

摘要： In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.

摘要翻译： 在语音处理系统中，从多个模态输入设备接收多模态请求，并且运行所请求的应用以向用户提供多模态请求的反馈。在语音处理系统中，提供了多模聚合单元，其接收来自多个模态输入设备的多模式输入，并且基于在时间约束内的多模式输入的交互人体工程学的解释来将聚合结果提供给应用控制的多模态输入。因此，在时间窗口内识别来自用户的多模式输入。对多模式输入的相互作用人体工程学的解释包括交互生物特征和交互机制度量的解释，其中至少一种模态的交互输入可以用于给另一种模态的至少一个其他输入带来意义。

86.

发明申请
Modification of Speech Quality in Conversations Over Voice Channels 审中-公开
标题翻译：语音通话对话中语音质量的修改

公开(公告)号：US20120016674A1

公开(公告)日：2012-01-19

申请号：US12838103

申请日：2010-07-16

申请人： Sarah H. Basson , Dimitri Kanevsky , David Nahamoo , Tara N. Sainath

发明人： Sarah H. Basson , Dimitri Kanevsky , David Nahamoo , Tara N. Sainath

IPC分类号： G10L13/00

CPC分类号： G10L19/0018 , G10L2021/0135

摘要： Techniques are disclosed for modifying speech quality in a conversation over a voice channel. For example, a method for modifying a speech quality associated with a spoken utterance transmittable over a voice channel comprises the following steps. The spoken utterance is obtained prior to an intended recipient of the spoken utterance receiving the spoken utterance. An existing speech quality of the spoken utterance is determined. The existing speech quality of the spoken utterance is compared to at least one desired speech quality associated with at least one previously obtained spoken utterance to determine whether the existing speech quality substantially matches the desired speech quality. At least one characteristic of the spoken utterance is modified to change the existing speech quality of the spoken utterance to the desired speech quality when the existing speech quality does not substantially match the desired speech quality. The spoken utterance is presented with the desired speech quality to the intended recipient.

摘要翻译： 公开了用于通过语音信道修改会话中的语音质量的技术。例如，用于修改与可通过语音信道传输的口语话语相关联的语音质量的方法包括以下步骤。口语发音是在接受口语发音的口语发音之前获得的。确定说话话语的现有语音质量。将口语发音的现有语音质量与与至少一个先前获得的口语话语相关联的至少一个期望语音质量进行比较，以确定现有语音质量是否与所需语音质量基本匹配。修改口语发音的至少一个特征，以便当现有语音质量基本上不符合期望的语音质量时，将口语发音的现有语音质量改变为所需语音质量。讲话话语以期望的语音质量呈现给预期的接收者。

87.

发明授权
Biometric vehicular emergency management system 失效
标题翻译：生物识别车辆应急管理系统

公开(公告)号：US08085139B2

公开(公告)日：2011-12-27

申请号：US11621382

申请日：2007-01-09

申请人： Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

发明人： Dimitri Kanevsky , Roberto Sicconi , Mahesh Viswanathan

IPC分类号： B60Q1/00

CPC分类号： G07C5/085 , G07C5/0816

摘要： Techniques for managing vehicular emergencies are disclosed. For example, a method of managing a vehicular emergency includes the steps of collecting biometric data regarding at least one occupant of a vehicle, collecting data regarding at least one operational characteristic of the vehicle, and detecting vehicular emergencies through analysis of at least a portion of the biometric data and the operational characteristic data. This method may also include communicating at least one message relating to the data, wherein the content of the message is determined by the processing device based at least in part on the data and/or controlling a function of the vehicle in response to the data. The method may also include collecting data regarding at least one operational characteristic of at least one proximate vehicle.

摘要翻译： 公开了用于管理车辆紧急情况的技术。例如，管理车辆紧急情况的方法包括以下步骤：收集关于车辆的至少一个乘员的生物特征数据，收集关于车辆的至少一个操作特征的数据，以及通过分析至少一部分生物特征数据和操作特征数据。该方法还可以包括传达与数据有关的至少一个消息，其中消息的内容至少部分地基于数据和/或响应于数据控制车辆的功能由处理设备确定。该方法还可以包括收集关于至少一个邻近车辆的至少一个操作特征的数据。

88.

发明授权
Method and system for nano-encoding and decoding information related to printed texts and images on paper and other surfaces 有权
标题翻译：用于纳米编码和解码与纸张和其他表面上的印刷文本和图像有关的信息的方法和系统

公开(公告)号：US08036415B2

公开(公告)日：2011-10-11

申请号：US11619454

申请日：2007-01-03

申请人： Dimitri Kanevsky , Dmitri V. Talapin

发明人： Dimitri Kanevsky , Dmitri V. Talapin

IPC分类号： G06K9/00 , G06K9/36 , G06K19/06

CPC分类号： G06K19/06 , G06K9/00 , G06K19/06009 , G06K19/06187 , H04N1/00326 , H04N1/00331

摘要： A method and system for nano-encoding and decoding information related to printed texts and images on paper and other surfaces is provided. The system and method includes a nano-encoder for encoding information related to printed texts and images; and then collocating the encoded information with the related printed texts and/or images. The system also includes a nano-decoder for decoding information encoded by the nano-encoder. The nano-decoder includes a text processing database having a translator database. The translator database includes a definition database; and a summary database. In addition, the system and method includes detecting luminescent nano particles and/or magnetic nano particles; and determining invariant properties of the detected nano particles. The invariant properties are then matched with coded information. The system and method includes matching the invariant properties with predetermined coded information and analyzing the invariant properties of the detected nano particles for segmentation.

摘要翻译： 提供了一种用于对纸张和其他表面上的印刷文本和图像进行纳米编码和解码信息的方法和系统。该系统和方法包括用于编码与印刷文本和图像有关的信息的纳米编码器; 然后将编码的信息与相关的打印文本和/或图像并置。该系统还包括用于解码由纳米编码器编码的信息的纳米解码器。纳米解码器包括具有翻译器数据库的文本处理数据库。翻译器数据库包括定义数据库; 和汇总数据库。此外，该系统和方法包括检测发光纳米颗粒和/或磁性纳米颗粒; 并确定所检测的纳米颗粒的不变特性。然后将不变属性与编码信息进行匹配。该系统和方法包括将不变属性与预定编码信息进行匹配，并分析检测到的纳米颗粒的不变性质进行分割。

89.

发明授权
Real time backup system for computer users 失效
标题翻译：计算机用户的实时备份系统

公开(公告)号：US07991739B2

公开(公告)日：2011-08-02

申请号：US12124750

申请日：2008-05-21

申请人： Dimitri Kanevsky , Alexander Zlatsin

发明人： Dimitri Kanevsky , Alexander Zlatsin

IPC分类号： G06F17/00

CPC分类号： G06F11/1461 , G06F11/1446 , G06F11/1456 , G06F11/1458 , G06F11/1464 , G06F11/1471 , G06F11/3438 , G06F17/30289 , G06F2201/80 , G06F2201/805 , Y10S707/99955

摘要： This invention involves tracking and backing all the information that a user generates on its computer devices (including embedded devices) in real time. The local user server records all user actions and gestures (via various means that include TV cameras). All of this information (user actions and saved files in a computer) is then sent to a remote server via the Internet. This remote server has a virtual map of all the embedded devices on a computer that the person uses. The remote server immediately starts to interpret the user's actions (including user gestures). In one implementation, the invention stores user actions that are related to data generation (e.g. actions that called some links where data is stored, or executed some programs that generated data). In another variant the remote server generates and downloads the same files that are downloaded on the local user computer devices. For example, if a person begins to download a program, the server may also download the same program on a remote backup server. This way, if the user loses this program, it can be retrieved automatically through a provided server on the Internet. If user's files are backed up by regular backup periodically, relevant data that were stored by real time backup servers can be eliminated.

摘要翻译： 本发明涉及跟踪和支持用户在其计算机设备（包括嵌入式设备）上实时产生的所有信息。本地用户服务器记录所有用户操作和手势（通过包括电视摄像机的各种方法）。所有这些信息（用户操作和计算机中保存的文件）然后通过Internet发送到远程服务器。该远程服务器具有人使用的计算机上的所有嵌入式设备的虚拟映射。远程服务器立即开始解释用户的操作（包括用户手势）。在一个实现中，本发明存储与数据生成相关的用户动作（例如，称为存储数据的一些链接的动作，或者执行一些生成数据的程序）。在另一个变体中，远程服务器生成并下载在本地用户计算机设备上下载的相同文件。例如，如果某人开始下载程序，则服务器还可以在远程备份服务器上下载相同的程序。这样一来，如果用户丢失了这个程序，就可以通过Internet上提供的服务器自动检索。如果定期备份用户文件，则可以消除实时备份服务器存储的相关数据。

90.

发明授权
Methods, systems, and computer program products for detecting alteration of audio or image data 有权
标题翻译：用于检测音频或图像数据更改的方法，系统和计算机程序产品

公开(公告)号：US07934264B2

公开(公告)日：2011-04-26

申请号：US11829338

申请日：2007-07-27

申请人： Sara H. Basson , Sarah L. Conrod , Dimitri Kanevsky , Edward E. Kelley , Giridharan R. Iyengar

发明人： Sara H. Basson , Sarah L. Conrod , Dimitri Kanevsky , Edward E. Kelley , Giridharan R. Iyengar

IPC分类号： G06F11/00 , G06F12/14 , G06F12/16 , G06F7/04 , G06F17/30 , G08B23/00 , H04N7/16 , G06F15/16

CPC分类号： G06F17/30038

摘要： Using metadata to detect alteration of data. A first set of metadata characteristics including at least one respective semantic description are recorded for a first set of data representing original data. A second set of metadata characteristics including at least one corresponding semantic description are recorded for a second set of data representing data under test. The first and second sets of metadata characteristics are compared. If the first and second sets of metadata characteristics are not identical, these sets are processed to identify locations in the first set of data that have been altered. Using the at least one semantic description for the first set of data and the at least one corresponding semantic description for the second set of data, one or more metadata characteristics that have changed from the first set of data to the second set of data are identified.

摘要翻译： 使用元数据检测数据的更改。对于表示原始数据的第一组数据记录包括至少一个相应语义描述的第一组元数据特征。对于表示正在测试的数据的第二组数据，记录包括至少一个对应的语义描述的第二组元数据特征。比较第一组和第二组元数据特征。如果第一和第二组元数据特征不相同，则处理这些集合以识别已经被改变的第一组数据中的位置。使用用于第一组数据的至少一个语义描述和用于第二组数据的至少一个对应语义描述，识别从第一组数据到第二组数据已经改变的一个或多个元数据特征。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类