专利检索 ap:("Nitendra Rajput" OR "Om D. Deshmukh") AND inv:"Om D. Deshmukh" 第 1 页

1.

发明申请
User Driven Audio Content Navigation 有权
标题翻译：用户驱动的音频内容导航

公开(公告)号：US20110320950A1

公开(公告)日：2011-12-29

申请号：US12822802

申请日：2010-06-24

申请人： Nitendra Rajput , Om D. Deshmukh

发明人： Nitendra Rajput , Om D. Deshmukh

IPC分类号： G06F3/16

CPC分类号： G06F17/30775 , G06F17/30743 , G10L15/00 , G10L21/043 , H04M3/4938

摘要： Systems and associated methods configured to provide user-driven audio content navigation for the spoken web are described. Embodiments allow users to skim audio for content that seems to be of relevance to the user, similar to visual skimming of standard web pages, and mark point of interest within the audio. Embodiments provide techniques for navigating audio content while interacting with information systems in a client-server environment, where the client device can be a simple, standard telephone.

摘要翻译： 描述了配置成为口头网络提供用户驱动的音频内容导航的系统和相关方法。实施例允许用户对似乎与用户相关的内容来删除音频，类似于标准网页的视觉撇除，并标记音频内的兴趣点。实施例提供了用于在客户机 - 服务器环境中与信息系统交互的情况下导航音频内容的技术，其中客户端设备可以是简单的标准电话。

2.

发明授权
User driven audio content navigation 有权

公开(公告)号：US09710552B2

公开(公告)日：2017-07-18

申请号：US13596313

申请日：2012-08-28

申请人： Nitendra Rajput , Om D. Deshmukh

发明人： Nitendra Rajput , Om D. Deshmukh

IPC分类号： G10L15/00 , G10L21/00 , H04M3/00 , G06F17/30 , G10L21/043 , H04M3/493

CPC分类号： G06F17/30775 , G06F17/30743 , G10L15/00 , G10L21/043 , H04M3/4938

摘要： Systems and associated methods configured to provide user-driven audio content navigation for the spoken web are described. Embodiments allow users to skim audio for content that seems to be of relevance to the user, similar to visual skimming of standard web pages, and mark point of interest within the audio. Embodiments provide techniques for navigating audio content while interacting with information systems in a client-server environment, where the client device can be a simple, standard telephone.

3.

发明申请
User Driven Audio Content Navigation 有权
标题翻译：用户驱动的音频内容导航

公开(公告)号：US20120324356A1

公开(公告)日：2012-12-20

申请号：US13596313

申请日：2012-08-28

申请人： Nitendra Rajput , Om D. Deshmukh

发明人： Nitendra Rajput , Om D. Deshmukh

IPC分类号： G06F3/16

CPC分类号： G06F17/30775 , G06F17/30743 , G10L15/00 , G10L21/043 , H04M3/4938

摘要： Systems and associated methods configured to provide user-driven audio content navigation for the spoken web are described. Embodiments allow users to skim audio for content that seems to be of relevance to the user, similar to visual skimming of standard web pages, and mark point of interest within the audio. Embodiments provide techniques for navigating audio content while interacting with information systems in a client-server environment, where the client device can be a simple, standard telephone.

摘要翻译： 描述了配置成为口头网络提供用户驱动的音频内容导航的系统和相关方法。实施例允许用户对似乎与用户相关的内容来删除音频，类似于标准网页的视觉撇除，并标记音频内的兴趣点。实施例提供了用于在客户机 - 服务器环境中与信息系统交互的情况下导航音频内容的技术，其中客户端设备可以是简单的标准电话。

4.

发明授权
User driven audio content navigation 有权

公开(公告)号：US09715540B2

公开(公告)日：2017-07-25

申请号：US12822802

申请日：2010-06-24

申请人： Om D. Deshmukh , Nitendra Rajput

发明人： Om D. Deshmukh , Nitendra Rajput

IPC分类号： G10L15/00 , G10L21/00 , H04M3/00 , G06F17/30 , G10L21/043 , H04M3/493

CPC分类号： G06F17/30775 , G06F17/30743 , G10L15/00 , G10L21/043 , H04M3/4938

摘要： Systems and associated methods configured to provide user-driven audio content navigation for the spoken web are described. Embodiments allow users to skim audio for content that seems to be of relevance to the user, similar to visual skimming of standard web pages, and mark point of interest within the audio. Embodiments provide techniques for navigating audio content while interacting with information systems in a client-server environment, where the client device can be a simple, standard telephone.

5.

发明授权
Automatic evaluation of spoken fluency 有权
标题翻译：自动评价口语流利

公开(公告)号：US08457967B2

公开(公告)日：2013-06-04

申请号：US12541927

申请日：2009-08-15

申请人： Kartik Audhkhasi , Om D. Deshmukh , Kundan Kandhway , Ashish Verma

发明人： Kartik Audhkhasi , Om D. Deshmukh , Kundan Kandhway , Ashish Verma

IPC分类号： G10L15/04 , G10L15/26 , G10L15/06 , G10L13/00 , G10L21/00 , G06F17/27

CPC分类号： G10L15/26 , G09B19/04

摘要： A procedure to automatically evaluate the spoken fluency of a speaker by prompting the speaker to talk on a given topic, recording the speaker's speech to get a recorded sample of speech, and then analyzing the patterns of disfluencies in the speech to compute a numerical score to quantify the spoken fluency skills of the speakers. The numerical fluency score accounts for various prosodic and lexical features, including formant-based filled-pause detection, closely-occurring exact and inexact repeat N-grams, normalized average distance between consecutive occurrences of N-grams. The lexical features and prosodic features are combined to classify the speaker with a C-class classification and develop a rating for the speaker.

摘要翻译： 一个程序，通过提示说话者在给定的主题上进行谈话，记录讲话者的语音以获得记录的语音样本，然后分析语音中的不清楚的模式以计算数字得分，自动评估讲话者的口语流畅性量化演讲者的口语流利能力。数值流利度分数考虑到各种韵律和词汇特征，包括基于共振峰的填充暂停检测，紧密发生的精确和不精确的重复N克，连续出现的N克之间的归一化平均距离。词汇特征和韵律特征相结合，将扬声器分类为C级分类，并为扬声器开发评级。

6.

发明申请
Analysis of the Temporal Evolution of Emotions in an Audio Interaction in a Service Delivery Environment 有权
标题翻译：分析服务交付环境中音频交互情绪的时间演化

公开(公告)号：US20110196677A1

公开(公告)日：2011-08-11

申请号：US12704172

申请日：2010-02-11

申请人： Om D. Deshmukh , Chitra Dorai , Shailesh Joshi , Maureen E. Rzasa , Ashish Verma , Karthik Visweswariah , Gary J. Wright , Sai Zeng

发明人： Om D. Deshmukh , Chitra Dorai , Shailesh Joshi , Maureen E. Rzasa , Ashish Verma , Karthik Visweswariah , Gary J. Wright , Sai Zeng

IPC分类号： G10L17/00 , G10L15/00

CPC分类号： G10L15/22 , G10L2015/227

摘要： According to one illustrative embodiment, a method is provided for analyzing an audio interaction. At least one change in an emotion of a speaker in an audio interaction and at least one aspect of the audio interaction are identified. The at least one change in an emotion is analyzed in conjunction with the at least one aspect to determine a relationship between the at least one change in an emotion and the at least one aspect, and a result of the analysis is provided.

摘要翻译： 根据一个说明性实施例，提供了一种用于分析音频交互的方法。识别音频交互中的扬声器的情感和音频交互的至少一个方面中的至少一个变化。结合至少一个方面来分析情绪中的至少一个改变以确定情绪中的至少一个改变与至少一个方面之间的关系，并且提供分析结果。

7.

发明授权
Intent discovery in audio or text-based conversation 有权
标题翻译：音频或基于文本的对话中的意图发现

公开(公告)号：US08983840B2

公开(公告)日：2015-03-17

申请号：US13526637

申请日：2012-06-19

申请人： Om D. Deshmukh , Sachindra Joshi , Saket Saurabh , Ashish Verma

发明人： Om D. Deshmukh , Sachindra Joshi , Saket Saurabh , Ashish Verma

IPC分类号： G10L15/18 , G06F17/27

CPC分类号： G10L25/48 , G10L15/02 , G10L15/18 , G10L15/1822 , G10L15/26 , G10L21/10

摘要： Techniques, an apparatus and an article of manufacture identifying one or more utterances that are likely to carry the intent of a speaker, from a conversation between two or more parties. A method includes obtaining an input of a set of utterances in chronological order from a conversation between two or more parties, computing an intent confidence value of each utterance by summing intent confidence scores from each of the constituent words of the utterance, wherein intent confidence scores capture each word's influence on the subsequent utterances in the conversation based on (i) the uniqueness of the word in the conversation and (ii) the number of times the word subsequently occurs in the conversation, and generating a ranked order of the utterances from highest to lowest intent confidence value, wherein the highest intent value corresponds to the utterance which is most likely to carry intent of the speaker.

摘要翻译： 从两个或多个方之间的对话中识别出可能携带说话人意图的一个或多个话语的技术，装置和制品。一种方法包括从两个或更多方之间的会话按时间顺序获得一组话语的输入，通过将来自每个话语的组成词的意图置信度得分相加来计算每个话语的意图置信度值，其中意图置信度得分基于（i）会话中的单词的唯一性和（ii）单词随后在会话中发生的次数，并且从最高级别生成排序的话语顺序，从而捕获每个单词对对话中后续话语的影响到最低意图置信度值，其中最高意图值对应于最有可能携带说话者意图的话语。

8.

发明授权
Analysis of the temporal evolution of emotions in an audio interaction in a service delivery environment 有权
标题翻译：分析服务交付环境中音频交互中情绪的时间演变

公开(公告)号：US08417524B2

公开(公告)日：2013-04-09

申请号：US12704172

申请日：2010-02-11

申请人： Om D. Deshmukh , Chitra Dorai , Shailesh Joshi , Maureen E. Rzasa , Ashish Verma , Karthik Visweswariah , Gary J. Wright , Sai Zeng

发明人： Om D. Deshmukh , Chitra Dorai , Shailesh Joshi , Maureen E. Rzasa , Ashish Verma , Karthik Visweswariah , Gary J. Wright , Sai Zeng

IPC分类号： G10L15/06 , G10L11/04 , G10L11/06 , G10L19/06 , G10L15/00 , G10L17/00 , G10L11/00 , G10L21/00

CPC分类号： G10L15/22 , G10L2015/227

摘要： Analyzing an audio interaction is provided. At least one change in an emotion of a speaker in an audio interaction and at least one aspect of the audio interaction are identified. The at least one change in an emotion is analyzed in conjunction with the at least one aspect to determine a relationship between the at least one change in an emotion and the at least one aspect, and a result of the analysis is provided.

摘要翻译： 提供音频互动分析。识别音频交互中的扬声器的情感和音频交互的至少一个方面中的至少一个变化。结合至少一个方面来分析情绪中的至少一个改变以确定情绪中的至少一个改变与至少一个方面之间的关系，并且提供分析结果。

9.

发明申请
System and a Method for Generating Semantically Similar Sentences for Building a Robust SLM 有权
标题翻译：用于生成语义类似句子的系统和方法，用于构建稳健的SLM

公开(公告)号：US20130018649A1

公开(公告)日：2013-01-17

申请号：US13181923

申请日：2011-07-13

申请人： Om D. Deshmukh , Sachindra Joshi , Shajith I. Mohamed , Ashish Verma

发明人： Om D. Deshmukh , Sachindra Joshi , Shajith I. Mohamed , Ashish Verma

IPC分类号： G06F17/27

CPC分类号： G06F17/274 , G06F17/2795 , G06F17/2881 , G10L15/26

摘要： A system and method are described for generating semantically similar sentences for a statistical language model. A semantic class generator determines for each word in an input utterance a set of corresponding semantically similar words. A sentence generator computes a set of candidate sentences each containing at most one member from each set of semantically similar words. A sentence verifier grammatically tests each candidate sentence to determine a set of grammatically correct sentences semantically similar to the input utterance. Also note that the generated semantically similar sentences are not restricted to be selected from an existing sentence database.

摘要翻译： 描述了用于为统计语言模型生成语义上类似的句子的系统和方法。语义类生成器确定输入语义中的每个单词一组相应的语义上相似的单词。句子生成器从每个语义上相似的单词集合中计算出一组候选句子，每个候选句子最多包含一个成员。句子验证器语法测试每个候选句子以确定一组语法上正确的句子，其语义上类似于输入的话语。还要注意，生成的语义上相似的句子不限于从现有句子数据库中选择。

10.

发明申请
EVALUATING SPOKEN SKILLS 失效
标题翻译：评估SPOKEN技能

公开(公告)号：US20100185435A1

公开(公告)日：2010-07-22

申请号：US12354849

申请日：2009-01-16

申请人： Om D. Deshmukh , Ashish Verma

发明人： Om D. Deshmukh , Ashish Verma

IPC分类号： G06F17/20 , G10L15/04

CPC分类号： G10L15/1807 , G09B19/04 , G09B19/06 , G10L15/26

摘要： Techniques for evaluating one or more spoken language skills of a speaker are provided. The techniques include identifying one or more temporal locations of interest in a speech passage spoken by a speaker, computing one or more acoustic parameters, wherein the one or more acoustic parameters capture one or more properties of one or more acoustic-phonetic features of the one or more locations of interest, and combining the one or more acoustic parameters with an output of an automatic speech recognizer to modify an output of a spoken language skill evaluation.

摘要翻译： 提供了用于评估扬声器的一种或多种口语技能的技术。所述技术包括识别由扬声器说出的语音通道中的一个或多个感兴趣的时间位置，计算一个或多个声学参数，其中所述一个或多个声学参数捕获一个或多个声学特征的一个或多个属性或更多的感兴趣的位置，并且将一个或多个声学参数与自动语音识别器的输出组合以修改口语技能评估的输出。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类