专利检索 ap:("Srinivas Bangalore" OR "Alistair Conkie" OR "Giuseppe Di Fabbrizio" OR "Narendra Kumar Gupta" OR "Amanda Joy Stent") AND inv:"Srinivas Bangalore" 第 6 页

51.

发明授权
Method for building linguistic models from a corpus 有权
标题翻译：从语料库构建语言模型的方法

公开(公告)号：US06415248B1

公开(公告)日：2002-07-02

申请号：US09443891

申请日：1999-11-19

申请人： Srinivas Bangalore , Giuseppe Riccardi

发明人： Srinivas Bangalore , Giuseppe Riccardi

IPC分类号： G06F1720

CPC分类号： G06F17/274 , G10L15/183 , G10L15/19

摘要： A method iteratively integrates clustering techniques with phrase acquisition techniques to build complex linguistic models from a corpus. A set of features is initialized by the corpus. Thereafter, the method determines, according to a predetermined cost function, to process the features by one of phrase clustering processing or phrase grammar learning processing. If phrase clustering processing is performed, the method processes an interstitial set of features comprising both the old features and newly established clusters by phrase grammar learning processing. The features obtained as an output of phrase grammar learning is re-indexed as a set of features for a subsequent iteration. The method may be repeated over several iterations to build a hierarchical linguistic model.

摘要翻译： 一种方法迭代地将聚类技术与短语获取技术相结合，从语料库构建复杂的语言模型。一组特征由语料库初始化。此后，该方法根据预定的成本函数确定通过短语聚类处理或短语语法学习处理之一处理特征。如果执行短语群集处理，则该方法通过短语语法学习处理来处理包括旧特征和新建簇的特征的插页式集。作为短语语法学习的输出获得的特征被重新索引为随后的迭代的一组特征。该方法可以在多次迭代中重复以构建分级语言模型。

52.

发明授权
On-Demand language translation for television programs 有权
标题翻译：电视节目的按需语言翻译

公开(公告)号：US08589146B2

公开(公告)日：2013-11-19

申请号：US12772580

申请日：2010-05-03

申请人： Srinivas Bangalore , David Crawford Gibbon , Mazin Gilbert , Patrick Guy Haffner , Zhu Liu , Behzad Shahraray

发明人： Srinivas Bangalore , David Crawford Gibbon , Mazin Gilbert , Patrick Guy Haffner , Zhu Liu , Behzad Shahraray

IPC分类号： G06F17/28

CPC分类号： H04N21/4856 , G06F17/289 , G10L21/10 , H04N7/0885 , H04N7/17336 , H04N21/2355 , H04N21/2543 , H04N21/25435 , H04N21/4355 , H04N21/440236 , H04N21/4882 , H04N21/4884 , H04N21/8193

摘要： A method, a system and a machine-readable medium are provided for an on demand translation service. A translation module including at least one language pair module for translating a source language to a target language may be made available for use by a subscriber. The subscriber may be charged a fee for use of the requested on demand translation service or may be provided use of the on demand translation service for free in exchange for displaying commercial messages to the subscriber. A video signal may be received including information in the source language, which may be obtained as text from the video signal and may be translated from the source language to the target language by use of the translation module. Translated information, based on the translated text, may be added into the received video signal. The video signal including the translated information in the target language may be sent to a display device.

摘要翻译： 为按需翻译服务提供方法，系统和机器可读介质。包括用于将源语言翻译成目标语言的至少一个语言对模块的翻译模块可以被用户使用。用户可能会收取使用所请求的按需翻译服务的费用，或者可以免费使用按需翻译服务，以便向用户显示商业消息。可以接收包括源语言的信息的视频信号，其可以从视频信号获取为文本，并且可以通过使用翻译模块从源语言翻译成目标语言。基于翻译文本的翻译信息可以被添加到接收的视频信号中。可以将包括目标语言的翻译信息的视频信号发送到显示装置。

53.

发明授权
System and method for enriching spoken language translation with prosodic information 有权
标题翻译：用韵律信息丰富口语翻译的系统和方法

公开(公告)号：US08571849B2

公开(公告)日：2013-10-29

申请号：US12241660

申请日：2008-09-30

申请人： Srinivas Bangalore , Vivek Kumar Rangarajan Sridhar

发明人： Srinivas Bangalore , Vivek Kumar Rangarajan Sridhar

IPC分类号： G06F17/28

CPC分类号： G06F17/289 , G10L13/10

摘要： Disclosed herein are systems, methods, and computer readable-media for enriching spoken language translation with prosodic information in a statistical speech translation framework. The method includes receiving speech for translation to a target language, generating pitch accent labels representing segments of the received speech which are prosodically prominent, and injecting pitch accent labels with word tokens within the translation engine to create enriched target language output text. A further step may be added of synthesizing speech in the target language based on the prosody enriched target language output text. An automatic prosody labeler can generate pitch accent labels. An automatic prosody labeler can exploit lexical, syntactic, and prosodic information of the speech. A maximum entropy model may be used to determine which segments of the speech are prosodically prominent. A pitch accent label can include an indication of certainty that a respective segment of the speech is prosodically prominent and/or an indication of prosodic prominence of a respective segment of speech.

摘要翻译： 本文公开了用于在统计语音翻译框架中丰富口头语言翻译与韵律信息的系统，方法和计算机可读介质。该方法包括接收用于翻译到目标语言的语音，产生表示接收到的语音段的韵律突出的音调重音标签，以及在翻译引擎内用词令牌注入音调重音标签，以创建丰富的目标语言输出文本。可以基于韵律丰富的目标语言输出文本，添加目标语言中的合成语音的另一步骤。自动韵律贴标机可以产生音调重音标签。自动韵律标签者可以利用言语的词汇，句法和韵律信息。可以使用最大熵模型来确定语音的哪些段韵律突出。音调重音标签可以包括确定语音的相应片段韵律突出的指示和/或相应语音段的韵律突出的指示。

54.

发明授权
Method and apparatus for building sales tools by mining data from websites 失效
标题翻译：通过网站挖掘数据建立销售工具的方法和设备

公开(公告)号：US08359307B2

公开(公告)日：2013-01-22

申请号：US13088935

申请日：2011-04-18

申请人： Srinivas Bangalore , Junlan Feng , Mazin Gilbert , Jay Gordon Wilpon

发明人： Srinivas Bangalore , Junlan Feng , Mazin Gilbert , Jay Gordon Wilpon

IPC分类号： G06F17/30

CPC分类号： G06F17/2235 , G06Q30/02 , G06Q30/06

摘要： A website mining tool is disclosed that extracts information from, for example, a company's website and presents the extracted information in a graphical user interface (GUI). In one embodiment, web pages from a website are stored in, for example, computer memory and a structure of the web pages is identified. A plurality of blocks of information is then extracted as a function of this structure and a category is assigned to each block of information. The elements in the blocks of information are then displayed, for example to a salesperson, as a function of these categories. In another embodiment, Document Object Modeling parsing is used to identify the structure of the web pages. In yet another embodiment, a support vector machine is used to categorize each block of information.

摘要翻译： 公开了一种网站挖掘工具，其提取例如公司网站的信息，并将所提取的信息呈现在图形用户界面（GUI）中。在一个实施例中，来自网站的网页被存储在例如计算机存储器中，并且识别网页的结构。然后根据该结构提取多个信息块，并将类别分配给每个信息块。然后，作为这些类别的函数，将信息块中的元素显示为例如销售人员。在另一个实施例中，文档对象建模解析用于识别网页的结构。在另一个实施例中，支持向量机用于对每个信息块进行分类。

55.

发明申请
SYSTEM AND METHOD FOR OPTIMIZING RESPONSE HANDLING TIME AND CUSTOMER SATISFACTION SCORES 有权
标题翻译：优化响应处理时间和客户满意度的系统和方法

公开(公告)号：US20120271898A1

公开(公告)日：2012-10-25

申请号：US13539896

申请日：2012-07-02

申请人： Srinivas Bangalore , Mazin Gilbert

发明人： Srinivas Bangalore , Mazin Gilbert

IPC分类号： G06F15/16

CPC分类号： H04L51/02 , G06Q10/06 , G06Q10/06311 , G06Q10/063112 , G06Q30/0207 , G06Q30/0245

摘要： A system and method disclosed for using and updating a database of template responses for a live agent in response to user communications. The method includes computing an average string distance between each response from a live agent and a template, use to generate the response, modifying the computed average string distance based on a customer satisfaction score associated with each response and selecting a response that minimizes the computed average string distance and maximizes customer satisfaction. Upon receiving a further communication on a certain issue, the system presents a prototype response that has been added to the template database to the live agent for use in generating a response to the further communication that reduces handling time and increases customer satisfaction.

摘要翻译： 公开了用于响应于用户通信使用和更新活动代理的模板响应的数据库的系统和方法。该方法包括计算来自活动代理和模板的每个响应之间的平均字符串距离，用于生成响应，基于与每个响应相关联的客户满意度得分修改所计算的平均字符串距离，并且选择使计算出的平均值最小化的响应弦距，并最大化客户满意度。在一个特定的问题上接收到进一步的通信之后，系统呈现已经被添加到模板数据库中的实时代理的原型响应，用于产生对进一步通信的响应，这减少了处理时间并提高了客户满意度。

56.

发明授权
System and method of spoken language understanding in human computer dialogs 有权
标题翻译：在人机对话中口语理解的系统和方法

公开(公告)号：US08190436B2

公开(公告)日：2012-05-29

申请号：US10310596

申请日：2002-12-05

申请人： Srinivas Bangalore , Narendra K. Gupta , Mazin G Rahim

发明人： Srinivas Bangalore , Narendra K. Gupta , Mazin G Rahim

IPC分类号： G10L21/00 , G06F17/20 , G06F17/27

CPC分类号： G10L15/1815 , G06F17/2785 , G10L13/043 , G10L15/02 , G10L15/1822 , G10L15/265

摘要： A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.

摘要翻译： 公开了一种提高口语对话系统中的自动语音识别的系统和方法。该方法包括将语音识别器输出划分为独立子句，识别每个自包含子句中的对话行为，通过识别当前域对象和/或当前域动作进行限定对话行为，以及确定是否可进一步限定对于当前域对象和/或当前域操作。如果可以进一步鉴定，则该方法包括识别与当前域对象和/或当前域操作相关联的另一域操作和/或另一域对象，将另一域操作和/或另一域对象重新分配为当前域操作，以及 /或当前域对象，然后递归地限定新的当前域操作和/或当前对象。这个过程一直持续到没有什么是剩下的资格。

57.

发明授权
Text edit tracker that categorizes communications, determines distances between templates, codes templates in color, and uses a morphing score based on edits 有权
标题翻译：对通信进行分类的文本编辑跟踪器，确定模板之间的距离，颜色的代码模板，并使用基于编辑的变形分数

公开(公告)号：US08170961B2

公开(公告)日：2012-05-01

申请号：US12252732

申请日：2008-10-16

申请人： Srinivas Bangalore , David J. Smith

发明人： Srinivas Bangalore , David J. Smith

IPC分类号： G06F11/00

CPC分类号： G06F17/248

摘要： A method for monitoring edits to a template for responding to an incoming communication includes categorizing the incoming communication into a category associated with the template for a response to the incoming communication. The method also includes determining distances between the template and each of a set of responses based on the template, at a predetermined level of granularity. The method also includes coding the template in accordance with the determined distances and displaying the coded template. A method for extracting a new template based on responses to an existing template includes selecting factors that affect quantitative measures for preparing a response to the incoming communication. The method includes using a mathematical model of the factors to cluster a set of responses created based on the existing template into two clusters. The method further includes restricting a first cluster centroid to be the existing template and searching for a second cluster centroid for a second cluster.

摘要翻译： 用于监视对模板的编辑以响应传入通信的方法包括将传入通信分类为与模板相关联的类别以对于进入通信的响应。该方法还包括以预定的粒度级确定基于模板的模板与一组响应中的每一个之间的距离。该方法还包括根据确定的距离对模板进行编码并显示编码的模板。基于对现有模板的响应来提取新模板的方法包括选择影响对进入通信的响应的定量测量的因素。该方法包括使用这些因素的数学模型将基于现有模板创建的一组响应聚类成两个聚类。该方法还包括将第一聚类中心限制为现有模板并搜索第二聚簇的第二聚类质心。

58.

发明申请
SYSTEM AND METHOD FOR OPEN SPEECH RECOGNITION 有权
标题翻译：用于开放语音识别的系统和方法

公开(公告)号：US20120084086A1

公开(公告)日：2012-04-05

申请号：US12895359

申请日：2010-09-30

申请人： Mazin GILBERT , Srinivas Bangalore , Patrick Haffner , Robert Bell

发明人： Mazin GILBERT , Srinivas Bangalore , Patrick Haffner , Robert Bell

IPC分类号： G10L15/26

CPC分类号： G10L15/32 , G10L15/063 , G10L15/26 , G10L2015/0638

摘要： Disclosed herein are systems, methods and non-transitory computer-readable media for performing speech recognition across different applications or environments without model customization or prior knowledge of the domain of the received speech. The disclosure includes recognizing received speech with a collection of domain-specific speech recognizers, determining a speech recognition confidence for each of the speech recognition outputs, selecting speech recognition candidates based on a respective speech recognition confidence for each speech recognition output, and combining selected speech recognition candidates to generate text based on the combination.

摘要翻译： 本文公开了用于在不需要模型定制或接收到的语音的领域的先前知识的情况下在不同的应用或环境上执行语音识别的系统，方法和非暂时的计算机可读介质。该公开内容包括：利用特定领域的语音识别器的集合来识别接收的语音，为每个语音识别输出确定语音识别置信度，基于每个语音识别输出的相应语音识别置信度选择语音识别候选，以及组合所选语音识别候选人基于组合生成文本。

59.

发明授权
System and method of automatically building dialog services by exploiting the content and structure of websites 失效
标题翻译：通过利用网站的内容和结构自动构建对话服务的系统和方法

公开(公告)号：US08065151B1

公开(公告)日：2011-11-22

申请号：US10740156

申请日：2003-12-18

申请人： Srinivas Bangalore , Junlan Feng , Mazin Rahim

发明人： Srinivas Bangalore , Junlan Feng , Mazin Rahim

IPC分类号： G10L11/00

CPC分类号： G10L15/22 , G10L15/193

摘要： A method and system are disclosed for providing a dialog interface for a website. The method comprises at each node in a website, computing a summary, a document description and an alias. A dialog manager within a spoken dialog service utilizes the summary, document description and alias for each website node to generate prompts to a user, wherein nodes in the website are matched with user requests. In this manner, a spoken dialog interface to the website content and navigation may be generated automatically.

摘要翻译： 公开了一种用于提供网站的对话界面的方法和系统。该方法包括在网站中的每个节点处，计算摘要，文档描述和别名。口语对话服务中的对话管理器利用每个网站节点的摘要，文档描述和别名来向用户生成提示，其中网站中的节点与用户请求匹配。以这种方式，可以自动生成对网站内容和导航的口头对话界面。

60.

发明授权
System and method of providing a spoken dialog interface to a website 有权
标题翻译：向网站提供口语对话界面的系统和方法

公开(公告)号：US08060369B2

公开(公告)日：2011-11-15

申请号：US12533300

申请日：2009-07-31

申请人： Srinivas Bangalore , Junlan Feng , Mazin G. Rahim

发明人： Srinivas Bangalore , Junlan Feng , Mazin G. Rahim

IPC分类号： G10L15/18 , G10L15/22

CPC分类号： G10L13/08 , G06F17/27 , G10L15/063 , G10L15/183 , G10L15/22 , H04M3/4936

摘要： Disclosed is a system and method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes converting data from a structured database associated with a website to a structured text data set and a structured task knowledge base, extracting linguistic items from the structured database, and training a spoken dialog service component using at least one of the structured text data, the structured task knowledge base, or the linguistic items. The system includes modules configured to implement the method.

摘要翻译： 公开了一种用于从网站数据训练口语对话服务组件的系统和方法。口语对话服务组件通常包括自动语音识别模块，语言理解模块，对话管理模块，语言生成模块和文本到语音模块。该方法包括将来自与网站相关联的结构化数据库的数据转换为结构化文本数据集和结构化任务知识库，从结构化数据库中提取语言项目，以及使用至少一个结构化文本数据来训练口语对话服务组件，结构化任务知识库或语言项目。该系统包括配置为实现该方法的模块。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类