Patent search cpc:"G10L2015/0635" Page 6

51.

Invention application
System and Method for Optimizing Speech Recognition and Natural Language Parameters with User Feedback
Legal status: In force

Publication number: US20150348540A1

Publication date: 2015-12-03

Application number: US14287866

Application date: 2014-05-27

Applicant: Andrej LJOLJE , Diamantino Antonio CASEIRO , Mazin GILBERT , Vincent GOFFIN , Taniya Mishra

Inventor: Andrej LJOLJE , Diamantino Antonio CASEIRO , Mazin GILBERT , Vincent GOFFIN , Taniya Mishra

IPC: G10L15/18 , G10L15/26

CPC classification number: G10L15/063 , G10L15/01 , G10L15/18 , G10L15/26 , G10L2015/0635

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.

52.

Granted invention patent
Cloud based adaptive learning for distributed sensors
Legal status: In force

Publication number: US09177546B2

Publication date: 2015-11-03

Application number: US14013009

Application date: 2013-08-28

Applicant: Texas Instruments Incorporated

Inventor: Lin Sun , Wei Ma

IPC: G10L15/065

CPC classification number: G10L15/063 , G06F17/30743 , G10L15/065 , G10L2015/0635

Abstract: A low power sound recognition sensor is configured to receive an analog signal that may contain a signature sound. Sound parameter information is extracted from the analog signal and compared to a sound parameter reference stored locally with the sound recognition sensor to detect when the signature sound is received in the analog signal. A trigger signal is generated when a signature sound is detected. A portion of the extracted sound parameter information is sent to a remote training location for adaptive training when a signature sound detection error occurs. An updated sound parameter reference from the remote training location is received in response to the adaptive training.

53.

Invention application
SYSTEM AND METHOD FOR PERFORMING DUAL MODE SPEECH RECOGNITION
Legal status: In force

Publication number: US20150154959A1

Publication date: 2015-06-04

Application number: US14621024

Application date: 2015-02-12

Applicant: SoundHound, Inc.

Inventor: Timothy P. Stonehocker , Keyvan Mohajer , Bernard Mont-Reynaud

IPC: G10L15/30 , G10L15/08 , G10L15/26

CPC classification number: G10L15/30 , G10L15/04 , G10L15/063 , G10L15/08 , G10L15/265 , G10L15/34 , G10L17/06 , G10L2015/0635 , G10L2015/081

Abstract: A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.
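
Illustrative sketch (not taken from the patent): the selection rule in the abstract above can be written in a few lines of Python. The names Result, pick_result and update_vocabulary are hypothetical, None stands for a source that failed or missed the latency cutoff, and keeping the local result on a confidence tie is an assumption the abstract does not address.

```python
from dataclasses import dataclass
from typing import Optional, Set


@dataclass
class Result:
    text: str          # transcription returned by a recognizer
    confidence: float  # confidence score for that transcription


def pick_result(local: Optional[Result], remote: Optional[Result]) -> Optional[Result]:
    """Return the accepted transcription, or None if both sources failed."""
    if local is not None and remote is not None:
        # Both succeeded: accept the higher-confidence result
        # (ties go to the local result, an assumption).
        return remote if remote.confidence > local.confidence else local
    # Only one succeeded (or neither): accept whichever is available.
    return local if local is not None else remote


def update_vocabulary(vocab: Set[str], remote: Optional[Result]) -> Set[str]:
    """Add words from a successful remote result that the client lacks."""
    if remote is None:
        return set(vocab)
    return set(vocab) | set(remote.text.lower().split())
```

Treating the client vocabulary as a plain set of lowercase words is a simplification; the abstract only says the vocabulary is updated with information it does not already contain.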

54.

Invention application
MOBILE TERMINAL AND MENU CONTROL METHOD THEREOF
Legal status: Under examination (published)

Publication number: US20150126252A1

Publication date: 2015-05-07

Application number: US14594959

Application date: 2015-01-12

Applicant: LG Electronics Inc.

Inventor: Jong-Keun YOUN , Dae-Sung JUNG , Jae-Hoon YU , Tae-Jun KIM , Jae-Min JOH , Jae-Do KWAK , Jong-Ho SHIN

IPC: H04M1/27 , H04M1/725 , G10L17/22 , H04W4/16

CPC classification number: H04M1/271 , G06F3/167 , G10L15/063 , G10L15/1815 , G10L15/22 , G10L17/22 , G10L25/78 , G10L2015/0635 , G10L2015/223 , G10L2015/228 , G10L2025/783 , H04M1/72563 , H04M1/72583 , H04W4/16

Abstract: A mobile terminal including a wireless communication unit configured to provide wireless communication, a display, and a controller configured to, activate a mode for voice recognition in response to a touch input to a soft button displayed on the display or to a hard button on the mobile terminal, receive a first voice input associated with a phone call relating operation of the mobile terminal, display an indicator on the display indicating the voice input is being recognized by the mobile terminal, analyze the context of a voice command in the voice input, execute the call relating operation only if there is a single contact in a phonebook that matches the voice command in the first voice input, if there is no single contact that matches the voice command of the received voice input, display a plurality of candidates that is analyzed based on the voice command, receive a second input according to a plurality of candidates, and execute the call relating operation based on the second input.
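
Illustrative sketch (not taken from the patent): the branching described in the abstract above, call only when exactly one phonebook contact matches and otherwise show candidates for a second input, might look like this in Python. The function name resolve_call_target and the case-insensitive substring match are assumptions; the abstract does not say how matching is performed.

```python
from typing import List, Tuple, Union


def resolve_call_target(spoken_name: str, phonebook: List[str]) -> Tuple[str, Union[str, List[str]]]:
    """Decide whether to place the call or ask the user to choose."""
    needle = spoken_name.lower()
    # Hypothetical matching rule: case-insensitive substring match.
    matches = [contact for contact in phonebook if needle in contact.lower()]
    if len(matches) == 1:
        # Exactly one contact matches: execute the call operation.
        return ("call", matches[0])
    # No single match: return the candidates to display for a second input.
    return ("choose", matches)
```

For example, resolve_call_target("kim", ["Kim Lee", "Kim Park"]) returns the "choose" action with both candidates, and the call proceeds only after the second input selects one of them.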

55.

Invention application
System and Method for Determining the Compliance of Agent Scripts
Legal status: In force

Publication number: US20150066504A1

Publication date: 2015-03-05

Application number: US14319847

Application date: 2014-06-30

Applicant: Verint Systems Ltd.

Inventor: Jeffery Michael Iannone , Ron Wein , Omer Ziv

IPC: G10L15/01 , G10L15/26

CPC classification number: G10L15/10 , G10L15/04 , G10L15/06 , G10L15/08 , G10L15/26 , G10L2015/0635 , G10L2015/088

Abstract: Systems and methods of script identification in audio data obtained from audio data. The audio data is segmented into a plurality of utterances. A script model representative of a script text is obtained. The plurality of utterances are decoded with the script model. A determination is made if the script text occurred in the audio data.

56.

Invention application
System and Method of Automated Language Model Adaptation
Legal status: In force

Publication number: US20150066503A1

Publication date: 2015-03-05

Application number: US14291895

Application date: 2014-05-30

Applicant: VERINT SYSTEMS LTD.

Inventor: Ran Achituv , Omer Ziv , Ido Shapira , Daniel Baum

IPC: G10L15/26

CPC classification number: G10L15/197 , G06F17/30746 , G10L15/063 , G10L15/083 , G10L15/26 , G10L2015/0635 , H04M3/51

Abstract: Systems and methods of automated adaptation of a language model for transcription of audio data include obtaining audio data. The audio data is transcribed with a language model to produce a plurality of audio file transcriptions. A quality of the plurality of audio file transcriptions is evaluated. At least one best transcription from a plurality of audio file transcriptions is selected based upon the evaluated quality. Statistics are calculated from the selected at least one best transcription from the plurality of audio file transcriptions. The language model is modified from the calculated statistics.
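
Illustrative sketch (not taken from the patent): the loop in the abstract above (transcribe, evaluate quality, select the best transcriptions, compute statistics, modify the model) can be mimicked with a toy unigram model in Python. The function adapt_language_model, the unigram representation, the externally supplied quality scores and the linear interpolation weight are all assumptions; the abstract does not specify the statistics or the update rule.

```python
from collections import Counter
from typing import Dict, List, Tuple


def adapt_language_model(
    model: Dict[str, float],
    scored_transcripts: List[Tuple[str, float]],
    top_k: int = 1,
    weight: float = 0.5,
) -> Dict[str, float]:
    """Interpolate a unigram model with statistics from the best transcripts."""
    # Select the top_k transcriptions by their evaluated quality score.
    best = sorted(scored_transcripts, key=lambda pair: pair[1], reverse=True)[:top_k]
    # Calculate word-frequency statistics from the selected transcriptions.
    counts = Counter(word for text, _ in best for word in text.lower().split())
    total = sum(counts.values())
    if total == 0:
        return dict(model)  # nothing to learn from; leave the model unchanged
    vocab = set(model) | set(counts)
    # Modify the model: linear interpolation of old and new probabilities.
    return {
        word: (1 - weight) * model.get(word, 0.0) + weight * counts[word] / total
        for word in vocab
    }
```

If the input model sums to one, the interpolated model does too, so the result can be fed back in for another adaptation round.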

57.

Granted invention patent
Mobile terminal and menu control method thereof
Legal status: In force

Publication number: US08958848B2

Publication date: 2015-02-17

Application number: US12140111

Application date: 2008-06-16

Applicant: Jong-Ho Shin , Jong-Keun Youn , Dae-Sung Jung , Jae-Hoon Yu , Tae-Jun Kim , Jae-Min Joh , Jae-Do Kwak

Inventor: Jong-Ho Shin , Jong-Keun Youn , Dae-Sung Jung , Jae-Hoon Yu , Tae-Jun Kim , Jae-Min Joh , Jae-Do Kwak

IPC: H04B1/38 , G10L15/00 , G10L25/00 , H04M1/725

CPC classification number: H04M1/271 , G06F3/167 , G10L15/063 , G10L15/1815 , G10L15/22 , G10L17/22 , G10L25/78 , G10L2015/0635 , G10L2015/223 , G10L2015/228 , G10L2025/783 , H04M1/72563 , H04M1/72583 , H04W4/16

Abstract: A mobile terminal including an input unit configured to receive an input to activate a voice recognition function on the mobile terminal and a memory configured to store multiple domains related to menus and operations of the mobile terminal. It further includes a controller configured to access a specific domain among the multiple domains included in the memory based on the received input to activate the voice recognition function, to recognize user speech based on a language model and an acoustic model of the accessed domain, and to determine at least one menu and operation of the mobile terminal based on the accessed specific domain and the recognized user speech.

58.

Invention application
VOICE AUTHENTICATION AND SPEECH RECOGNITION SYSTEM AND METHOD
Legal status: Under examination (published)

Publication number: US20150019220A1

Publication date: 2015-01-15

Application number: US14374225

Application date: 2013-01-23

Applicant: AURAYA PTY LTD

Inventor: Habib Emile Talhami , Amit Sadanand Malegaonkar , Renuka Amit Malegaonkar , Clive David Summerfield

IPC: G10L15/07 , G10L17/04 , G10L15/06

CPC classification number: G10L15/07 , G10L15/063 , G10L15/065 , G10L17/00 , G10L17/04 , G10L2015/0635 , G10L2015/0638

Abstract: A method for configuring a speech recognition system comprises obtaining a speech sample utilised by a voice authentication system in a voice authentication process. The speech sample is processed to generate acoustic models for units of speech associated with the speech sample. The acoustic models are stored for subsequent use by the speech recognition system as part of a speech recognition process.

59.

Invention application
VOICE COMMAND DEFINITIONS USED IN LAUNCHING APPLICATION WITH A COMMAND
Legal status: In force

Publication number: US20140278419A1

Publication date: 2014-09-18

Application number: US13830318

Application date: 2013-03-14

Applicant: MICROSOFT CORPORATION

Inventor: F. Avery Bishop , Travis Wilson , Robert Chambers , Robert Brown

IPC: G06F3/16 , G10L15/22 , G10L15/04

CPC classification number: G10L15/22 , G06F3/16 , G06F3/167 , G10L15/063 , G10L15/193 , G10L2015/0635 , G10L2015/223

Abstract: A voice command definition file (VCDF) declaratively defines voice commands for an application. For example, the VCDF may include definitions for: voice commands; one or more phrases/utterances that may be said to execute each of the commands; a navigation location to navigate to within the application (e.g. a page); phrase lists containing items that may be used as a parameter in a voice command; examples; feedback; and the like. A user may say a single utterance to launch the application, navigate to the associated location of the command and execute the command. The VCDF may define multiple ways to listen for a particular command. The VCDF may be edited/defined by a user and may include a user friendly name for an application. A speech engine loads the VCDF for use such that it may recognize the commands associated with an application. The definitions may be updated during runtime.

60.

Granted invention patent
System and method of extracting clauses for spoken language understanding
Legal status: In force

Publication number: US08818793B1

Publication date: 2014-08-26

Application number: US10446489

Application date: 2003-05-28

Applicant: Srinivas Bangalore , Narendra K. Gupta , Mazin G Rahim

Inventor: Srinivas Bangalore , Narendra K. Gupta , Mazin G Rahim

IPC: G06F17/27

CPC classification number: G10L15/063 , G06F17/271 , G06F17/277 , G10L2015/0635

Abstract: A clausifier and method of extracting clauses for spoken language understanding are disclosed. The method relates to generating a set of clauses from speech utterance text and comprises inserting at least one boundary tag in speech utterance text related to sentence boundaries, inserting at least one edit tag indicating a portion of the speech utterance text to remove, and inserting at least one conjunction tag within the speech utterance text. The result is a set of clauses that may be identified within the speech utterance text according to the inserted at least one boundary tag, at least one edit tag and at least one conjunction tag. The disclosed clausifier comprises a sentence boundary classifier, an edit detector classifier, and a conjunction detector classifier. The clausifier may comprise a single classifier or a plurality of classifiers to perform the steps of identifying sentence boundaries, editing text, and identifying conjunctions within the text.
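
Illustrative sketch (not taken from the patent): once boundary, edit and conjunction tags have been inserted, reading off the clauses is a single pass over the tokens. The Python below assumes the three classifiers have already run and uses hypothetical tag spellings: "<s>" for a sentence boundary, "<edit>" and "</edit>" around text to remove, and "<conj>" before a conjunction. The patent's actual tag format is not given in the abstract.

```python
from typing import List


def extract_clauses(tagged_tokens: List[str]) -> List[str]:
    """Split tagged utterance text into clauses, dropping edited spans."""
    clauses: List[str] = []
    current: List[str] = []
    in_edit = False
    for tok in tagged_tokens:
        if tok == "<edit>":
            in_edit = True            # start of a span marked for removal
        elif tok == "</edit>":
            in_edit = False           # end of the removed span
        elif in_edit:
            continue                  # skip tokens inside an edit span
        elif tok in ("<s>", "<conj>"):
            if current:               # close the clause collected so far
                clauses.append(" ".join(current))
            current = []
        else:
            current.append(tok)
    if current:
        clauses.append(" ".join(current))
    return clauses
```

In this sketch the conjunction itself stays at the start of the clause that follows the "<conj>" tag; whether to keep or drop it is a design choice the abstract leaves open.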
