Cloud based adaptive learning for distributed sensors
    52.
    Granted invention patent
    Status: In force

    Publication number: US09177546B2

    Publication date: 2015-11-03

    Application number: US14013009

    Filing date: 2013-08-28

    Inventors: Lin Sun, Wei Ma

    CPC classification number: G10L15/063 G06F17/30743 G10L15/065 G10L2015/0635

    Abstract: A low power sound recognition sensor is configured to receive an analog signal that may contain a signature sound. Sound parameter information is extracted from the analog signal and compared to a sound parameter reference stored locally with the sound recognition sensor to detect when the signature sound is received in the analog signal. A trigger signal is generated when a signature sound is detected. A portion of the extracted sound parameter information is sent to a remote training location for adaptive training when a signature sound detection error occurs. An updated sound parameter reference from the remote training location is received in response to the adaptive training.
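
The detect, report, and update cycle in this abstract can be sketched as below. This is a minimal illustration rather than the patented method: the Euclidean distance test, the `threshold` value, and the names `SoundSensor` and `remote_train` are all assumptions, and `remote_train` is only a toy stand-in for whatever adaptive training the remote location performs.

```python
import math
from dataclasses import dataclass, field
from typing import List


@dataclass
class SoundSensor:
    """Hypothetical sensor holding a locally stored sound parameter reference."""
    reference: List[float]
    threshold: float = 0.8  # assumed distance threshold, not from the abstract
    pending_upload: List[List[float]] = field(default_factory=list)

    def detect(self, features: List[float]) -> bool:
        """Return True (the trigger) when features are close to the reference."""
        return math.dist(features, self.reference) <= self.threshold

    def report_error(self, features: List[float]) -> None:
        """Queue extracted parameters for remote training after a detection error."""
        self.pending_upload.append(list(features))

    def apply_update(self, new_reference: List[float]) -> None:
        """Install the updated reference received from the remote trainer."""
        self.reference = list(new_reference)
        self.pending_upload.clear()


def remote_train(reference: List[float], samples: List[List[float]],
                 rate: float = 0.5) -> List[float]:
    """Toy adaptive step: move the reference toward the mean of the samples."""
    n = len(samples)
    mean = [sum(column) / n for column in zip(*samples)]
    return [r + rate * (m - r) for r, m in zip(reference, mean)]
```

In this sketch the sensor only stores and compares parameters; the heavier training step runs elsewhere, which is consistent with the low-power framing of the abstract.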

    SYSTEM AND METHOD FOR PERFORMING DUAL MODE SPEECH RECOGNITION
    53.
    Invention patent application
    Status: In force

    Publication number: US20150154959A1

    Publication date: 2015-06-04

    Application number: US14621024

    Filing date: 2015-02-12

    Abstract: A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.
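
The arbitration rule in this abstract can be sketched as follows. The `Result` type, the `arbitrate` function, the 2-second default cutoff, and the word-level vocabulary update are assumptions made for illustration; the abstract does not specify data structures or values.

```python
from dataclasses import dataclass
from typing import Optional, Set


@dataclass
class Result:
    text: str          # transcription
    confidence: float  # confidence score
    latency_s: float   # time the recognizer took to answer


def within_cutoff(result: Optional[Result], cutoff_s: float) -> Optional[Result]:
    """Treat a missing or late result as a failed recognition."""
    if result is not None and result.latency_s <= cutoff_s:
        return result
    return None


def arbitrate(local: Optional[Result], remote: Optional[Result],
              vocabulary: Set[str], cutoff_s: float = 2.0) -> Optional[str]:
    """Pick the transcription to accept and update the client vocabulary."""
    local = within_cutoff(local, cutoff_s)
    remote = within_cutoff(remote, cutoff_s)
    # Whenever the remote engine succeeds, add any words the client lacks.
    if remote is not None:
        vocabulary.update(remote.text.lower().split())
    candidates = [r for r in (local, remote) if r is not None]
    if not candidates:
        return None
    # Both succeeded: the higher confidence wins. One succeeded: it is the only candidate.
    return max(candidates, key=lambda r: r.confidence).text
```

Note that the vocabulary update happens even when the local result wins on confidence, matching the "in either case" wording of the abstract.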

    MOBILE TERMINAL AND MENU CONTROL METHOD THEREOF
    54.
    Invention patent application
    Status: Under examination (published)

    Publication number: US20150126252A1

    Publication date: 2015-05-07

    Application number: US14594959

    Filing date: 2015-01-12

    Abstract: A mobile terminal including a wireless communication unit configured to provide wireless communication, a display, and a controller configured to, activate a mode for voice recognition in response to a touch input to a soft button displayed on the display or to a hard button on the mobile terminal, receive a first voice input associated with a phone call relating operation of the mobile terminal, display an indicator on the display indicating the voice input is being recognized by the mobile terminal, analyze the context of a voice command in the voice input, execute the call relating operation only if there is a single contact in a phonebook that matches the voice command in the first voice input, if there is no single contact that matches the voice command of the received voice input, display a plurality of candidates that is analyzed based on the voice command, receive a second input according to a plurality of candidates, and execute the call relating operation based on the second input.
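
The single-match rule for call commands can be illustrated with the sketch below. The substring matching, the function names, and the phonebook layout (name mapped to number) are assumptions; the abstract only states that the call is executed when exactly one contact matches and that candidates are displayed otherwise.

```python
from typing import Dict, List, Tuple, Union

Action = Tuple[str, Union[str, List[str]]]


def match_contacts(spoken_name: str, phonebook: Dict[str, str]) -> List[str]:
    """Return contact names containing the spoken name (assumed matching rule)."""
    key = spoken_name.strip().lower()
    return sorted(name for name in phonebook if key in name.lower())


def handle_call_command(spoken_name: str, phonebook: Dict[str, str]) -> Action:
    """Call directly on a single match; otherwise return candidates to display."""
    matches = match_contacts(spoken_name, phonebook)
    if len(matches) == 1:
        return ("call", phonebook[matches[0]])
    return ("choose", matches)


def handle_second_input(choice: int, candidates: List[str],
                        phonebook: Dict[str, str]) -> Action:
    """Execute the call for the candidate picked by the second input."""
    return ("call", phonebook[candidates[choice]])
```

For example, with contacts "John Smith" and "John Doe", the spoken name "john" yields a candidate list, and the second input selects which number to call.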

    System and Method of Automated Language Model Adaptation
    56.
    Invention patent application
    Status: In force

    Publication number: US20150066503A1

    Publication date: 2015-03-05

    Application number: US14291895

    Filing date: 2014-05-30

    Abstract: Systems and methods of automated adaptation of a language model for transcription of audio data include obtaining audio data. The audio data is transcribed with a language model to produce a plurality of audio file transcriptions. A quality of the plurality of audio file transcriptions is evaluated. At least one best transcription from a plurality of audio file transcriptions is selected based upon the evaluated quality. Statistics are calculated from the selected at least one best transcription from the plurality of audio file transcriptions. The language model is modified from the calculated statistics.
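
The select, count, and modify steps of this abstract can be sketched as below, under strong simplifying assumptions: the language model is reduced to unigram word counts, the transcription and quality evaluation steps are taken as given inputs, and `adapt_language_model` and `top_k` are hypothetical names.

```python
from collections import Counter
from typing import List, Tuple


def adapt_language_model(model: Counter,
                         transcriptions: List[Tuple[str, float]],
                         top_k: int = 1) -> Counter:
    """Return a new unigram model adapted from the best-scoring transcriptions.

    transcriptions holds (text, quality) pairs; how quality is evaluated
    is outside this sketch.
    """
    # Select the best transcription(s) by evaluated quality.
    best = sorted(transcriptions, key=lambda t: t[1], reverse=True)[:top_k]
    # Calculate statistics (here: word counts) from the selected transcriptions.
    stats = Counter(word for text, _ in best for word in text.lower().split())
    # Modify the model from the statistics, leaving the input model untouched.
    updated = model.copy()
    updated.update(stats)
    return updated
```

Only the selected transcriptions contribute counts, so low-quality output does not feed back into the model.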

    VOICE COMMAND DEFINITIONS USED IN LAUNCHING APPLICATION WITH A COMMAND
    59.
    Invention patent application
    Status: In force

    Publication number: US20140278419A1

    Publication date: 2014-09-18

    Application number: US13830318

    Filing date: 2013-03-14

    Abstract: A voice command definition file (VCDF) declaratively defines voice commands for an application. For example, the VCDF may include definitions for: voice commands; one or more phrases/utterances that may be said to execute each of the commands; a navigation location to navigate to within the application (e.g. a page); phrase lists containing items that may be used as a parameter in a voice command; examples; feedback; and the like. A user may say a single utterance to launch the application, navigate to the associated location of the command and execute the command. The VCDF may define multiple ways to listen for a particular command. The VCDF may be edited/defined by a user and may include a user friendly name for an application. A speech engine loads the VCDF for use such that it may recognize the commands associated with an application. The definitions may be updated during runtime.
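
To make the idea of a declarative definition concrete, the sketch below models a VCDF as a plain Python dictionary and matches an utterance against it. The field names (`app_name`, `commands`, `phrases`, `navigate`, `phrase_lists`), the `{kind}` placeholder syntax, and the `recognize` function are all invented for illustration; the abstract does not give a file format.

```python
import re
from typing import Optional

# Hypothetical definition: one command with two phrasings and one phrase list.
VCDF = {
    "app_name": "Notes",
    "commands": [
        {
            "name": "new_note",
            "phrases": ["new note", "create {kind} note"],
            "navigate": "EditPage",
        },
    ],
    "phrase_lists": {"kind": ["shopping", "work"]},
}


def phrase_to_pattern(phrase: str, phrase_lists: dict) -> str:
    """Build a regex where each {name} placeholder accepts items of its phrase list."""
    parts = re.split(r"\{(\w+)\}", phrase)  # literals at even indexes, names at odd
    pattern = ""
    for i, part in enumerate(parts):
        if i % 2 == 0:
            pattern += re.escape(part)
        else:
            items = "|".join(re.escape(item) for item in phrase_lists[part])
            pattern += "(?P<%s>%s)" % (part, items)
    return pattern


def recognize(utterance: str, vcdf: dict) -> Optional[dict]:
    """Map '<app name> <phrase>' to a command, its navigation target, and parameters."""
    text = utterance.lower().strip()
    prefix = vcdf["app_name"].lower() + " "
    if not text.startswith(prefix):
        return None
    rest = text[len(prefix):]
    for command in vcdf["commands"]:
        for phrase in command["phrases"]:
            match = re.fullmatch(phrase_to_pattern(phrase, vcdf["phrase_lists"]), rest)
            if match:
                return {"command": command["name"],
                        "navigate": command["navigate"],
                        "params": match.groupdict()}
    return None
```

Because the definitions are data rather than code, replacing the dictionary at runtime changes what is recognized, which mirrors the abstract's note that definitions may be updated during runtime.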

    System and method of extracting clauses for spoken language understanding
    60.
    Granted invention patent
    Status: In force

    Publication number: US08818793B1

    Publication date: 2014-08-26

    Application number: US10446489

    Filing date: 2003-05-28

    CPC classification number: G10L15/063 G06F17/271 G06F17/277 G10L2015/0635

    Abstract: A clausifier and method of extracting clauses for spoken language understanding are disclosed. The method relates to generating a set of clauses from speech utterance text and comprises inserting at least one boundary tag in speech utterance text related to sentence boundaries, inserting at least one edit tag indicating a portion of the speech utterance text to remove, and inserting at least one conjunction tag within the speech utterance text. The result is a set of clauses that may be identified within the speech utterance text according to the inserted at least one boundary tag, at least one edit tag and at least one conjunction tag. The disclosed clausifier comprises a sentence boundary classifier, an edit detector classifier, and a conjunction detector classifier. The clausifier may comprise a single classifier or a plurality of classifiers to perform the steps of identifying sentence boundaries, editing text, and identifying conjunctions within the text.
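
The final step of this abstract, identifying clauses from the inserted tags, can be sketched as below. The tag spellings (`<s>` for a sentence boundary, `<e>…</e>` around text to remove, `<c>…</c>` around a conjunction) and the function name are assumptions, and the classifiers that insert the tags are not modeled; the input is assumed to be already tagged.

```python
import re
from typing import List


def extract_clauses(tagged_text: str) -> List[str]:
    """Split already-tagged utterance text into clauses."""
    # Drop the portions marked for removal by edit tags.
    text = re.sub(r"<e>.*?</e>", " ", tagged_text)
    # Split at sentence boundary tags and at tagged conjunctions.
    parts = re.split(r"<s>|<c>.*?</c>", text)
    # Normalize whitespace and discard empty fragments.
    clauses = [" ".join(part.split()) for part in parts]
    return [clause for clause in clauses if clause]
```

For instance, "i want to <e>uh i want to</e> check my bill <c>and</c> i need a refund <s> thanks" yields three clauses: "i want to check my bill", "i need a refund", and "thanks".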
