MULTIPLE RECOGNIZER SPEECH RECOGNITION
    3.
    发明公开
    MULTIPLE RECOGNIZER SPEECH RECOGNITION 有权
    语音识别与多个探测设备

    公开(公告)号:EP2997571A1

    公开(公告)日:2016-03-23

    申请号:EP14726279.4

    申请日:2014-04-18

    申请人: Google Inc.

    摘要: The subject matter of this specification can be embodied in, among other things, a method that includes receiving audio data that corresponds to an utterance, obtaining a first transcription of the utterance that was generated using a limited speech recognizer. The limited speech recognizer includes a speech recognizer that includes a language model that is trained over a limited speech recognition vocabulary that includes one or more terms from a voice command grammar, but that includes fewer than all terms of an expanded grammar. A second transcription of the utterance is obtained that was generated using an expanded speech recognizer. The expanded speech recognizer includes a speech recognizer that includes a language model that is trained over an expanded speech recognition vocabulary that includes all of the terms of the expanded grammar. The utterance is classified based at least on a portion of the first transcription or the second transcription.

    PRIVACY-PRESERVING TRAINING CORPUS SELECTION
    4.
    发明公开
    PRIVACY-PRESERVING TRAINING CORPUS SELECTION 审中-公开
    隐私保护训练选择

    公开(公告)号:EP3234944A1

    公开(公告)日:2017-10-25

    申请号:EP16726756.6

    申请日:2016-05-23

    申请人: Google Inc.

    IPC分类号: G10L15/06 G06F21/62 G10L15/28

    摘要: The present disclosure relates to training a speech recognition system. A system that includes an automated speech recognizer and receives data from a client device. The system determines that at least a portion of the received data is likely sensitive data. Before the at least a portion of the received data is deleted, the system provides the at least a portion of the received data to a model training engine that trains recognition models for the automated speech recognizer. After the at least a portion of the received data is provided, the system deletes the at least a portion of the received data.

    摘要翻译: 本公开涉及训练语音识别系统。 一个包含自动语音识别器并从客户端设备接收数据的系统。 系统确定接收到的数据的至少一部分可能是敏感数据。 在至少一部分接收到的数据被删除之前,系统将接收到的数据的至少一部分提供给训练自动语音识别器的识别模型的模型训练引擎。 在提供接收到的数据的至少一部分之后,系统删除接收到的数据的至少一部分。