METHODS AND SYSTEMS FOR PROVIDING SPEECH RECOGNITION SYSTEMS BASED ON SPEECH RECORDINGS LOGS
    Invention publication (status: under examination, published)

    Publication number: EP2941768A1

    Publication date: 2015-11-11

    Application number: EP13826817.2

    Filing date: 2013-12-20

    Applicant: Google Inc.

    IPC classification: G10L15/065

    Abstract: Examples of methods and systems for providing speech recognition systems based on speech recording logs are described. In some examples, a method may be performed by a computing device within a system to generate modified data logs for use as a training data set for an acoustic model for a particular language. A device may receive one or more data logs that comprise one or more recordings of spoken queries, and may transcribe the recordings. Based on comparisons, the device may identify any transcriptions that may be indicative of noise and may remove them from the data logs. Further, the device may remove unwanted transcriptions from the data logs and may provide the modified data logs as a training data set to one or more acoustic models for particular languages.
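The filtering steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not say what is compared or how unwanted transcriptions are defined, so everything here is an assumption made for illustration: the `LogEntry` type, the `filter_data_log` function, the comparison against a reference set of noise-indicative transcriptions, and the caller-supplied `is_unwanted` rule are all hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class LogEntry:
    """Hypothetical data-log record: a recording reference and its transcription."""
    recording_id: str
    transcription: str


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so comparisons ignore trivial differences."""
    return " ".join(text.lower().split())


def filter_data_log(
    entries: Iterable[LogEntry],
    noise_transcriptions: Iterable[str],
    is_unwanted: Callable[[str], bool],
) -> List[LogEntry]:
    """Return the entries that survive both removal steps.

    Assumption: a transcription is treated as indicative of noise when it
    matches an entry in a reference set. The abstract only says "based on
    comparisons", so this is one possible reading, not the patented method.
    """
    noise = {normalize(t) for t in noise_transcriptions}
    kept: List[LogEntry] = []
    for entry in entries:
        text = normalize(entry.transcription)
        if text in noise:
            continue  # step 1: drop transcriptions that look like noise
        if is_unwanted(text):
            continue  # step 2: drop transcriptions the caller does not want
        kept.append(entry)
    return kept


# Example with made-up log entries; empty transcriptions count as unwanted here.
log = [
    LogEntry("rec-001", "weather in Paris"),
    LogEntry("rec-002", "  UH "),
    LogEntry("rec-003", ""),
    LogEntry("rec-004", "directions home"),
]
training_set = filter_data_log(log, {"uh", "hmm"}, lambda text: not text)
print([entry.recording_id for entry in training_set])
```

In this sketch the surviving entries stand in for the "modified data logs" that would be handed to acoustic-model training; the transcription step itself is omitted because it would need a speech recognizer.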
