摘要:
Examples of methods and systems for providing speech recognition systems based on speech recordings logs are described. In some examples, a method may be performed by a computing device within a system to generate modified data logs to use as a training data set for an acoustic model for a particular language. A device may receive one or more data logs that comprise at least one or more recordings of spoken queries and transcribe the recordings. Based on comparisons, the device may identify any transcriptions that may be indicative of noise and may remove those transcriptions indicative of noise from the data logs. Further, the device may remove unwanted transcriptions from the data logs and the device may provide the modified data logs as a training data set to one or more acoustic models for particular languages.