Abstract:
Disclosed are apparatus and methods for generating synthesized utterances related to output of commands. A command is received at a computing device. A textual output for the command is determined using the computing device. A spoken output of the computing device is generated that utilizes a plurality of vocal characteristic sets. At least a portion of the spoken output corresponds to the textual output. At least a first part of the spoken output utilizes vocal characteristics of a first vocal characteristic set. At least a second part of the spoken output utilizes vocal characteristics of a second vocal characteristic set, where at least some of the vocal characteristics of the first vocal characteristic set differ from the vocal characteristics of the second vocal characteristic set.
Abstract:
Methods and systems for adaptation of synthetic speech in an environment are described. In an example, a device, which may include a text-to-speech (TTS) module, may be configured to determine characteristics of an environment of the device. The device also may be configured to determine, based on the one or more characteristics of the environment, speech parameters that characterize a voice output of the text-to-speech module. Further, the device may be configured to process a text to obtain the voice output corresponding to the text based on the speech parameters to account for the one or more characteristics of the environment.