Publication No.: US20100153114A1
Publication Date: 2010-06-17
Application No.: US12333325
Filing Date: 2008-12-12
Applicant: Sheng-Yao Shih, Yun-Chiang Kung, Chiwei Che, Chih-Chung Wang
Inventor: Sheng-Yao Shih, Yun-Chiang Kung, Chiwei Che, Chih-Chung Wang
IPC Classification: G10L13/08
CPC Classification: G10L13/02, G06F17/212, G10L13/00
Abstract: Architecture for playing a document converted into an audio format to a user of an audio-output capable device. The user can interact with the device to control play of the audio document such as pause, rewind, forward, etc. In a more robust implementation, the audio-output capable device is a mobile device (e.g., cell phone) having a microphone for processing voice input. Voice commands can then be input to control play (“reading”) of the document audio file to pause, rewind, read paragraph, read next chapter, fast forward, etc. A communications server (e.g., email, attachments to email, etc.) transcodes text-based document content into an audio format by leveraging a text-to-speech (TTS) engine. The transcoded audio files are then transferred to mobile devices through viable transmission channels. Users can then play the audio-formatted document while freeing hand and eye usage for other tasks.
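The playback control described in the abstract (pause, rewind, fast forward, read paragraph, read next chapter) can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `AudioDocument`, `Player`, and `handle` are hypothetical, the strings stand in for transcoded audio clips, and the command text is assumed to have already been produced by some speech recognizer. The paragraph-sized rewind and fast-forward steps are likewise an assumption made for illustration.

```python
from dataclasses import dataclass


@dataclass
class AudioDocument:
    # chapters[i][j] stands in for the audio clip of paragraph j in chapter i
    # (hypothetical layout; the abstract does not specify a file structure).
    chapters: list[list[str]]


@dataclass
class Player:
    doc: AudioDocument
    chapter: int = 0
    paragraph: int = 0
    playing: bool = False

    def current(self) -> str:
        """Return the clip at the current position."""
        return self.doc.chapters[self.chapter][self.paragraph]

    def pause(self) -> None:
        self.playing = False

    def read_paragraph(self) -> None:
        # Start (or resume) reading the current paragraph.
        self.playing = True

    def rewind(self) -> None:
        # Assumed step size: one paragraph back, clamped to the chapter start.
        self.paragraph = max(self.paragraph - 1, 0)

    def fast_forward(self) -> None:
        # Assumed step size: one paragraph ahead, clamped to the chapter end.
        last = len(self.doc.chapters[self.chapter]) - 1
        self.paragraph = min(self.paragraph + 1, last)

    def read_next_chapter(self) -> None:
        # Jump to the first paragraph of the next chapter, if there is one.
        if self.chapter + 1 < len(self.doc.chapters):
            self.chapter += 1
            self.paragraph = 0
        self.playing = True

    def handle(self, command: str) -> bool:
        """Dispatch a recognized voice command; return False if unknown."""
        actions = {
            "pause": self.pause,
            "rewind": self.rewind,
            "fast forward": self.fast_forward,
            "read paragraph": self.read_paragraph,
            "read next chapter": self.read_next_chapter,
        }
        action = actions.get(command.strip().lower())
        if action is None:
            return False
        action()
        return True
```

A caller would build a `Player` around a transcoded document and pass each recognized phrase to `handle`, for example `Player(AudioDocument([["intro"], ["body"]])).handle("read next chapter")`. Keeping the command table in one dictionary makes it simple to add further phrases without touching the playback logic.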