SYSTEMS AND METHODS FOR CONCATENATION OF WORDS IN TEXT TO SPEECH SYNTHESIS
    21.
    发明申请
    SYSTEMS AND METHODS FOR CONCATENATION OF WORDS IN TEXT TO SPEECH SYNTHESIS 有权
    用于语音合成的系统和方法

    公开(公告)号:US20100082347A1

    公开(公告)日:2010-04-01

    申请号:US12240433

    申请日:2008-09-29

    IPC分类号: G10L13/08

    CPC分类号: G10L13/08

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。

    SYSTEMS AND METHODS FOR SPEECH PREPROCESSING IN TEXT TO SPEECH SYNTHESIS
    22.
    发明申请
    SYSTEMS AND METHODS FOR SPEECH PREPROCESSING IN TEXT TO SPEECH SYNTHESIS 审中-公开
    用于语音预处理的语音和语音合成的系统和方法

    公开(公告)号:US20100082328A1

    公开(公告)日:2010-04-01

    申请号:US12240397

    申请日:2008-09-29

    IPC分类号: G06F17/20 G10L13/08

    CPC分类号: G10L13/08 G06F17/275

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。