SYSTEMS AND METHODS OF DETECTING LANGUAGE AND NATURAL LANGUAGE STRINGS FOR TEXT TO SPEECH SYNTHESIS
    1.
    发明申请
    SYSTEMS AND METHODS OF DETECTING LANGUAGE AND NATURAL LANGUAGE STRINGS FOR TEXT TO SPEECH SYNTHESIS 有权
    用于语言合成的语言和自然语言行的检测系统和方法

    公开(公告)号:US20100082329A1

    公开(公告)日:2010-04-01

    申请号:US12240420

    申请日:2008-09-29

    IPC分类号: G06F17/20

    CPC分类号: G10L15/005 G10L13/08

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。

    Systems and methods of detecting language and natural language strings for text to speech synthesis
    2.
    发明授权
    Systems and methods of detecting language and natural language strings for text to speech synthesis 有权
    检测语言和自然语言字符串的文本到语音合成的系统和方法

    公开(公告)号:US08583418B2

    公开(公告)日:2013-11-12

    申请号:US12240420

    申请日:2008-09-29

    IPC分类号: G06F17/27 G06F17/20

    CPC分类号: G10L15/005 G10L13/08

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。

    SYSTEMS AND METHODS FOR MAPPING PHONEMES FOR TEXT TO SPEECH SYNTHESIS
    3.
    发明申请
    SYSTEMS AND METHODS FOR MAPPING PHONEMES FOR TEXT TO SPEECH SYNTHESIS 审中-公开
    用于将文本映射到语音合成的系统和方法

    公开(公告)号:US20100082327A1

    公开(公告)日:2010-04-01

    申请号:US12240410

    申请日:2008-09-29

    IPC分类号: G06F17/28

    CPC分类号: G10L13/08

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。

    Systems and methods for text normalization for text to speech synthesis
    4.
    发明授权
    Systems and methods for text normalization for text to speech synthesis 有权
    用于文本到语音合成的文本归一化的系统和方法

    公开(公告)号:US08355919B2

    公开(公告)日:2013-01-15

    申请号:US12240449

    申请日:2008-09-29

    IPC分类号: G10L13/08

    CPC分类号: G10L13/08

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。

    Systems and methods for text to speech synthesis
    5.
    发明授权
    Systems and methods for text to speech synthesis 有权
    文本到语音合成的系统和方法

    公开(公告)号:US08352272B2

    公开(公告)日:2013-01-08

    申请号:US12240404

    申请日:2008-09-29

    IPC分类号: G10L13/08

    CPC分类号: G10L13/00

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。

    SYSTEMS AND METHODS FOR TEXT TO SPEECH SYNTHESIS
    6.
    发明申请
    SYSTEMS AND METHODS FOR TEXT TO SPEECH SYNTHESIS 有权
    用于语音合成的系统和方法

    公开(公告)号:US20100082346A1

    公开(公告)日:2010-04-01

    申请号:US12240404

    申请日:2008-09-29

    IPC分类号: G10L13/08 G10L13/00 G10L21/00

    CPC分类号: G10L13/00

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。

    SYSTEMS AND METHODS FOR TEXT NORMALIZATION FOR TEXT TO SPEECH SYNTHESIS
    7.
    发明申请
    SYSTEMS AND METHODS FOR TEXT NORMALIZATION FOR TEXT TO SPEECH SYNTHESIS 有权
    用于文本语音合成的文本正则化的系统和方法

    公开(公告)号:US20100082348A1

    公开(公告)日:2010-04-01

    申请号:US12240449

    申请日:2008-09-29

    IPC分类号: G10L13/08

    CPC分类号: G10L13/08

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。

    SYSTEMS AND METHODS FOR SPEECH PREPROCESSING IN TEXT TO SPEECH SYNTHESIS
    8.
    发明申请
    SYSTEMS AND METHODS FOR SPEECH PREPROCESSING IN TEXT TO SPEECH SYNTHESIS 审中-公开
    用于语音预处理的语音和语音合成的系统和方法

    公开(公告)号:US20100082328A1

    公开(公告)日:2010-04-01

    申请号:US12240397

    申请日:2008-09-29

    IPC分类号: G06F17/20 G10L13/08

    CPC分类号: G10L13/08 G06F17/275

    摘要: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    摘要翻译: 提供了用于合成用于识别媒体资产的语音的算法。 可以从与媒体资产相关联的文本串选择性地合成语音。 文本字符串可以被归一化,并且其母语被确定用于获得目标音素,以便以用户熟悉的语言(例如,方言或重音)提供人声音语音。 算法可以在包括几个专用渲染引擎的系统上实现。 该系统可以是耦合到前端的后端的一部分,包括用于媒体资产和相关联的合成语音的存储器,以及用于接收和处理导致提供合成语音的请求的请求处理器。 前端可以通过网络将媒体资产和相关联的合成语音内容通信到主机耦合到其上播放媒体资产和合成语音的便携式电子设备的设备。