Systems and methods for text normalization for text to speech synthesis
    11.
    Granted invention patent
    Status: In force

    Publication No.: US08355919B2

    Publication date: 2013-01-15

    Application No.: US12240449

    Filing date: 2008-09-29

    IPC classification: G10L13/08

    CPC classification: G10L13/08

    Abstract: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
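
    The normalization step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not list the actual rules, so the abbreviation table and the function name `normalize` below are hypothetical; the sketch only shows the general idea of expanding tokens a synthesizer would otherwise mispronounce.

```python
# Hypothetical expansion table; the abstract does not specify the real rules.
ABBREVIATIONS = {
    "feat.": "featuring",
    "vol.": "volume",
    "pt.": "part",
    "&": "and",
}


def normalize(text: str) -> str:
    """Expand abbreviations and symbols so the string can be read aloud."""
    tokens = text.split()  # split() also collapses repeated whitespace
    expanded = [ABBREVIATIONS.get(token.lower(), token) for token in tokens]
    return " ".join(expanded)


if __name__ == "__main__":
    print(normalize("Greatest Hits Vol. 2 & More"))
```

    A lookup table keeps the sketch short; a production system would likely need context-sensitive rules (for example, for numbers and dates), which are outside what the abstract describes.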

    SYSTEMS AND METHODS FOR SELECTIVE RATE OF SPEECH AND SPEECH PREFERENCES FOR TEXT TO SPEECH SYNTHESIS
    12.
    Invention patent application
    Status: In force

    Publication No.: US20100082344A1

    Publication date: 2010-04-01

    Application No.: US12240437

    Filing date: 2008-09-29

    IPC classification: G10L13/00

    CPC classification: G10L13/033

    Abstract: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    Media presentation with supplementary media
    13.
    Invention patent application
    Status: In force

    Publication No.: US20060168150A1

    Publication date: 2006-07-27

    Application No.: US11369480

    Filing date: 2006-03-06

    IPC classification: G06F15/16

    Abstract: Improved techniques for providing supplementary media for media items are disclosed. The media items are typically fixed media items. The supplementary media is one or more of audio, video, image, or text that is provided by a user to supplement (e.g., personalize, customize, annotate, etc.) the fixed media items. In one embodiment, the supplementary media can be provided by user interaction with an on-line media store where media items can be browsed, searched, purchased and/or acquired via a computer network. In another embodiment, the supplementary media can be generated on a playback device.

    Systems and methods for text to speech synthesis
    15.
    Granted invention patent
    Status: In force

    Publication No.: US08352272B2

    Publication date: 2013-01-08

    Application No.: US12240404

    Filing date: 2008-09-29

    IPC classification: G10L13/08

    CPC classification: G10L13/00

    Abstract: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    Method and apparatus for filtering email
    16.
    Invention patent application
    Status: In force

    Publication No.: US20070106742A1

    Publication date: 2007-05-10

    Application No.: US11643304

    Filing date: 2006-12-20

    IPC classification: G06F15/16

    CPC classification: G06Q10/107 H04L51/12

    Abstract: A method and apparatus for filtering messages, comprising: determining a first semantic anchor corresponding to a first group of messages (for example, legitimate messages) and a second semantic anchor corresponding to a second group of messages (for example, unsolicited messages); determining a vector corresponding to an incoming message; comparing the vector corresponding to the incoming message with at least one of the first semantic anchor and the second semantic anchor to obtain a first comparison value and a second comparison value; and filtering the incoming message based on the first comparison value and the second comparison value.
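
    The filtering steps in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the abstract does not say how anchors or vectors are built, so the sketch assumes plain bag-of-words vectors, anchors computed as the average vector of each message group, and cosine similarity as the comparison value. All function names are hypothetical.

```python
import math
from collections import Counter


def vectorize(message: str) -> Counter:
    """Bag-of-words vector (assumption: lowercase, whitespace tokens)."""
    return Counter(message.lower().split())


def anchor(messages: list[str]) -> dict[str, float]:
    """Anchor for a group of messages, assumed here to be the mean vector."""
    total = Counter()
    for message in messages:
        total.update(vectorize(message))
    return {word: count / len(messages) for word, count in total.items()}


def cosine(a: dict, b: dict) -> float:
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(value * b.get(word, 0.0) for word, value in a.items())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def is_unsolicited(message: str, legit_anchor: dict, spam_anchor: dict) -> bool:
    """Filter on the two comparison values: closer to the spam anchor wins."""
    vector = vectorize(message)
    first_value = cosine(vector, legit_anchor)
    second_value = cosine(vector, spam_anchor)
    return second_value > first_value


if __name__ == "__main__":
    legit = anchor(["meeting agenda for monday", "project status report attached"])
    spam = anchor(["win free money now", "free prize claim now"])
    print(is_unsolicited("claim your free money", legit, spam))
    print(is_unsolicited("monday project meeting", legit, spam))
```

    Ties are treated as legitimate here so that an ambiguous message is delivered rather than dropped; that choice is also an assumption.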

    Systems and methods of detecting language and natural language strings for text to speech synthesis
    17.
    Granted invention patent
    Status: In force

    Publication No.: US08583418B2

    Publication date: 2013-11-12

    Application No.: US12240420

    Filing date: 2008-09-29

    IPC classification: G06F17/27 G06F17/20

    CPC classification: G10L15/005 G10L13/08

    Abstract: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
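
    Determining the native language of a text string, as the abstract puts it, could be approximated in many ways. The sketch below is one simple heuristic and not the method claimed in the patent: it counts letters by Unicode script and maps the dominant script to a language code. The script-to-language table and the function name `guess_language` are assumptions for illustration, and a script alone cannot distinguish languages that share it (for example, English and French).

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from the first word of a Unicode character name
# to a language code; deliberately coarse.
SCRIPT_TO_LANGUAGE = {
    "LATIN": "en",
    "CYRILLIC": "ru",
    "HIRAGANA": "ja",
    "KATAKANA": "ja",
    "CJK": "zh",
    "HANGUL": "ko",
}


def guess_language(text: str, default: str = "en") -> str:
    """Return the language whose script covers the most letters in text."""
    counts = Counter()
    for char in text:
        if not char.isalpha():
            continue  # skip digits, punctuation, and whitespace
        # Character names start with the script, e.g. "CYRILLIC SMALL LETTER A".
        script = unicodedata.name(char, "").split(" ", 1)[0]
        language = SCRIPT_TO_LANGUAGE.get(script)
        if language is not None:
            counts[language] += 1
    if not counts:
        return default
    return counts.most_common(1)[0][0]


if __name__ == "__main__":
    for title in ["Hello world", "Привет", "こんにちは", "1234"]:
        print(title, guess_language(title))
```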

    SYSTEMS AND METHODS FOR TEXT NORMALIZATION FOR TEXT TO SPEECH SYNTHESIS
    18.
    Invention patent application
    Status: In force

    Publication No.: US20100082348A1

    Publication date: 2010-04-01

    Application No.: US12240449

    Filing date: 2008-09-29

    IPC classification: G10L13/08

    CPC classification: G10L13/08

    Abstract: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    SYSTEMS AND METHODS FOR CONCATENATION OF WORDS IN TEXT TO SPEECH SYNTHESIS
    19.
    Invention patent application
    Status: In force

    Publication No.: US20100082347A1

    Publication date: 2010-04-01

    Application No.: US12240433

    Filing date: 2008-09-29

    IPC classification: G10L13/08

    CPC classification: G10L13/08

    Abstract: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.

    SYSTEMS AND METHODS FOR SPEECH PREPROCESSING IN TEXT TO SPEECH SYNTHESIS
    20.
    Invention patent application
    Status: Under examination (published)

    Publication No.: US20100082328A1

    Publication date: 2010-04-01

    Application No.: US12240397

    Filing date: 2008-09-29

    IPC classification: G06F17/20 G10L13/08

    CPC classification: G10L13/08 G06F17/275

    Abstract: Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized from text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
