Dynamic audio ducking
    1.
    Granted patent
    Dynamic audio ducking (status: in force)

    Publication No.: US08428758B2

    Publication date: 2013-04-23

    Application No.: US12371861

    Filing date: 2009-02-16

    CPC classification: G10L21/00

    Abstract: Various dynamic audio ducking techniques are provided that may be applied where multiple audio streams, such as a primary audio stream and a secondary audio stream, are being played back simultaneously. For example, a secondary audio stream may include a voice announcement of one or more pieces of information pertaining to the primary audio stream, such as the name of the track or the name of the artist. In one embodiment, the primary audio data and the voice feedback data are initially analyzed to determine a loudness value. Based on their respective loudness values, the primary audio stream may be ducked during the period of simultaneous playback such that a relative loudness difference is generally maintained with respect to the loudness of the primary and secondary audio streams. Accordingly, the amount of ducking applied may be customized for each piece of audio data depending on its loudness characteristics.
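
    Illustrative sketch: the loudness-based ducking described in the abstract above could look roughly like the following. This is a minimal sketch under stated assumptions, not the patented method. It assumes loudness is estimated as an RMS level in dB relative to full scale, and that a fixed 12 dB gap is the relative loudness difference to maintain. The names rms_db, ducking_gain_db, duck and target_diff_db are hypothetical and do not come from the patent.

```python
import math


def rms_db(samples):
    """Estimate loudness as the RMS level in dB relative to full scale (1.0).

    Assumption: RMS stands in for whatever loudness measure the patent uses.
    """
    if not samples:
        return float("-inf")
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square == 0.0:
        return float("-inf")
    return 10.0 * math.log10(mean_square)


def ducking_gain_db(primary_db, secondary_db, target_diff_db=12.0):
    """Gain (never above 0 dB) for the primary stream during overlap.

    The gain places the primary stream target_diff_db below the secondary
    stream. A primary stream that is already quiet enough is left untouched.
    """
    if math.isinf(primary_db) or math.isinf(secondary_db):
        return 0.0  # a silent stream needs no ducking
    desired_primary_db = secondary_db - target_diff_db
    return min(0.0, desired_primary_db - primary_db)


def duck(samples, gain_db):
    """Apply a gain in dB to a list of float samples."""
    factor = 10.0 ** (gain_db / 20.0)
    return [s * factor for s in samples]


# Hypothetical data: a loud music track and a quieter voice announcement.
primary = [0.5] * 100
secondary = [0.25] * 100

gain = ducking_gain_db(rms_db(primary), rms_db(secondary))
ducked = duck(primary, gain)
```

    Because the gain is derived from both loudness values, a loud track is attenuated more than a quiet one, and a track that already sits far enough below the announcement is not ducked at all. That corresponds to the per-track customization the abstract mentions.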

    DYNAMIC AUDIO DUCKING
    2.
    Patent application
    DYNAMIC AUDIO DUCKING (status: in force)

    Publication No.: US20100211199A1

    Publication date: 2010-08-19

    Application No.: US12371861

    Filing date: 2009-02-16

    IPC classification: G06F17/00

    CPC classification: G10L21/00

    Abstract: Various dynamic audio ducking techniques are provided that may be applied where multiple audio streams, such as a primary audio stream and a secondary audio stream, are being played back simultaneously. For example, a secondary audio stream may include a voice announcement of one or more pieces of information pertaining to the primary audio stream, such as the name of the track or the name of the artist. In one embodiment, the primary audio data and the voice feedback data are initially analyzed to determine a loudness value. Based on their respective loudness values, the primary audio stream may be ducked during the period of simultaneous playback such that a relative loudness difference is generally maintained with respect to the loudness of the primary and secondary audio streams. Accordingly, the amount of ducking applied may be customized for each piece of audio data depending on its loudness characteristics.

    Methods for controlling the generation of speech from text representing one or more names
    3.
    Granted patent
    Methods for controlling the generation of speech from text representing one or more names (status: lapsed)

    Publication No.: US5832435A

    Publication date: 1998-11-03

    Application No.: US790578

    Filing date: 1997-01-29

    CPC classification: G10L13/10 G10L13/04 G10L13/08

    Abstract: Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.

    Adaptive methods for controlling the annunciation rate of synthesized speech
    4.
    Granted patent
    Adaptive methods for controlling the annunciation rate of synthesized speech (status: lapsed)

    Publication No.: US5749071A

    Publication date: 1998-05-05

    Application No.: US790580

    Filing date: 1997-01-29

    CPC classification: G10L13/10 G10L13/04 G10L13/08

    Abstract: Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.

    Automated voice synthesis employing enhanced prosodic treatment of text, spelling of text and rate of annunciation
    5.
    Granted patent
    Automated voice synthesis employing enhanced prosodic treatment of text, spelling of text and rate of annunciation (status: lapsed)

    Publication No.: US5652828A

    Publication date: 1997-07-29

    Application No.: US641480

    Filing date: 1996-03-01

    CPC classification: G10L13/10 G10L13/04 G10L13/08

    Abstract: Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.

    Media presentation with supplementary media
    6.
    Granted patent
    Media presentation with supplementary media (status: in force)

    Publication No.: US08046689B2

    Publication date: 2011-10-25

    Application No.: US11369480

    Filing date: 2006-03-06

    IPC classification: G06F3/16

    Abstract: Improved techniques for providing supplementary media for media items are disclosed. The media items are typically fixed media items. The supplementary media is one or more of audio, video, image, or text that is provided by a user to supplement (e.g., personalize, customize, annotate, etc.) the fixed media items. In one embodiment, the supplementary media can be provided by user interaction with an on-line media store where media items can be browsed, searched, purchased and/or acquired via a computer network. In another embodiment, the supplementary media can be generated on a playback device.

    Automated voice synthesis from text having a restricted known informational content
    7.
    Granted patent
    Automated voice synthesis from text having a restricted known informational content (status: lapsed)

    Publication No.: US5890117A

    Publication date: 1999-03-30

    Application No.: US818705

    Filing date: 1997-03-14

    CPC classification: G10L13/10 G10L13/04 G10L13/08

    Abstract: Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.

    Method for synthesizing speech from text and for spelling all or portions of the text by analogy
    8.
    Granted patent
    Method for synthesizing speech from text and for spelling all or portions of the text by analogy (status: lapsed)

    Publication No.: US5751906A

    Publication date: 1998-05-12

    Application No.: US790579

    Filing date: 1997-01-29

    CPC classification: G10L13/10 G10L13/04 G10L13/08

    Abstract: Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.

    Methods for controlling the generation of speech from text representing names and addresses
    9.
    Granted patent
    Methods for controlling the generation of speech from text representing names and addresses (status: lapsed)

    Publication No.: US5732395A

    Publication date: 1998-03-24

    Application No.: US790581

    Filing date: 1997-01-29

    CPC classification: G10L13/10 G10L13/04 G10L13/08

    Abstract: Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.