DISCOVERING CAPABILITIES OF THIRD-PARTY VOICE-ENABLED RESOURCES
    Invention application
    Legal status: In force

    Publication number: US20160189717A1

    Publication date: 2016-06-30

    Application number: US14586449

    Filing date: 2014-12-30

    CPC classification number: G10L17/22 G10L15/22 G10L15/26

    Abstract: Techniques are described for discovering capabilities of voice-enabled resources. A voice-controlled digital personal assistant can respond to user requests to list available voice-enabled resources that are capable of performing a specific task using voice input. The voice-controlled digital personal assistant can also respond to user requests to list the tasks that a particular voice-enabled resource can perform using voice input. The voice-controlled digital personal assistant can also support a practice mode in which users practice voice commands for performing tasks supported by voice-enabled resources.

    User interface for generating expressive content

    Publication number: US11321890B2

    Publication date: 2022-05-03

    Application number: US15347653

    Filing date: 2016-11-09

    Abstract: Techniques for generating expressive content are provided. An expressive synthesized speech system provides improved voice-authoring user interfaces that let a user efficiently author content for generating expressive output. The system provides an expressive keyboard for entering textual content and for selecting expressive operators, such as emoji objects or punctuation objects, that apply predetermined prosody attributes or visual effects to the textual content. A voicesetting editor mode enables the user to author and adjust particular prosody attributes associated with the content in order to compose carefully crafted synthetic speech. An active listening mode (ALM) is also provided; when it is selected, a set of ALM effect options is displayed, each associated with a particular sound effect and/or visual effect, so that the user can rapidly respond with expressive vocal sound effects or visual effects while listening to others speak.
