SYSTEM AND METHOD FOR DISTRIBUTED VOICE MODELS ACROSS CLOUD AND DEVICE FOR EMBEDDED TEXT-TO-SPEECH
    3.
    发明申请
    SYSTEM AND METHOD FOR DISTRIBUTED VOICE MODELS ACROSS CLOUD AND DEVICE FOR EMBEDDED TEXT-TO-SPEECH 有权
    用于分布式语音模型的系统和方法用于嵌入式文本到语音的云和设备

    公开(公告)号:US20160086598A1

    公开(公告)日:2016-03-24

    申请号:US14953771

    申请日:2015-11-30

    CPC classification number: G10L13/04 G10L13/047 G10L13/07

    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify a speech synthesis context, and determine, based on a local cache of text-to-speech units for a text-to-speech voice and based on the speech synthesis context, additional text-to-speech units which are not in the local cache. The system can request from a server the additional text-to-speech units, and store the additional text-to-speech units in the local cache. The system can then synthesize speech using the text-to-speech units and the additional text-to-speech units in the local cache. The system can prune the cache as the context changes, based on availability of local storage, or after synthesizing the speech. The local cache can store a core set of text-to-speech units associated with the text-to-speech voice that cannot be pruned from the local cache.

    Abstract translation: 用于智能缓存用于语音合成的级联语音单元的系统,方法和计算机可读存储介质。 配置为实施该方法的系统可以识别语音合成上下文,并且基于用于文本到语音语音的文本到语音单元的本地高速缓存并且基于语音合成上下文来确定附加的文本 - 不在本地缓存中的语音单元。 系统可以从服务器请求附加的文本到语音单元,并将附加的文本到语音单元存储在本地高速缓存中。 然后,系统可以使用本地高速缓存中的文本到语音单元和附加的文本到语音单元来合成语音。 系统可以根据本地存储的可用性,或合成语音之后随着上下文的变化修剪缓存。 本地缓存可以存储与文本到语音语音相关联的文本到语音单元的核心集合,其不能从本地高速缓存中修剪。

    VOICE-ENABLED DIALOG INTERACTION WITH WEB PAGES
    5.
    发明申请
    VOICE-ENABLED DIALOG INTERACTION WITH WEB PAGES 有权
    语音通话对话与网页的交互

    公开(公告)号:US20150149168A1

    公开(公告)日:2015-05-28

    申请号:US14092033

    申请日:2013-11-27

    Abstract: Voice enabled dialog with web pages is provided. An Internet address of a web page is received including an area with which a user of a client device can specify information. The web page is loaded using the received Internet address of the web page. A task structure of the web page is then extracted. An abstract representation of the web is then generated. A dialog script, based on the abstract representation of the web page is then provided. Spoken information received from the user is converted into text and the converted text is inserted into the area.

    Abstract translation: 提供带有网页功能的语音启用对话框。 接收到网页的互联网地址,包括客户端设备的用户可以指定信息的区域。 使用接收到的网页的Internet地址来加载网页。 然后提取网页的任务结构。 然后生成Web的抽象表示。 然后提供基于网页的抽象表示的对话框脚本。 从用户接收的口语信息被转换为文本,并将转换后的文本插入该区域。

Patent Agency Ranking