System and method for cloud-based text-to-speech web services

    公开(公告)号:US09412359B2

    公开(公告)日:2016-08-09

    申请号:US14684893

    申请日:2015-04-13

    CPC classification number: G10L13/04 G10L13/00 G10L13/043

    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating speech. One variation of the method is from a server side, and another variation of the method is from a client side. The server side method, as implemented by a network-based automatic speech processing system, includes first receiving, from a network client independent of knowledge of internal operations of the system, a request to generate a text-to-speech voice. The request can include speech samples, transcriptions of the speech samples, and metadata describing the speech samples. The system extracts sound units from the speech samples based on the transcriptions and generates an interactive demonstration of the text-to-speech voice based on the sound units, the transcriptions, and the metadata, wherein the interactive demonstration hides a back end processing implementation from the network client. The system provides access to the interactive demonstration to the network client.

    Automated detection and filtering of audio advertisements
    13.
    发明授权
    Automated detection and filtering of audio advertisements 有权
    音频广告的自动检测和过滤

    公开(公告)号:US09183177B2

    公开(公告)日:2015-11-10

    申请号:US13867264

    申请日:2013-04-22

    Abstract: Methods, apparatuses, and media for filtering a data stream are provided. The data stream is partitioned into a plurality of data stream segments. An acoustic parameter of each of the data stream segments is measured, and it is determined whether the acoustic parameter of each of the data stream segments satisfies a predetermined condition. Extraneous segments of the data stream segments are identified in which the predetermined condition is satisfied, and it is determined whether the extraneous segments have a predetermined relationship in the data stream. The extraneous segments are deleted from the data stream to produce a filtered data stream in response to the extraneous segments having the predetermined relationship.

    Abstract translation: 提供了用于过滤数据流的方法,设备和介质。 数据流被划分成多个数据流段。 测量每个数据流段的声学参数,并且确定每个数据流段的声学参数是否满足预定条件。 识别数据流段的外部段,其中满足预定条件,并且确定外部段在数据流中是否具有预定关系。 响应于具有预定关系的外部段,从数据流中删除无关段以产生经滤波的数据流。

Patent Agency Ranking