DEEP NETWORKS FOR UNIT SELECTION SPEECH SYNTHESIS
    11.
    发明申请
    DEEP NETWORKS FOR UNIT SELECTION SPEECH SYNTHESIS 有权
    DEEP网络选择语音合成

    公开(公告)号:US20150073804A1

    公开(公告)日:2015-03-12

    申请号:US14019967

    申请日:2013-09-06

    Applicant: Google Inc.

    CPC classification number: G10L13/06 G10L25/30

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for providing a representation based on structured data in resources. The methods, systems, and apparatus include actions of receiving target acoustic features output from a neural network that has been trained to predict acoustic features given linguistic features. Additional actions include determining a distance between the target acoustic features and acoustic features of a stored acoustic sample. Further actions include selecting the acoustic sample to be used in speech synthesis based at least on the determined distance and synthesizing speech based on the selected acoustic sample.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于基于资源中的结构化数据提供表示。 方法,系统和装置包括接收从神经网络输出的目标声学特征的动作,所述神经网络已被训练以预测具有语言特征的声学特征。 附加动作包括确定目标声学特征与存储的声学样本的声学特征之间的距离。 进一步的动作包括至少基于所确定的距离来选择要在语音合成中使用的声学样本,并且基于所选择的声学样本来合成语音。

    Multilingual prosody generation
    12.
    发明授权

    公开(公告)号:US09905220B2

    公开(公告)日:2018-02-27

    申请号:US14942300

    申请日:2015-11-16

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for multilingual prosody generation. In some implementations, data indicating a set of linguistic features corresponding to a text is obtained. Data indicating the linguistic features and data indicating the language of the text are provided as input to a neural network that has been trained to provide output indicating prosody information for multiple languages. The neural network can be a neural network having been trained using speech in multiple languages. Output indicating prosody information for the linguistic features is received from the neural network. Audio data representing the text is generated using the output of the neural network.

    Methods and systems for automated generation of nativized multi-lingual lexicons
    13.
    发明授权
    Methods and systems for automated generation of nativized multi-lingual lexicons 有权
    自动生成本土化多语言词典的方法和系统

    公开(公告)号:US09263028B2

    公开(公告)日:2016-02-16

    申请号:US14283586

    申请日:2014-05-21

    Applicant: Google Inc.

    Abstract: An input signal that includes linguistic content in a first language may be received by a computing device. The linguistic content may include text or speech. The computing device may associate the linguistic content in the first language with one or more phonemes from a second language. The computing device may also determine a phonemic representation of the linguistic content in the first language based on use of the one or more phonemes from the second language. The phonemic representation may be indicative of a pronunciation of the linguistic content in the first language according to speech sounds of the second language.

    Abstract translation: 包括第一语言的语言内容的输入信号可以被计算设备接收。 语言内容可能包括文字或言语。 计算设备可将第一语言中的语言内容与来自第二语言的一个或多个音素相关联。 计算设备还可以基于来自第二语言的一个或多个音素的使用来确定第一语言中的语言内容的音位表示。 根据第二语言的语音,音素表示可以指示第一语言中的语言内容的发音。

    Methods and systems for sharing of adapted voice profiles
    14.
    发明授权
    Methods and systems for sharing of adapted voice profiles 有权
    用于共享适应语音配置文件的方法和系统

    公开(公告)号:US09117451B2

    公开(公告)日:2015-08-25

    申请号:US13872401

    申请日:2013-04-29

    Applicant: Google Inc.

    Abstract: Methods and systems for sharing of adapted voice profiles are provided. The method may comprise receiving, at a computing system, one or more speech samples, and the one or more speech samples may include a plurality of spoken utterances. The method may further comprise determining, at the computing system, a voice profile associated with a speaker of the plurality of spoken utterances, and including an adapted voice of the speaker. Still further, the method may comprise receiving, at the computing system, an authorization profile associated with the determined voice profile, and the authorization profile may include one or more user identifiers associated with one or more respective users. Yet still further, the method may comprise the computing system providing the voice profile to at least one computing device associated with the one or more respective users, based at least in part on the authorization profile.

    Abstract translation: 提供了用于共享适应语音简档的方法和系统。 该方法可以包括在计算系统处接收一个或多个语音样本,并且所述一个或多个语音样本可以包括多个讲话语音。 该方法还可以包括在计算系统处确定与多个讲话话语中的说话者相关联的语音简档,并且包括说话者的适配语音。 此外,该方法可以包括在计算系统处接收与所确定的语音简档相关联的授权简档,并且授权简档可以包括与一个或多个相应用户相关联的一个或多个用户标识符。 此外,该方法可以包括至少部分地基于授权简档而将语音简档提供给与一个或多个相应用户相关联的至少一个计算设备的计算系统。

    Creation of spoken news programs
    15.
    发明授权
    Creation of spoken news programs 有权
    创造口语新闻节目

    公开(公告)号:US09111534B1

    公开(公告)日:2015-08-18

    申请号:US13830703

    申请日:2013-03-14

    Applicant: Google Inc.

    CPC classification number: G10L13/00 G06Q30/0251

    Abstract: Implementations related to system and techniques for providing audio news reports are discussed. A computer-implemented method includes identifying, with a computer system, one or more news preferences for a first user, selecting a plurality of news stories, wherein particular ones of the new stories are determined to be responsive to the news preferences for the first user and comprise audio versions of stories converted automatically from textual news stories, assembling, with the computer system and for the first user, an audio news report that includes the audio versions of the selected news stories, and delivering, to a computing device, the assembled audio news report.

    Abstract translation: 讨论了与提供音频新闻报道的系统和技术相关的实现。 计算机实现的方法包括用计算机系统识别用于第一用户的一个或多个新闻偏好,选择多个新闻故事,其中确定特定新故事以响应于第一用户的新闻偏好 并且包括从文本新闻故事自动转换的故事的音频版本,与计算机系统和第一用户组装包括所选新闻的音频版本的音频新闻报告,以及将计算设备传送到组装 音频新闻报道。

    Methods and Systems for Sharing of Adapted Voice Profiles
    16.
    发明申请
    Methods and Systems for Sharing of Adapted Voice Profiles 有权
    用于分享适应语音配置文件的方法和系统

    公开(公告)号:US20140236598A1

    公开(公告)日:2014-08-21

    申请号:US13872401

    申请日:2013-04-29

    Applicant: Google Inc.

    Abstract: Methods and systems for sharing of adapted voice profiles are provided. The method may comprise receiving, at a computing system, one or more speech samples, and the one or more speech samples may include a plurality of spoken utterances. The method may further comprise determining, at the computing system, a voice profile associated with a speaker of the plurality of spoken utterances, and including an adapted voice of the speaker. Still further, the method may comprise receiving, at the computing system, an authorization profile associated with the determined voice profile, and the authorization profile may include one or more user identifiers associated with one or more respective users. Yet still further, the method may comprise the computing system providing the voice profile to at least one computing device associated with the one or more respective users, based at least in part on the authorization profile.

    Abstract translation: 提供了用于共享适应语音简档的方法和系统。 该方法可以包括在计算系统处接收一个或多个语音样本,并且所述一个或多个语音样本可以包括多个讲话语音。 该方法还可以包括在计算系统处确定与多个讲话话语中的说话者相关联的语音简档,并且包括说话者的适配语音。 此外,该方法可以包括在计算系统处接收与所确定的语音简档相关联的授权简档,并且授权简档可以包括与一个或多个相应用户相关联的一个或多个用户标识符。 此外,该方法可以包括至少部分地基于授权简档而将语音简档提供给与一个或多个相应用户相关联的至少一个计算设备的计算系统。

    Devices and methods for speech unit reduction in text-to-speech synthesis systems
    17.
    发明授权
    Devices and methods for speech unit reduction in text-to-speech synthesis systems 有权
    文本到语音合成系统中语音单元缩减的设备和方法

    公开(公告)号:US08751236B1

    公开(公告)日:2014-06-10

    申请号:US14061118

    申请日:2013-10-23

    Applicant: Google Inc.

    CPC classification number: G10L13/06

    Abstract: A device may receive a plurality of speech sounds that are indicative of pronunciations of a first linguistic term. The device may determine concatenation features of the plurality of speech sounds. The concatenation features may be indicative of an acoustic transition between a first speech sound and a second speech sound when the first speech sound and the second speech sound are concatenated. The first speech sound may be included in the plurality of speech sounds and the second speech sound may be indicative of a pronunciation of a second linguistic term. The device may cluster the plurality of speech sounds into one or more clusters based on the concatenation features. The device may provide a representative speech sound of the given cluster as the first speech sound when the first speech sound and the second speech sound are concatenated.

    Abstract translation: 设备可以接收指示第一语言术语的发音的多个语音。 设备可以确定多个语音的连接特征。 当第一语音和第二语音被级联时,级联特征可以指示第一语音和第二语音之间的声转换。 第一语音可以被包括在多个语音中,第二语音可以指示第二语言术语的发音。 该装置可以基于级联特征将多个语音进行聚类成一个或多个簇。 当第一语音和第二语音被级联时,该设备可以提供给定簇的代表性语音作为第一语音。

Patent Agency Ranking