Coherent Pitch and Intensity Modification of Speech Signals

    公开(公告)号:US20170092286A1

    公开(公告)日:2017-03-30

    申请号:US15378107

    申请日:2016-12-14

    Inventor: Alexander Sorin

    Abstract: A method comprising: receiving an utterance, an original pitch contour of the utterance, and a target pitch contour for the utterance, wherein the utterance comprises a plurality of consecutive frames, and wherein at least one of said frames is a voiced frame; calculating an original intensity contour of said utterance; generating a pitch modified utterance based on the target pitch contour; calculating an intensity modification factor for each of said frames, based on said original pitch contour and on said target pitch contour, to produce a sequence of intensity modification factors corresponding to said plurality of consecutive frames; calculating a final intensity contour for said utterance by applying said intensity modification factors to said original intensity contour; and generating a coherently modified speech signal by time dependent scaling of the intensity of said pitch modified utterance according to said final intensity contour.

    Text to speech method and system using voice characteristic dependent weighting
    147.
    发明授权
    Text to speech method and system using voice characteristic dependent weighting 有权
    使用语音特征依赖加权的文本到语音方法和系统

    公开(公告)号:US09454963B2

    公开(公告)日:2016-09-27

    申请号:US13799962

    申请日:2013-03-13

    CPC classification number: G10L15/26 G10L13/033 G10L13/08 G10L2021/0135

    Abstract: A text-to-speech method for simulating a plurality of different voice characteristics includes dividing inputted text into a sequence of acoustic units; selecting voice characteristics for the inputted text; converting the sequence of acoustic units to a sequence of speech vectors using an acoustic model having a plurality of model parameters provided in clusters each having at least one sub-cluster and describing probability distributions which relate an acoustic unit to a speech vector; and outputting the sequence of speech vectors as audio with the selected voice characteristics. A parameter of a predetermined type of each probability distribution is expressed as a weighted sum of parameters of the same type using voice characteristic dependent weighting. In converting the sequence of acoustic units to a sequence of speech vectors, the voice characteristic dependent weights for the selected voice characteristics are retrieved for each cluster such that there is one weight per sub-cluster.

    Abstract translation: 用于模拟多个不同语音特征的文本到语音方法包括将输入的文本划分为声学单元序列; 选择输入文本的语音特征; 使用具有多个模型参数的声学模型将所述声学单元的序列转换为语音向量序列,所述多个模型参数在每个具有至少一个子簇的簇中描述,并描述将声学单元与语音向量相关联的概率分布; 并输出所述语音向量序列作为具有所选语音特征的音频。 每个概率分布的预定类型的参数被表示为使用语音特性依赖加权的相同类型的参数的加权和。 在将声音单元的序列转换为语音向量序列时,针对每个群集检索所选语音特征的语音特征依赖权重,使得每个子群集存在一个权重。

    Customizable system and device for defining voice dimensions and methods of use
    148.
    发明授权
    Customizable system and device for defining voice dimensions and methods of use 有权
    可定制的系统和设备,用于定义语音维度和使用方法

    公开(公告)号:US09401157B2

    公开(公告)日:2016-07-26

    申请号:US14279489

    申请日:2014-05-16

    Inventor: Richard Fink, IV

    Abstract: The disclosure provides a customizable system for modifying voice dimensions. The system comprises a program interface located on an electronic device. The program interface is used to manipulate user input from one or more individuals relating to voice parameters. Instructions are then created by the program interface that allow for one or more individuals to modify the voice dimensions of the one or more individuals by following the instructions.The disclosure further provides a method for modifying an individual's voice dimensions. The method comprises identifying one or more dimensions in an individual's vocal dimensions that are to be modified. On an electronic device, a voice exercise is created by selecting at least one parameter that modifies the one or more dimensions in an individual's voice. Instructions created by the electronic device that are based on the selection of at least one parameter are then followed by the individual.

    Abstract translation: 本公开提供了一种用于修改语音维度的可定制系统。 该系统包括位于电子设备上的程序接口。 程序界面用于操纵一个或多个个人与语音参数有关的用户输入。 然后由程序界面创建允许一个或多个个人通过遵循说明来修改一个或多个个人的语音维度的指令。 本公开还提供了一种用于修改个人的语音维度的方法。 该方法包括识别要修改的个人声部维度中的一个或多个维度。 在电子设备上,通过选择修改个人语音中的一个或多个维度的至少一个参数来创建语音练习。 由电子设备创建的基于至少一个参数的选择的指令随后由该个体追踪。

    VOICE SIGNAL MODULATION SERVICE FOR GEOGRAPHIC AREAS
    149.
    发明申请
    VOICE SIGNAL MODULATION SERVICE FOR GEOGRAPHIC AREAS 有权
    地理区域语音信号调制服务

    公开(公告)号:US20160019912A1

    公开(公告)日:2016-01-21

    申请号:US14332729

    申请日:2014-07-16

    Abstract: Modulating a voice signal is provided. The voice signal corresponding to a voice communication is received from a sending voice communication device via a network. Voice signal features corresponding to the voice communication are extracted. A set of voice signal filters are selected to modulate the extracted voice signal features corresponding to the voice communication to an average voice signal associated with a geographic area where the voice communication is destined for. The voice signal features corresponding to the voice communication are modulated by applying the selected set of voice signal filters to generate the average voice signal associated with the geographic area where the voice communication is destined for.

    Abstract translation: 提供调制语音信号。 通过网络从发送语音通信设备接收对应于语音通信的语音信号。 提取与语音通信对应的语音信号。 选择一组语音信号滤波器以将与语音通信相对应的提取的语音信号特征调制为与语音通信所针对的地理区域相关联的平均语音信号。 通过应用所选择的一组语音信号滤波器来调制对应于语音通信的语音信号,以产生与语音通信所针对的地理区域相关联的平均语音信号。

    Methods and Systems for Voice Conversion
    150.
    发明申请
    Methods and Systems for Voice Conversion 有权
    语音转换方法与系统

    公开(公告)号:US20160005403A1

    公开(公告)日:2016-01-07

    申请号:US14631464

    申请日:2015-02-25

    Applicant: Google Inc.

    CPC classification number: G10L15/07 G10L17/06 G10L25/75 G10L2021/0135

    Abstract: A device may receive data indicative of a plurality of speech sounds associated with first voice characteristics of a first voice. The device may receive an input indicative of speech associated with second voice characteristics of a second voice. The device may map at least one portion of the speech of the second voice to one or more speech sounds of the plurality of speech sounds of the first voice. The device may compare the first voice characteristics with the second voice characteristics based on the map. The comparison may include vocal tract characteristics, nasal cavity characteristics, and voicing characteristics. The device may determine a given representation configured to associate the first voice characteristics with the second voice characteristics. The device may provide an output indicative of pronunciations of the one or more speech sounds of the first voice according to the second voice characteristics based on the given representation.

    Abstract translation: 设备可以接收指示与第一语音的第一语音特征相关联的多个语音的数据。 设备可以接收指示与第二语音的第二语音特征相关联的语音的输入。 设备可以将第二语音的语音的至少一部分映射到第一语音的多个语音的一个或多个语音。 设备可以基于地图将第一语音特征与第二语音特征进行比较。 比较可以包括声道特征,鼻腔特征和发音特征。 设备可以确定被配置为将第一语音特征与第二语音特征相关联的给定表示。 该装置可以基于给定的表示,根据第二语音特征提供指示第一语音的一个或多个语音的发音的输出。

Patent Agency Ranking