TECHNOLOGIES FOR AUTOMATIC SPEECH RECOGNITION USING ARTICULATORY PARAMETERS

    Publication number: US20170278517A1

    Publication date: 2017-09-28

    Application number: US15080687

    Filing date: 2016-03-25

    Abstract: Technologies for automatic speech recognition using articulatory parameters are disclosed. An automatic speech recognition device may capture speech data from a speaker and also capture an image of the speaker. The automatic speech recognition device may determine one or more articulatory parameters based on the image, such as a jaw angle, a lip protrusion, or a lip height, and compare those parameters with articulatory parameters of training users. After selecting training users with articulatory parameters similar to those of the speaker, the automatic speech recognition device may select training data associated with the selected training users, including parameters to use for an automatic speech recognition algorithm. By using the parameters already optimized for training users with articulatory parameters similar to those of the speaker, the automatic speech recognition device may quickly adapt an automatic speech recognition algorithm to the speaker.
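
    The selection step described in this abstract can be sketched as a nearest-neighbor search over articulatory parameters. The sketch below is only an illustration under stated assumptions: the record layout, the `warp_factor` parameter, the unnormalized Euclidean distance, and the averaging of the selected users' parameters are all hypothetical choices, not details taken from the patent.

```python
import math

# Hypothetical training records; every name and value here is illustrative.
TRAINING_USERS = [
    {"id": "user_a",
     "articulatory": {"jaw_angle": 10.0, "lip_protrusion": 5.0, "lip_height": 12.0},
     "asr_params": {"warp_factor": 1.0}},
    {"id": "user_b",
     "articulatory": {"jaw_angle": 11.0, "lip_protrusion": 5.5, "lip_height": 12.5},
     "asr_params": {"warp_factor": 1.5}},
    {"id": "user_c",
     "articulatory": {"jaw_angle": 25.0, "lip_protrusion": 9.0, "lip_height": 20.0},
     "asr_params": {"warp_factor": 0.5}},
]


def articulatory_distance(a, b):
    """Euclidean distance over the parameter names present in `a`.

    Assumption: parameters are compared unnormalized; a real system would
    likely scale each parameter before comparing.
    """
    return math.sqrt(sum((a[name] - b[name]) ** 2 for name in a))


def select_similar_users(speaker, training_users, k=2):
    """Return the k training users closest to the speaker."""
    return sorted(
        training_users,
        key=lambda user: articulatory_distance(speaker, user["articulatory"]),
    )[:k]


def initial_asr_params(selected):
    """Average the selected users' ASR parameters (an assumed combination rule)."""
    names = selected[0]["asr_params"].keys()
    return {
        name: sum(user["asr_params"][name] for user in selected) / len(selected)
        for name in names
    }


# Parameters that would be measured from the speaker's image (hypothetical values).
speaker = {"jaw_angle": 10.5, "lip_protrusion": 5.2, "lip_height": 12.2}
selected = select_similar_users(speaker, TRAINING_USERS)
params = initial_asr_params(selected)
print([user["id"] for user in selected], params)
```

    The resulting parameters would serve only as a starting point for adaptation; how the device actually combines training data from the selected users is not specified in the abstract.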

    METHOD AND APPARATUS TO SYNTHESIZE VOICE BASED ON FACIAL STRUCTURES

    Publication number: US20180322862A1

    Publication date: 2018-11-08

    Application number: US16039053

    Filing date: 2018-07-18

    CPC classification number: G10L13/027 G06K9/00315 G10L13/033 G10L13/047

    Abstract: A method for establishing an articulatory speech synthesis model of a person's voice includes acquiring image data representing a visage of a person, in which the visage includes facial characteristics defining exteriorly visible articulatory speech synthesis model parameters of the person's voice; selecting a predefined articulatory speech synthesis model from among stores of predefined models, the selection based at least in part on one or both of the facial characteristics or the exteriorly visible articulatory speech synthesis model parameters; and associating at least a portion of the selected predefined articulatory speech synthesis model with the articulatory speech synthesis model of the person's voice.

    Technologies for automatic speech recognition using articulatory parameters

    Publication number: US10540975B2

    Publication date: 2020-01-21

    Application number: US15080687

    Filing date: 2016-03-25

    Abstract: Technologies for automatic speech recognition using articulatory parameters are disclosed. An automatic speech recognition device may capture speech data from a speaker and also capture an image of the speaker. The automatic speech recognition device may determine one or more articulatory parameters based on the image, such as a jaw angle, a lip protrusion, or a lip height, and compare those parameters with articulatory parameters of training users. After selecting training users with articulatory parameters similar to those of the speaker, the automatic speech recognition device may select training data associated with the selected training users, including parameters to use for an automatic speech recognition algorithm. By using the parameters already optimized for training users with articulatory parameters similar to those of the speaker, the automatic speech recognition device may quickly adapt an automatic speech recognition algorithm to the speaker.

    Method and apparatus to synthesize voice based on facial structures

    Publication number: US10056073B2

    Publication date: 2018-08-21

    Application number: US15440371

    Filing date: 2017-02-23

    CPC classification number: G10L13/027 G06K9/00315 G10L13/033 G10L13/047

    Abstract: A method, performed by a user equipment device, for text-to-speech conversion entails sending to an articulatory model server exterior facial structural information of a person, receiving from the articulatory model server at least a portion of a predefined articulatory model that corresponds to the exterior facial structural information, the predefined articulatory model representing a voice of a modeled person who is different from the person, and generating, based at least partly on the predefined articulatory model, speech from text stored in a memory of the user equipment device. Furthermore, a method of configuring text-to-speech conversion for a user equipment device entails determining at least a portion of an articulatory model that corresponds to exterior facial structural information based on a comparison of the exterior facial structural information to exterior facial structural information stored in a database of articulatory models.
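
    The exchange described in this abstract (send exterior facial structural information, receive a portion of a matching predefined model) might look roughly like the following. This is a minimal in-process sketch, assuming a hypothetical `ArticulatoryModelServer` class, made-up measurement tuples, and a squared-distance match; the patent does not specify the data format, the matching rule, or any API, and the speech generation step is omitted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ArticulatoryModel:
    """A predefined model of some modeled person (hypothetical layout)."""
    modeled_person: str
    facial_structure: tuple  # exterior measurements, illustrative only
    parameters: dict         # articulatory model parameters, illustrative only


class ArticulatoryModelServer:
    """In-process stand-in for the articulatory model server."""

    def __init__(self, models):
        self._models = list(models)

    def lookup(self, facial_structure, wanted=None):
        """Return the closest model's owner and the requested portion of it."""
        def squared_distance(model):
            return sum(
                (a - b) ** 2
                for a, b in zip(facial_structure, model.facial_structure)
            )

        best = min(self._models, key=squared_distance)
        names = best.parameters.keys() if wanted is None else wanted
        portion = {n: best.parameters[n] for n in names if n in best.parameters}
        return best.modeled_person, portion


server = ArticulatoryModelServer([
    ArticulatoryModel("model_1", (60.0, 47.0, 120.0),
                      {"tract_length_cm": 17.0, "lip_area_cm2": 1.5}),
    ArticulatoryModel("model_2", (52.0, 40.0, 105.0),
                      {"tract_length_cm": 14.5, "lip_area_cm2": 1.1}),
])

# The user equipment sends its measurements and asks for part of the model.
person, portion = server.lookup((61.0, 48.0, 118.0), wanted=["tract_length_cm"])
print(person, portion)
```

    The returned model belongs to a modeled person other than the user, which matches the abstract; the user equipment would then feed the received portion into its own text-to-speech step.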

    METHOD AND APPARATUS TO SYNTHESIZE VOICE BASED ON FACIAL STRUCTURES

    Invention application (in force)

    Publication number: US20160093284A1

    Publication date: 2016-03-31

    Application number: US14496832

    Filing date: 2014-09-25

    CPC classification number: G10L13/027 G06K9/00315 G10L13/033 G10L13/047

    Abstract: Disclosed are embodiments for use in an articulatory-based text-to-speech conversion system configured to establish an articulatory speech synthesis model of a person's voice based on facial characteristics defining exteriorly visible articulatory speech synthesis model parameters of the person's voice and on a predefined articulatory speech synthesis model selected from among stores of predefined models.