METHOD AND SYSTEM FOR APPLYING SYNTHETIC SPEECH TO SPEAKER IMAGE

    公开(公告)号:US20230206896A1

    公开(公告)日:2023-06-29

    申请号:US18113671

    申请日:2023-02-24

    申请人: NEOSAPIENCE, INC.

    IPC分类号: G10L13/08 G10L25/30 G10L15/02

    摘要: The present disclosure relates to a method for applying synthesis voice to a speaker image, in which the method includes receiving an input text, inputting the input text to an artificial neural network text-to-speech synthesis model and outputting voice data for the input text, generating a synthesis voice corresponding to the output voice data, and generating information on a plurality of phonemes included in the output voice data, in which the information on the plurality of phonemes may include timing information for each of the plurality of phonemes included in the output voice data.

    METHOD FOR SEARCHING FOR CONTENTS HAVING SAME VOICE AS VOICE OF TARGET SPEAKER, AND APPARATUS FOR EXECUTING SAME

    公开(公告)号:US20210280173A1

    公开(公告)日:2021-09-09

    申请号:US17319566

    申请日:2021-05-13

    申请人: NEOSAPIENCE, INC.

    摘要: A method for searching content having same voice as a voice of a target speaker from among a plurality of contents includes extracting a feature vector corresponding to the voice of the target speaker, selecting any subset of speakers from a training dataset repeatedly by a predetermined number of times, generating linear discriminant analysis (LDA) transformation matrices using each of the selected any subsets of speakers repeatedly by a predetermined number of times, projecting the extracted speaker feature vector to the selected corresponding subsets of speakers using each of the generated LDA transformation matrices, assigning a value corresponding to nearby speaker class among corresponding subsets of speakers, to each of projection regions of the extracted speaker feature vector, generating a hash value corresponding to the extracted feature vector based on the assigned values, and searching content having a similar hash value to the generated hash value among the contents.

    SPEECH TRANSLATION METHOD AND SYSTEM USING MULTILINGUAL TEXT-TO-SPEECH SYNTHESIS MODEL

    公开(公告)号:US20200342852A1

    公开(公告)日:2020-10-29

    申请号:US16925888

    申请日:2020-07-10

    申请人: NEOSAPIENCE, INC.

    IPC分类号: G10L13/08 G06F40/40 G10L25/30

    摘要: A speech translation method using a multilingual text-to-speech synthesis model includes acquiring a single artificial neural network text-to-speech synthesis model having acquired learning based on a learning text of a first language and learning speech data of the first language corresponding to the learning text of the first language, and a learning text of a second language and learning speech data of the second language corresponding to the learning text of the second language, receiving input speech data of the first language and an articulatory feature of a speaker regarding the first language, converting the input speech data of the first language into a text of the first language, converting the text of the first language into a text of the second language, and generating output speech data for the text of the second language that simulates the speaker's speech.

    METHOD FOR PERFORMING SYNTHETIC SPEECH GENERATION OPERATION ON TEXT

    公开(公告)号:US20230186895A1

    公开(公告)日:2023-06-15

    申请号:US18108080

    申请日:2023-02-10

    申请人: NEOSAPIENCE, INC.

    IPC分类号: G10L13/10 G10L13/027

    CPC分类号: G10L13/10 G10L13/027

    摘要: A method for performing the synthetic speech generation operation on text is provided, including receiving a plurality of sentences, receiving a plurality of speech style characteristics for the plurality of sentences, inputting the plurality of sentences and the plurality of speech style characteristics into an artificial neural network text-to-speech synthesis model, so as to generate a plurality of synthetic speeches for the plurality of sentences that reflect the plurality of speech style characteristics, and receiving a response to at least one of the plurality of synthetic speeches.

    METHOD AND SYSTEM FOR GENERATING SYNTHETIC SPEECH FOR TEXT THROUGH USER INTERFACE

    公开(公告)号:US20210142783A1

    公开(公告)日:2021-05-13

    申请号:US17152913

    申请日:2021-01-20

    申请人: NEOSAPIENCE, INC.

    摘要: A method for generating synthetic speech for text through a user interface is provided. The method may include receiving one or more sentences, determining a speech style characteristic for the received one or more sentences, and outputting a synthetic speech for the one or more sentences that reflects the determined speech style characteristic. The one or more sentences and the determined speech style characteristic may be inputted to an artificial neural network text-to-speech synthesis model and the synthetic speech may be generated based on the speech data outputted from the artificial neural network text-to-speech synthesis model.