Voice recognition apparatus and method

    公开(公告)号:US10546574B2

    公开(公告)日:2020-01-28

    申请号:US15686913

    申请日:2017-08-25

    Abstract: A voice recognition apparatus and corresponding method include a processor configured to calculate a probability distribution corresponding to an intent associated with an utterance of a user by applying pre-stored training data to an input voice signal input based on the utterance. The processor is also configured to select a target feature extractor including either one or both of a training-based feature extractor and a rule-based feature extractor using the calculated probability distribution, and extract a feature associated with the utterance based on the selected target feature extractor.

    Method and apparatus for generating speech

    公开(公告)号:US11580963B2

    公开(公告)日:2023-02-14

    申请号:US17069927

    申请日:2020-10-14

    Abstract: A speech generation method and apparatus are disclosed. The speech generation method includes obtaining, by a processor, a linguistic feature and a prosodic feature from an input text, determining, by the processor, a first candidate speech element through a cost calculation and a Viterbi search based on the linguistic feature and the prosodic feature, generating, at a speech element generator implemented at the processor, a second candidate speech element based on the linguistic feature or the prosodic feature and the first candidate speech element, and outputting, by the processor, an output speech by concatenating the second candidate speech element and a speech sequence determined through the Viterbi search.

    Method and device with natural language processing

    公开(公告)号:US12039277B2

    公开(公告)日:2024-07-16

    申请号:US17186830

    申请日:2021-02-26

    CPC classification number: G06F40/30 G06N3/08 G06N5/02

    Abstract: A method and device with natural language processing is disclosed. The method includes performing a word embedding of an input sentence, encoding a result of the word embedding, using an encoder of a natural language processing model, to generate a context embedding vector, decoding the context embedding vector, using a decoder of the natural language processing model, to generate an output sentence corresponding to the input sentence, generating a score indicating a relationship between the context embedding vector and each of a plurality of knowledge embedding vectors, determining a first loss based on the output sentence, determining a second loss based on the generated score, and performing training of the natural language processing model, including training the natural language processing model based on the determined first loss, and training the natural language processing model based on the determined second loss.

Patent Agency Ranking