DECODING METHOD AND APPARATUS IN ARTIFICIAL NEURAL NETWORK FOR SPEECH RECOGNITION

    公开(公告)号:US20230306961A1

    公开(公告)日:2023-09-28

    申请号:US18321876

    申请日:2023-05-23

    CPC classification number: G10L15/16

    Abstract: A decoding method and apparatus in an artificial neural network for speech recognition. The decoding method in the artificial neural network for speech recognition includes performing a first decoding task of decoding a feature including speech information and at least one token recognized up to current time, using a shared decoding layer included in the artificial neural network, performing a second decoding task of decoding the at least one token, using the shared decoding layer, and determining an output token to be recognized subsequent to the at least one token based on a result of the first decoding task and a result of the second decoding task.

    METHOD AND APPARATUS FOR DETERMINING OUTPUT TOKEN

    公开(公告)号:US20210110259A1

    公开(公告)日:2021-04-15

    申请号:US16851300

    申请日:2020-04-17

    Inventor: Min-Joong LEE

    Abstract: A method for determining an output token includes predicting a first probability of each of candidate output tokens of a first model, predicting a second probability of each of the candidate output tokens of a second model interworking with the first model, adjusting the second probability of each of the candidate output tokens based on the first probability, and determining the output token among the candidate output tokens based on the first probability and the adjusted second probability.

    METHOD AND APPARATUS FOR PROCESSING SEQUENCE

    公开(公告)号:US20210081610A1

    公开(公告)日:2021-03-18

    申请号:US16844362

    申请日:2020-04-09

    Abstract: A sequence processing method and apparatus are provided. The sequence processing method includes determining a word of a first R-node corresponding to a root node based on an input sequence, generating first I-nodes that are connected to the first R-node and include relative position information with respect to the word of the first R-node, determining a word of a second R-node to correspond to each of the first I-nodes, and determining an output sequence corresponding to the input sequence based on the determined words.

    DECODING METHOD AND APPARATUS IN ARTIFICIAL NEURAL NETWORK FOR SPEECH RECOGNITION

    公开(公告)号:US20210035562A1

    公开(公告)日:2021-02-04

    申请号:US16844401

    申请日:2020-04-09

    Abstract: A decoding method and apparatus in an artificial neural network for speech recognition. The decoding method in the artificial neural network for speech recognition includes performing a first decoding task of decoding a feature including speech information and at least one token recognized up to current time, using a shared decoding layer included in the artificial neural network, performing a second decoding task of decoding the at least one token, using the shared decoding layer, and determining an output token to be recognized subsequent to the at least one token based on a result of the first decoding task and a result of the second decoding task.

Patent Agency Ranking