-
公开(公告)号:US11574190B2
公开(公告)日:2023-02-07
申请号:US16851300
申请日:2020-04-17
Applicant: Samsung Electronics Co., Ltd.
Inventor: Min-Joong Lee
Abstract: A method for determining an output token includes predicting a first probability of each of candidate output tokens of a first model, predicting a second probability of each of the candidate output tokens of a second model interworking with the first model, adjusting the second probability of each of the candidate output tokens based on the first probability, and determining the output token among the candidate output tokens based on the first probability and the adjusted second probability.
-
公开(公告)号:US12100392B2
公开(公告)日:2024-09-24
申请号:US18321876
申请日:2023-05-23
Applicant: Samsung Electronics Co., Ltd.
Inventor: Min-Joong Lee , Tae Gyoon Kang
IPC: G10L15/16
CPC classification number: G10L15/16
Abstract: A decoding method and apparatus in an artificial neural network for speech recognition. The decoding method in the artificial neural network for speech recognition includes performing a first decoding task of decoding a feature including speech information and at least one token recognized up to current time, using a shared decoding layer included in the artificial neural network, performing a second decoding task of decoding the at least one token, using the shared decoding layer, and determining an output token to be recognized subsequent to the at least one token based on a result of the first decoding task and a result of the second decoding task.
-
公开(公告)号:US11100374B2
公开(公告)日:2021-08-24
申请号:US16671639
申请日:2019-11-01
Applicant: Samsung Electronics Co., Ltd.
Inventor: Young-Seok Kim , Hwidong Na , Seongmin Ok , Min-Joong Lee
Abstract: A processor-implemented classification method includes: determining a first probability vector including a first probability, for each of a plurality of classes, resulting from a classification of an input with respect to the classes; determining, based on the determined first probability vector, whether one or more of the classes represented in the first probability vector are confusing classes; adjusting, in response to one or more of the classes being the confusing classes, the determined first probability vector based on a first probability of each of the confusing classes and a maximum value of the first probabilities; determining a second probability vector including a second probability, for each of the classes, resulting from another classification of the input with respect to the classes; and performing classification on the input based on a result of a comparison between the determined second probability vector and the adjusted first probability vector.
-
公开(公告)号:US12073825B2
公开(公告)日:2024-08-27
申请号:US17986000
申请日:2022-11-14
Applicant: Samsung Electronics Co., Ltd.
Inventor: Min-Joong Lee
CPC classification number: G10L15/16 , G06N3/045 , G06N3/088 , G10L15/197 , G10L15/22
Abstract: A speech recognition method includes adding a preset special sequence to a front end of an input sequence that corresponds to an input utterance of a speaker, recognizing the preset special sequence and the input sequence, and recognizing the input sequence based on the preset special sequence and a speech recognition result obtained by recognizing the preset special sequence and the input sequence.
-
公开(公告)号:US11694677B2
公开(公告)日:2023-07-04
申请号:US16844401
申请日:2020-04-09
Applicant: Samsung Electronics Co., Ltd.
Inventor: Min-Joong Lee , Tae Gyoon Kang
IPC: G10L15/16
CPC classification number: G10L15/16
Abstract: A decoding method and apparatus in an artificial neural network for speech recognition. The decoding method in the artificial neural network for speech recognition includes performing a first decoding task of decoding a feature including speech information and at least one token recognized up to current time, using a shared decoding layer included in the artificial neural network, performing a second decoding task of decoding the at least one token, using the shared decoding layer, and determining an output token to be recognized subsequent to the at least one token based on a result of the first decoding task and a result of the second decoding task.
-
公开(公告)号:US12211496B2
公开(公告)日:2025-01-28
申请号:US17064879
申请日:2020-10-07
Applicant: Samsung Electronics Co., Ltd.
Inventor: Min-Joong Lee
Abstract: A processor-implemented utterance time estimation method includes: determining a plurality of attention weight matrices using an attention-based sequence-to-sequence model; selecting an attention weight matrix from the plurality of attention weight matrices; and estimating an utterance time corresponding to an output sequence based on the selected attention weight matrix.
-
公开(公告)号:US11983626B2
公开(公告)日:2024-05-14
申请号:US17109490
申请日:2020-12-02
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Min-Joong Lee
Abstract: A method and apparatus for improving the quality of an attention-based sequence-to-sequence model. The method includes determining an output sequence corresponding to an input sequence based on an attention-based sequence-to-sequence model, selecting at least one target attention head from among a plurality of attention heads, detecting at least one error output token among output tokens constituting the output sequence based on the target attention head, and correcting the output sequence based on the error output token.
-
公开(公告)号:US11501761B2
公开(公告)日:2022-11-15
申请号:US16787701
申请日:2020-02-11
Applicant: Samsung Electronics Co., Ltd.
Inventor: Min-Joong Lee
Abstract: A speech recognition method includes adding a preset special sequence to a front end of an input sequence that corresponds to an input utterance of a speaker, recognizing the preset special sequence and the input sequence, and recognizing the input sequence based on the preset special sequence and a speech recognition result obtained by recognizing the preset special sequence and the input sequence.
-
公开(公告)号:US11361757B2
公开(公告)日:2022-06-14
申请号:US16388930
申请日:2019-04-19
Applicant: Samsung Electronics Co., Ltd.
Inventor: Min-Joong Lee
IPC: G10L15/16 , G06F40/284 , G06F40/274
Abstract: A processor-implemented decoding method in a first neural network is provided. The method predicts probabilities of candidates of an output token based on at least one previously input token, determines the output token among the candidates based on the predicted probabilities; and determines a next input token by selecting one of the output token and a pre-defined special token based on a determined probability of the output token.
-
公开(公告)号:US11249756B2
公开(公告)日:2022-02-15
申请号:US16812600
申请日:2020-03-09
Applicant: Samsung Electronics Co., Ltd.
Inventor: Tae Gyoon Kang , Min-Joong Lee
Abstract: A processor implemented natural language processing method and apparatus are provided. The natural language processing method includes converting a natural language phrase into a token vector, calculating a repetition count of the token vector, and generating an input vector by encoding the token vector based on the calculated repetition count and a position of the token vector.
-
-
-
-
-
-
-
-
-