-
公开(公告)号:US11776529B2
公开(公告)日:2023-10-03
申请号:US17368983
申请日:2021-07-07
Applicant: Samsung Electronics Co., Ltd.
Inventor: Tae Gyoon Kang
Abstract: A method, the method includes determining a target segment partially overlapping a preceding segment from a speech signal, determining a target character sequence corresponding to the target segment by decoding the target segment, identifying a first overlapping portion between the target character sequence and a preceding character sequence based on an edit distance, and merging the target character sequence and the preceding character sequence based on the first overlapping portion. A cost applied to the edit distance is determined based on any one or any combination of any two or more of a type of operation performed at the edit distance, whether characters to be operated are located in the first overlapping portion, and whether the characters to be operated match. A portion overlapping the preceding segment in the target segment is greater than or equal to 8.3% of the target segment.
-
公开(公告)号:US12100392B2
公开(公告)日:2024-09-24
申请号:US18321876
申请日:2023-05-23
Applicant: Samsung Electronics Co., Ltd.
Inventor: Min-Joong Lee , Tae Gyoon Kang
IPC: G10L15/16
CPC classification number: G10L15/16
Abstract: A decoding method and apparatus in an artificial neural network for speech recognition. The decoding method in the artificial neural network for speech recognition includes performing a first decoding task of decoding a feature including speech information and at least one token recognized up to current time, using a shared decoding layer included in the artificial neural network, performing a second decoding task of decoding the at least one token, using the shared decoding layer, and determining an output token to be recognized subsequent to the at least one token based on a result of the first decoding task and a result of the second decoding task.
-
公开(公告)号:US20210335341A1
公开(公告)日:2021-10-28
申请号:US17368983
申请日:2021-07-07
Applicant: Samsung Electronics Co., Ltd.
Inventor: Tae Gyoon Kang
Abstract: A method, the method includes determining a target segment partially overlapping a preceding segment from a speech signal, determining a target character sequence corresponding to the target segment by decoding the target segment, identifying a first overlapping portion between the target character sequence and a preceding character sequence based on an edit distance, and merging the target character sequence and the preceding character sequence based on the first overlapping portion. A cost applied to the edit distance is determined based on any one or any combination of any two or more of a type of operation performed at the edit distance, whether characters to be operated are located in the first overlapping portion, and whether the characters to be operated match. A portion overlapping the preceding segment in the target segment is greater than or equal to 8.3% of the target segment.
-
公开(公告)号:US11670290B2
公开(公告)日:2023-06-06
申请号:US17106599
申请日:2020-11-30
Applicant: Samsung Electronics Co., Ltd.
Inventor: Tae Gyoon Kang
IPC: G10L15/197 , G10L15/22 , G06N3/08 , G10L15/16 , G06N3/045
CPC classification number: G10L15/197 , G06N3/045 , G06N3/08 , G10L15/16 , G10L15/22
Abstract: A speech signal processing method and apparatus is disclosed. The speech signal processing method includes receiving an input token that is based on a speech signal, calculating first probability values respectively corresponding to candidate output tokens based on the input token, adjusting at least one of the first probability values based on a priority of each of the first probability values, and processing the speech signal based on an adjusted probability value obtained by the adjusting.
-
公开(公告)号:US11249756B2
公开(公告)日:2022-02-15
申请号:US16812600
申请日:2020-03-09
Applicant: Samsung Electronics Co., Ltd.
Inventor: Tae Gyoon Kang , Min-Joong Lee
Abstract: A processor implemented natural language processing method and apparatus are provided. The natural language processing method includes converting a natural language phrase into a token vector, calculating a repetition count of the token vector, and generating an input vector by encoding the token vector based on the calculated repetition count and a position of the token vector.
-
公开(公告)号:US11721323B2
公开(公告)日:2023-08-08
申请号:US17083854
申请日:2020-10-29
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Tae Gyoon Kang
Abstract: A method, the method includes determining a target segment from a speech signal, determining a target character sequence corresponding to the target segment by decoding the target segment, identifying a first overlapping portion between the target character sequence and a preceding character sequence based on an edit distance, and merging the target character sequence and the preceding character sequence based on the first overlapping portion. A cost applied to the edit distance is determined based on any one or any combination of any two or more of a type of operation performed at the edit distance, whether characters to be operated are located in the first overlapping portion, and whether the characters to be operated match.
-
公开(公告)号:US11694677B2
公开(公告)日:2023-07-04
申请号:US16844401
申请日:2020-04-09
Applicant: Samsung Electronics Co., Ltd.
Inventor: Min-Joong Lee , Tae Gyoon Kang
IPC: G10L15/16
CPC classification number: G10L15/16
Abstract: A decoding method and apparatus in an artificial neural network for speech recognition. The decoding method in the artificial neural network for speech recognition includes performing a first decoding task of decoding a feature including speech information and at least one token recognized up to current time, using a shared decoding layer included in the artificial neural network, performing a second decoding task of decoding the at least one token, using the shared decoding layer, and determining an output token to be recognized subsequent to the at least one token based on a result of the first decoding task and a result of the second decoding task.
-
公开(公告)号:US10509864B2
公开(公告)日:2019-12-17
申请号:US15947915
申请日:2018-04-09
Applicant: Samsung Electronics Co., Ltd.
Inventor: Tae Gyoon Kang , Hodong Lee
Abstract: A language model training method and an apparatus using the language model training method are disclosed. The language model training method includes assigning a context vector to a target translation vector, obtaining feature vectors based on the target translation vector and the context vector, generating a representative vector representing the target translation vector using an attention mechanism for the feature vectors, and training a language model based on the target translation vector, the context vector, and the representative vector.
-
-
-
-
-
-
-