Method and apparatus with speech processing

    公开(公告)号:US11776529B2

    公开(公告)日:2023-10-03

    申请号:US17368983

    申请日:2021-07-07

    Inventor: Tae Gyoon Kang

    CPC classification number: G10L15/04 G10L15/22 G10L15/26

    Abstract: A method, the method includes determining a target segment partially overlapping a preceding segment from a speech signal, determining a target character sequence corresponding to the target segment by decoding the target segment, identifying a first overlapping portion between the target character sequence and a preceding character sequence based on an edit distance, and merging the target character sequence and the preceding character sequence based on the first overlapping portion. A cost applied to the edit distance is determined based on any one or any combination of any two or more of a type of operation performed at the edit distance, whether characters to be operated are located in the first overlapping portion, and whether the characters to be operated match. A portion overlapping the preceding segment in the target segment is greater than or equal to 8.3% of the target segment.

    Decoding method and apparatus in artificial neural network for speech recognition

    公开(公告)号:US12100392B2

    公开(公告)日:2024-09-24

    申请号:US18321876

    申请日:2023-05-23

    CPC classification number: G10L15/16

    Abstract: A decoding method and apparatus in an artificial neural network for speech recognition. The decoding method in the artificial neural network for speech recognition includes performing a first decoding task of decoding a feature including speech information and at least one token recognized up to current time, using a shared decoding layer included in the artificial neural network, performing a second decoding task of decoding the at least one token, using the shared decoding layer, and determining an output token to be recognized subsequent to the at least one token based on a result of the first decoding task and a result of the second decoding task.

    METHOD AND APPARATUS WITH SPEECH PROCESSING

    公开(公告)号:US20210335341A1

    公开(公告)日:2021-10-28

    申请号:US17368983

    申请日:2021-07-07

    Inventor: Tae Gyoon Kang

    Abstract: A method, the method includes determining a target segment partially overlapping a preceding segment from a speech signal, determining a target character sequence corresponding to the target segment by decoding the target segment, identifying a first overlapping portion between the target character sequence and a preceding character sequence based on an edit distance, and merging the target character sequence and the preceding character sequence based on the first overlapping portion. A cost applied to the edit distance is determined based on any one or any combination of any two or more of a type of operation performed at the edit distance, whether characters to be operated are located in the first overlapping portion, and whether the characters to be operated match. A portion overlapping the preceding segment in the target segment is greater than or equal to 8.3% of the target segment.

    Method and apparatus with speech processing

    公开(公告)号:US11721323B2

    公开(公告)日:2023-08-08

    申请号:US17083854

    申请日:2020-10-29

    Inventor: Tae Gyoon Kang

    CPC classification number: G10L15/02 G10L15/10

    Abstract: A method, the method includes determining a target segment from a speech signal, determining a target character sequence corresponding to the target segment by decoding the target segment, identifying a first overlapping portion between the target character sequence and a preceding character sequence based on an edit distance, and merging the target character sequence and the preceding character sequence based on the first overlapping portion. A cost applied to the edit distance is determined based on any one or any combination of any two or more of a type of operation performed at the edit distance, whether characters to be operated are located in the first overlapping portion, and whether the characters to be operated match.

    Decoding method and apparatus in artificial neural network for speech recognition

    公开(公告)号:US11694677B2

    公开(公告)日:2023-07-04

    申请号:US16844401

    申请日:2020-04-09

    CPC classification number: G10L15/16

    Abstract: A decoding method and apparatus in an artificial neural network for speech recognition. The decoding method in the artificial neural network for speech recognition includes performing a first decoding task of decoding a feature including speech information and at least one token recognized up to current time, using a shared decoding layer included in the artificial neural network, performing a second decoding task of decoding the at least one token, using the shared decoding layer, and determining an output token to be recognized subsequent to the at least one token based on a result of the first decoding task and a result of the second decoding task.

Patent Agency Ranking