Method and apparatus with speech processing

    公开(公告)号:US11508369B2

    公开(公告)日:2022-11-22

    申请号:US16797069

    申请日:2020-02-21

    Inventor: Sanghyun Yoo

    Abstract: Disclosed is a method and apparatus for processing a speech. The method includes obtaining context information from a speech signal of a user using a neural network-based encoder, determining intent information of the speech signal based on the context information, determining, based on the context information, attention information corresponding to a segment included in the speech signal, and determining, based on the attention information, a segment value of the segment by recognizing, using a decoder, a portion of the context information identified as corresponding to the segment.

    Speech recognition method and apparatus

    公开(公告)号:US11282501B2

    公开(公告)日:2022-03-22

    申请号:US16656700

    申请日:2019-10-18

    Abstract: A speech recognition method and apparatus, including implementation and/or training, are disclosed. The speech recognition method includes obtaining a speech signal, and performing a recognition of the speech signal, including generating a dialect parameter, for the speech signal, from input dialect data using a parameter generation model, applying the dialect parameter to a trained speech recognition model to generate a dialect speech recognition model, and generating a speech recognition result from the speech signal by implementing, with respect to the speech signal, the dialect speech recognition model. The speech recognition method and apparatus may perform speech recognition and/or training of the speech recognition model and the parameter generation model.

    Method and apparatus with speech processing

    公开(公告)号:US11830493B2

    公开(公告)日:2023-11-28

    申请号:US17973452

    申请日:2022-10-25

    Inventor: Sanghyun Yoo

    CPC classification number: G10L15/22 G10L15/30 G10L2015/223 G10L2015/228

    Abstract: Disclosed is a method and apparatus for processing a speech. The method includes obtaining context information from a speech signal of a user using a neural network-based encoder, determining, based on the context information, attention information corresponding to a segment included in the speech signal, and recognizing, based on the attention information, the segment by decoding a portion of the context information identified as corresponding to the segment.

Patent Agency Ranking