Speech processing method and apparatus therefor

    Publication No.: US11373656B2

    Publication date: 2022-06-28

    Application No.: US16704988

    Filing date: 2019-12-05

    Abstract: Disclosed are a speech processing method and a speech processing apparatus that perform speech processing in a 5G communication environment by executing embedded artificial intelligence (AI) algorithms and/or machine learning algorithms. The speech processing method includes determining a temporary pause of reception of a first spoken utterance, outputting a first spoken response utterance as a result of speech recognition processing of a second spoken utterance received after the temporary pause, determining, as an extension of the first spoken utterance, a third spoken utterance that is received after outputting the first spoken response utterance, deleting, using a deep neural network model, a duplicate utterance part from a fourth spoken utterance that is obtained by combining the first and third spoken utterances, and outputting a second spoken response utterance as a result of speech recognition processing of the fourth spoken utterance from which the duplicate utterance part has been deleted.

Speech processing method and apparatus therefor

    Publication No.: US11302324B2

    Publication date: 2022-04-12

    Application No.: US16698649

    Filing date: 2019-11-27

    Abstract: Disclosed are a speech processing method and an apparatus therefor which execute an installed artificial intelligence algorithm and/or machine learning algorithm to perform speech processing in a 5G communication environment. The speech processing method may include determining a temporary pause of reception of a first spoken utterance, outputting a first spoken response utterance as a result of speech recognition processing of a second spoken utterance received after the temporary pause, determining, as an extension of the first spoken utterance, a third spoken utterance received after outputting the first spoken response utterance, deleting, when performing speech recognition processing on a fourth spoken utterance obtained by combining the first and third spoken utterances, a duplicate utterance part from the fourth spoken utterance, and outputting a second spoken response utterance as a result of speech recognition processing of the fourth spoken utterance from which the duplicate utterance part has been deleted.
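The combining and duplicate-deletion steps described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the utterances have already been transcribed to text, and it swaps in a plain word-overlap check at the junction of the two utterances rather than any learned model. All function names and the sample sentences are hypothetical.

```python
from typing import List, Tuple


def combine(first: str, third: str) -> Tuple[List[str], int]:
    """Build the 'fourth utterance' as a token list.

    Returns the combined tokens and the index where the third
    utterance begins (the boundary). Hypothetical helper.
    """
    first_tokens = first.split()
    return first_tokens + third.split(), len(first_tokens)


def delete_duplicate_part(tokens: List[str], boundary: int) -> List[str]:
    """Drop the words repeated across the boundary.

    Assumption: the duplicate part is the longest run of words that
    ends the first utterance and also starts the third utterance
    (compared case-insensitively). One copy of that run is removed.
    """
    max_k = min(boundary, len(tokens) - boundary)
    for k in range(max_k, 0, -1):
        left = [t.lower() for t in tokens[boundary - k:boundary]]
        right = [t.lower() for t in tokens[boundary:boundary + k]]
        if left == right:
            # Keep the copy from the first utterance, skip the repeat.
            return tokens[:boundary] + tokens[boundary + k:]
    return list(tokens)  # no overlap found; nothing to delete


def merge_utterances(first: str, third: str) -> str:
    """Combine two utterances and delete the duplicated part."""
    tokens, boundary = combine(first, third)
    return " ".join(delete_duplicate_part(tokens, boundary))


if __name__ == "__main__":
    print(merge_utterances("tell me the cast of the drama", "the drama Iris"))
```

A simple overlap check only catches verbatim repetition at the junction; paraphrased or partially repeated speech would need a more capable model, which is outside the scope of this sketch.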
