-
Publication No.: US11514902B2
Publication Date: 2022-11-29
Application No.: US16541874
Filing Date: 2019-08-15
Applicant: LG ELECTRONICS INC.
Inventor: Hyun Yu, Byeong Ha Kim
IPC: G10L15/19, G10L15/22, G10L25/78, G10L15/08, G06F40/242
Abstract: A speech recognition apparatus and an operating method thereof which execute a mounted artificial intelligence (AI) algorithm and/or machine learning algorithm to perform speech recognition and communicate with different electronic apparatuses and external servers in a 5G communication environment are disclosed. A speech recognition method according to an exemplary embodiment of the present disclosure includes determining a temporary pause for reception of a first utterance sentence in the middle of the reception of the first utterance sentence, outputting a speech recognition processing result of a second utterance sentence which is received after the temporary pause, separately from the first utterance sentence, determining a third utterance sentence which is received after outputting the speech recognition processing result of the second utterance sentence as an extension of the first utterance sentence, and outputting a speech recognition processing result of a fourth utterance sentence obtained by combining the first utterance sentence and the third utterance sentence. According to the present disclosure, when a delay occurs in the middle of receiving an utterance, the speech received so far is treated as an incomplete utterance and temporarily stored, and a speech recognition processing result is provided for an additional utterance received after the delay. The utterance that is then input again is combined with the utterance received before the delay, the two are recognized together as a completed utterance, and a speech recognition processing result is provided for it, which improves speech recognition processing performance.
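The flow of first, second, third, and combined fourth utterances described in this abstract can be sketched as a small state machine. This is a minimal illustration, not the patented implementation. The abstract does not say how a temporary pause is detected, so the trailing filler-word check (`FILLER_WORDS`) is an assumption, as are the `UtteranceSession` class, the `processed:` output format, and the simplification that whatever arrives after the interjected utterance is treated as the extension of the stored one.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical pause markers. The abstract does not specify the detection rule.
FILLER_WORDS = {"um", "uh", "hmm", "wait"}


@dataclass
class UtteranceSession:
    pending: Optional[str] = None          # stored, incomplete first utterance
    answered_interjection: bool = False    # True once the second utterance was handled
    outputs: List[str] = field(default_factory=list)

    def receive(self, text: str) -> None:
        words = text.split()
        if self.pending is None:
            if words and words[-1].lower() in FILLER_WORDS:
                # Temporary pause: keep the incomplete utterance, output nothing yet.
                self.pending = " ".join(words[:-1])
            else:
                self.outputs.append(f"processed: {text}")
        elif not self.answered_interjection:
            # Second utterance: processed separately from the stored first one.
            self.outputs.append(f"processed: {text}")
            self.answered_interjection = True
        else:
            # Third utterance: assumed here to extend the first utterance.
            combined = f"{self.pending} {text}"  # the "fourth" utterance
            self.outputs.append(f"processed: {combined}")
            self.pending = None
            self.answered_interjection = False


session = UtteranceSession()
session.receive("in the drama Iris um")    # first utterance, paused
session.receive("who is the lead actor")   # second utterance
session.receive("tell me the role")        # third utterance
print(session.outputs)
```

A real system would replace the filler-word check with whatever pause criterion it uses, and would need an actual test for whether the third utterance continues the first one.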
-
Publication No.: US11373656B2
Publication Date: 2022-06-28
Application No.: US16704988
Filing Date: 2019-12-05
Applicant: LG ELECTRONICS INC.
Inventor: Ye Jin Kim, Hyun Yu, Byeong Ha Kim
Abstract: Disclosed are a speech processing method and a speech processing apparatus in a 5G communication environment through speech processing by executing embedded artificial intelligence (AI) algorithms and/or machine learning algorithms. The speech processing method includes determining a temporary pause of reception of a first spoken utterance, outputting a first spoken response utterance as a result of speech recognition processing of a second spoken utterance received after the temporary pause, determining, as an extension of the first spoken utterance, a third spoken utterance that is received after outputting the first spoken response utterance, deleting, using a deep neural network model, a duplicate utterance part from a fourth spoken utterance that is obtained by combining the first and the third spoken utterance, and outputting a second spoken response utterance as a result of speech recognition processing of the fourth spoken utterance from which the duplicate utterance part has been deleted.
-
Publication No.: US11302324B2
Publication Date: 2022-04-12
Application No.: US16698649
Filing Date: 2019-11-27
Applicant: LG ELECTRONICS INC.
Inventor: Ye Jin Kim, Hyun Yu, Byeong Ha Kim
IPC: G10L15/22, G10L15/187, G10L25/78, G10L15/08
Abstract: Disclosed are a speech processing method and apparatus therefor which execute an installed artificial intelligence algorithm and/or machine learning algorithm to perform speech processing in a 5G communication environment. The speech processing method may include determining a temporary pause of reception of a first spoken utterance, outputting a first spoken response utterance as a result of speech recognition processing of a second spoken utterance received after the temporary pause, determining, as an extension of the first spoken utterance, a third spoken utterance received after outputting the first spoken response utterance, deleting a duplicate utterance part from a fourth spoken utterance that is obtained by combining the first and the third spoken utterance, when performing speech recognition processing on the fourth spoken utterance, and outputting a second spoken response utterance as a result of speech recognition processing of the fourth spoken utterance from which the duplicate utterance part has been deleted.
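The step of deleting a duplicate utterance part from the combined fourth utterance can be illustrated with a simple token-overlap merge. The abstract does not state how the duplicate part is identified, so this sketch substitutes a plain technique: it drops the longest run of words at the start of the third utterance that repeats the end of the first. The function name `merge_without_duplicate` and the example sentences are hypothetical.

```python
def merge_without_duplicate(first: str, third: str) -> str:
    """Combine two utterances, removing words that the third utterance
    repeats from the end of the first (case-insensitive comparison)."""
    a = first.split()
    b = third.split()
    # Try the longest possible overlap first, then shrink.
    for k in range(min(len(a), len(b)), 0, -1):
        tail = [w.lower() for w in a[-k:]]
        head = [w.lower() for w in b[:k]]
        if tail == head:
            return " ".join(a + b[k:])
    # No repeated words at the boundary: plain concatenation.
    return " ".join(a + b)


print(merge_without_duplicate("in the drama Iris",
                              "in the drama Iris tell me the role"))
print(merge_without_duplicate("turn on the", "the living room light"))
```

This only catches repetition at the boundary between the two utterances. Duplicates that are paraphrased or appear elsewhere in the sentence would need a different approach.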
-
-