-
公开(公告)号:US20210082421A1
公开(公告)日:2021-03-18
申请号:US16676160
申请日:2019-11-06
Applicant: LG ELECTRONICS INC.
Inventor: Sang Ki KIM , Yongchul PARK , Minook KIM , Siyoung YANG , Juyeong JANG , Sungmin HAN
Abstract: Disclosed are a speech processing method and a speech processing apparatus, characterized in that a speech processing is carried out by executing an artificial intelligence (AI) algorithm and/or a machine learning algorithm, such that the speech processing apparatus, a user terminal, and a server can communicate with each other in a 5G communication environment. The speech processing method according to one exemplary embodiment of the present invention includes converting a response text, which is generated in response to a spoken utterance of a user, to a spoken response utterance, obtaining external situation information while outputting the spoken response utterance, generating a dynamic spoken response utterance by converting the spoken response utterance on the basis of the external situation information, and outputting the dynamic spoken response utterance.
-
公开(公告)号:US20210096810A1
公开(公告)日:2021-04-01
申请号:US16703768
申请日:2019-12-04
Applicant: LG ELECTRONICS INC.
Inventor: Sang Ki KIM , Yongchul PARK , Sungmin HAN , Siyoung YANG , Juyeong JANG , Minook KIM
Abstract: Disclosed are a sound source focus method and device in which the sound source focus device, in a 5G communication environment by amplifying and outputting a sound source signal of a user's object of interest extracted from an acoustic signal included in video content by executing a loaded artificial intelligence (AI) algorithm and/or machine learning algorithm. The sound source focus method includes playing video content including a video signal including at least one moving object and the acoustic signal in which sound sources output by the object are mixed, determining the user's object of interest from the video signal, acquiring unique sound source information about the user's object of interest, extracting an actual sound source for the user's object of interest corresponding to the unique sound source information from the acoustic signal, and outputting the actual sound source extracted for the user's object of interest.
-
公开(公告)号:US20200043495A1
公开(公告)日:2020-02-06
申请号:US16601787
申请日:2019-10-15
Applicant: LG ELECTRONICS INC.
Inventor: Yongchul PARK , Minook KIM , Sang Ki KIM , Siyoung YANG , Juyeong JANG , Sungmin HAN
Abstract: A method for performing multi-language communication includes receiving an utterance, identifying a language of the received utterance, determining whether the identified language matches a preset reference language, applying, to the received utterance, an interpretation model interpreting the identified language into the reference language when the identified language does not match the reference language, changing, to text, speech data which is outputted in the reference language as a result of applying the interpretation model, generating a response message responding to the text of the speech data, and outputting the response message. Here, the interpretation model may be a deep neural network model generated through machine learning, and the interpretation model may be stored in an edge device or provided through a server in an Internet of things environment through a 5G network.
-
-