-
公开(公告)号:US11721345B2
公开(公告)日:2023-08-08
申请号:US16917784
申请日:2020-06-30
Applicant: LG ELECTRONICS INC.
Inventor: Siyoung Yang , Yongchul Park , Sungmin Han , Sangki Kim , Juyeong Jang , Minook Kim
CPC classification number: G10L15/32 , G10L15/22 , G10L15/30 , G16Y40/35 , G10L2015/228
Abstract: Disclosed is a device for controlling a plurality of voice recognition devices for determining and selecting a first voice recognition device that a user wants to use based on a point in time when the voice of the user is spoken or a place where the user spoke the voice. The device for controlling a plurality of voice recognition devices according to the present disclosure may be associated with an artificial intelligence module, a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to 5G service, etc.
-
2.
公开(公告)号:US11107456B2
公开(公告)日:2021-08-31
申请号:US16561410
申请日:2019-09-05
Applicant: LG ELECTRONICS INC.
Inventor: Jonghoon Chae , Minook Kim , Sangki Kim , Yongchul Park , Siyoung Yang , Juyeong Jang , Sungmin Han
IPC: G10L15/02 , G10L13/08 , G10L13/033 , G10L13/047
Abstract: Discussed is an artificial intelligence (AI)-based voice sampling apparatus for providing a speech style, including a rhyme encoder configured to receive a user's voice, extract a voice sample, and analyze a vocal feature included in the voice sample, a text encoder configured to receive text for reflecting the vocal feature, a processor configured to classify the vocal feature of the voice sample input to the rhyme encoder according to a label, extract an embedding vector representing the vocal feature from the label, and generate a speech style from the embedding vector and apply the generated speech style to the text, and a rhyme decoder configured to output synthesized voice data in which the speech style is applied to the text by the processor.
-
公开(公告)号:US11114114B2
公开(公告)日:2021-09-07
申请号:US16850810
申请日:2020-04-16
Applicant: LG ELECTRONICS INC.
Inventor: Jonghoon Chae , Juyeong Jang , Yongchul Park , Siyoung Yang , Sungmin Han
Abstract: An apparatus that includes a microphone and a processor. The processor is configured to receive, via the microphone, audio comprising voice of a person, and determine whether the received audio is an actual voice or a synthesized voice. The apparatus also provides a first notification indicating that the received audio is the actual voice when the received audio is the actual voice, and provides a second notification indicating that the received audio is the synthesized voice when the received audio is the synthesized voice.
-
公开(公告)号:US11010124B2
公开(公告)日:2021-05-18
申请号:US16703768
申请日:2019-12-04
Applicant: LG ELECTRONICS INC.
Inventor: Sang Ki Kim , Yongchul Park , Sungmin Han , Siyoung Yang , Juyeong Jang , Minook Kim
Abstract: Disclosed are a sound source focus method and device in which the sound source focus device, in a 5G communication environment by amplifying and outputting a sound source signal of a user's object of interest extracted from an acoustic signal included in video content by executing a loaded artificial intelligence (AI) algorithm and/or machine learning algorithm. The sound source focus method includes playing video content including a video signal including at least one moving object and the acoustic signal in which sound sources output by the object are mixed, determining the user's object of interest from the video signal, acquiring unique sound source information about the user's object of interest, extracting an actual sound source for the user's object of interest corresponding to the unique sound source information from the acoustic signal, and outputting the actual sound source extracted for the user's object of interest.
-
公开(公告)号:US10692517B2
公开(公告)日:2020-06-23
申请号:US16151091
申请日:2018-10-03
Applicant: LG ELECTRONICS INC.
Inventor: Jonghoon Chae , Juyeong Jang , Yongchul Park , Siyoung Yang , Sungmin Han
Abstract: An apparatus that includes a microphone and a processor. The processor is configured to receive, via the microphone, audio comprising voice of a person, and determine whether the received audio is an actual voice or a synthesized voice. The apparatus also provides a first notification indicating that the received audio is the actual voice when the received audio is the actual voice, and provides a second notification indicating that the received audio is the synthesized voice when the received audio is the synthesized voice.
-
6.
公开(公告)号:US11721319B2
公开(公告)日:2023-08-08
申请号:US16803941
申请日:2020-02-27
Applicant: LG ELECTRONICS INC.
Inventor: Minook Kim , Yongchul Park , Sungmin Han , Siyoung Yang , Sangki Kim , Juyeong Jang
IPC: G10L13/10 , G06N5/04 , G10L13/047 , G06N20/00
CPC classification number: G10L13/10 , G06N5/04 , G06N20/00 , G10L13/047
Abstract: An artificial intelligence device includes a memory and a processor. The memory is configured to store audio data having a predetermined speech style. The processor is configured to generate a condition vector relating to a condition for determining the speech style of the audio data, reduce a dimension of the condition vector to a predetermined reduction dimension, acquire a sparse code vector based on a dictionary vector acquired through sparse dictionary coding with respect to the condition vector having the predetermined reduction dimension, and change a vector element value included in the sparse code vector.
-
7.
公开(公告)号:US11636845B2
公开(公告)日:2023-04-25
申请号:US16928815
申请日:2020-07-14
Applicant: LG ELECTRONICS INC.
Inventor: Siyoung Yang , Yongchul Park , Sungmin Han , Sangki Kim , Juyeong Jang , Minook Kim
IPC: G10L13/00 , G10L13/08 , G10L13/10 , G10L13/033 , G10L13/04
Abstract: A method includes generating first synthesized speech by using text and a first emotion vector configured for the text, extracting a second emotion vector included in the first synthesized speech, determining whether correction of the second emotion information vector is needed by comparing a loss value calculated by using the first emotion information vector and the second emotion information vector with a preconfigured threshold, re-performing speech synthesis by using a third emotion information vector generated by correcting the second emotion information vector, and outputting the generated synthesized speech, thereby configuring emotion information of speech in a more effective manner. A speech synthesis apparatus may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.
-
公开(公告)号:US11443732B2
公开(公告)日:2022-09-13
申请号:US16499816
申请日:2019-02-15
Applicant: LG ELECTRONICS INC.
Inventor: Jonghoon Chae , Sungmin Han
IPC: G10L13/047
Abstract: A speech synthesizer includes a memory configured to store a plurality of sentences and prior information of a word classified into a minor class among a plurality of classes with respect to each sentence, and a processor configured to determine an oversampling rate of the word based on the prior information, determine the number of times of oversampling of the word using the determined oversampling rate and generate sentences including the word by the determined number of times of oversampling. The plurality of classes includes a first class corresponding to first reading break, a second class corresponding to second reading break greater than the first break and a third class corresponding to third reading break greater than the second break, and the minor class has a smallest count among the first to third classes in one sentence.
-
公开(公告)号:US11120785B2
公开(公告)日:2021-09-14
申请号:US16547323
申请日:2019-08-21
Applicant: LG ELECTRONICS INC.
Inventor: Jonghoon Chae , Yongchul Park , Siyoung Yang , Juyeong Jang , Sungmin Han
Abstract: A voice synthesis device which includes a database configured to store a voice and a text corresponding to the voice and a processor configured to extract characteristic information and a tone of a first-language voice stored in the database, classify an utterance style of an utterer on basis of the extracted characteristic information, generate utterer analysis information including the utterance style and the tone, translate a text corresponding to the first-language voice into a second language, and synthesize the text, translated into the second language, in a second-language voice by using the utterer analysis information.
-
公开(公告)号:US11074904B2
公开(公告)日:2021-07-27
申请号:US16593161
申请日:2019-10-04
Applicant: LG Electronics Inc.
Inventor: Siyoung Yang , Minook Kim , Sangki Kim , Yongchul Park , Juyeong Jang , Sungmin Han
Abstract: A speech synthesis method and apparatus based on emotion information are disclosed. A speech synthesis method based on emotion information extracts speech synthesis target text from received data and determines whether the received data includes situation explanation information. First metadata corresponding to first emotion information is generated on the basis of the situation explanation information. When the extracted data does not include situation explanation information, second metadata corresponding to second emotion information generated on the basis of semantic analysis and context analysis is generated. One of the first metadata and the second metadata is added to the speech synthesis target text to synthesize speech corresponding to the extracted data. A speech synthesis apparatus of this disclosure may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.
-
-
-
-
-
-
-
-
-