Patent search ap:("LG ELECTRONICS INC.") AND inv:"Sungmin Han" Page 1

1.

Granted invention patent
Device, system and method for controlling a plurality of voice recognition devices (In force)

Publication number: US11721345B2

Publication date: 2023-08-08

Application number: US16917784

Application date: 2020-06-30

Applicant: LG ELECTRONICS INC.

Inventor: Siyoung Yang, Yongchul Park, Sungmin Han, Sangki Kim, Juyeong Jang, Minook Kim

IPC: G10L15/32, G10L15/30, G16Y40/35, G10L15/22

CPC classification number: G10L15/32, G10L15/22, G10L15/30, G16Y40/35, G10L2015/228

Abstract: Disclosed is a device for controlling a plurality of voice recognition devices for determining and selecting a first voice recognition device that a user wants to use based on a point in time when the voice of the user is spoken or a place where the user spoke the voice. The device for controlling a plurality of voice recognition devices according to the present disclosure may be associated with an artificial intelligence module, a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to 5G service, etc.

2.

Granted invention patent
Artificial intelligence (AI)-based voice sampling apparatus and method for providing speech style (In force)

Publication number: US11107456B2

Publication date: 2021-08-31

Application number: US16561410

Application date: 2019-09-05

Applicant: LG ELECTRONICS INC.

Inventor: Jonghoon Chae, Minook Kim, Sangki Kim, Yongchul Park, Siyoung Yang, Juyeong Jang, Sungmin Han

IPC: G10L15/02, G10L13/08, G10L13/033, G10L13/047

Abstract: Discussed is an artificial intelligence (AI)-based voice sampling apparatus for providing a speech style, including a rhyme encoder configured to receive a user's voice, extract a voice sample, and analyze a vocal feature included in the voice sample, a text encoder configured to receive text for reflecting the vocal feature, a processor configured to classify the vocal feature of the voice sample input to the rhyme encoder according to a label, extract an embedding vector representing the vocal feature from the label, and generate a speech style from the embedding vector and apply the generated speech style to the text, and a rhyme decoder configured to output synthesized voice data in which the speech style is applied to the text by the processor.

3.

Granted invention patent
Voice interpretation device (In force)

Publication number: US11114114B2

Publication date: 2021-09-07

Application number: US16850810

Application date: 2020-04-16

Applicant: LG ELECTRONICS INC.

Inventor: Jonghoon Chae, Juyeong Jang, Yongchul Park, Siyoung Yang, Sungmin Han

IPC: G10L13/00, G10L25/69, G10L19/02, G10L17/26, G10L15/02, G10L25/18, G10L15/22

Abstract: An apparatus that includes a microphone and a processor. The processor is configured to receive, via the microphone, audio comprising voice of a person, and determine whether the received audio is an actual voice or a synthesized voice. The apparatus also provides a first notification indicating that the received audio is the actual voice when the received audio is the actual voice, and provides a second notification indicating that the received audio is the synthesized voice when the received audio is the synthesized voice.

4.

Granted invention patent
Method and device for focusing sound source (In force)

Publication number: US11010124B2

Publication date: 2021-05-18

Application number: US16703768

Application date: 2019-12-04

Applicant: LG ELECTRONICS INC.

Inventor: Sang Ki Kim, Yongchul Park, Sungmin Han, Siyoung Yang, Juyeong Jang, Minook Kim

IPC: H04N5/93, G06F3/16, G06K9/00, G10L25/51

Abstract: Disclosed are a sound source focus method and device in which the sound source focus device, operating in a 5G communication environment, amplifies and outputs a sound source signal of a user's object of interest extracted from an acoustic signal included in video content by executing a loaded artificial intelligence (AI) algorithm and/or machine learning algorithm. The sound source focus method includes playing video content including a video signal including at least one moving object and the acoustic signal in which sound sources output by the object are mixed, determining the user's object of interest from the video signal, acquiring unique sound source information about the user's object of interest, extracting an actual sound source for the user's object of interest corresponding to the unique sound source information from the acoustic signal, and outputting the actual sound source extracted for the user's object of interest.

5.

Granted invention patent
Voice interpretation device (In force)

Publication number: US10692517B2

Publication date: 2020-06-23

Application number: US16151091

Application date: 2018-10-03

Applicant: LG ELECTRONICS INC.

Inventor: Jonghoon Chae, Juyeong Jang, Yongchul Park, Siyoung Yang, Sungmin Han

IPC: G10L17/00, G10L25/69, G10L19/02, G10L17/26, G10L15/02, G10L25/18, G10L15/22

Abstract: An apparatus that includes a microphone and a processor. The processor is configured to receive, via the microphone, audio comprising voice of a person, and determine whether the received audio is an actual voice or a synthesized voice. The apparatus also provides a first notification indicating that the received audio is the actual voice when the received audio is the actual voice, and provides a second notification indicating that the received audio is the synthesized voice when the received audio is the synthesized voice.

6.

Granted invention patent
Artificial intelligence device and method for generating speech having a different speech style (In force)

Publication number: US11721319B2

Publication date: 2023-08-08

Application number: US16803941

Application date: 2020-02-27

Applicant: LG ELECTRONICS INC.

Inventor: Minook Kim, Yongchul Park, Sungmin Han, Siyoung Yang, Sangki Kim, Juyeong Jang

IPC: G10L13/10, G06N5/04, G10L13/047, G06N20/00

CPC classification number: G10L13/10, G06N5/04, G06N20/00, G10L13/047

Abstract: An artificial intelligence device includes a memory and a processor. The memory is configured to store audio data having a predetermined speech style. The processor is configured to generate a condition vector relating to a condition for determining the speech style of the audio data, reduce a dimension of the condition vector to a predetermined reduction dimension, acquire a sparse code vector based on a dictionary vector acquired through sparse dictionary coding with respect to the condition vector having the predetermined reduction dimension, and change a vector element value included in the sparse code vector.

7.

Granted invention patent
Method for synthesized speech generation using emotion information correction and apparatus (In force)

Publication number: US11636845B2

Publication date: 2023-04-25

Application number: US16928815

Application date: 2020-07-14

Applicant: LG ELECTRONICS INC.

Inventor: Siyoung Yang, Yongchul Park, Sungmin Han, Sangki Kim, Juyeong Jang, Minook Kim

IPC: G10L13/00, G10L13/08, G10L13/10, G10L13/033, G10L13/04

Abstract: A method includes generating first synthesized speech by using text and a first emotion vector configured for the text, extracting a second emotion vector included in the first synthesized speech, determining whether correction of the second emotion information vector is needed by comparing a loss value calculated by using the first emotion information vector and the second emotion information vector with a preconfigured threshold, re-performing speech synthesis by using a third emotion information vector generated by correcting the second emotion information vector, and outputting the generated synthesized speech, thereby configuring emotion information of speech in a more effective manner. A speech synthesis apparatus may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.
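
Note: the correction check in this abstract (compare a loss between the first and second emotion vectors with a preconfigured threshold) can be illustrated with a minimal Python sketch. The abstract does not name the loss function or the comparison direction, so mean squared error and "loss above threshold means correction is needed" are assumptions here, and the function names are hypothetical.

```python
from typing import Sequence


def emotion_loss(first: Sequence[float], second: Sequence[float]) -> float:
    """Mean squared error between two emotion vectors.

    MSE is an assumed loss; the abstract only says "a loss value".
    """
    if len(first) == 0 or len(first) != len(second):
        raise ValueError("emotion vectors must be non-empty and equal in length")
    return sum((a - b) ** 2 for a, b in zip(first, second)) / len(first)


def needs_correction(
    first: Sequence[float], second: Sequence[float], threshold: float
) -> bool:
    """Return True when the loss exceeds the preconfigured threshold.

    Treating "loss > threshold" as the trigger is an assumption.
    """
    return emotion_loss(first, second) > threshold
```

When `needs_correction` returns True, the method described in the abstract would generate a corrected third vector and re-run synthesis; how that correction is computed is not stated, so it is left out of the sketch.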

8.

Granted invention patent
Speech synthesizer using artificial intelligence, method of operating speech synthesizer and computer-readable recording medium (In force)

Publication number: US11443732B2

Publication date: 2022-09-13

Application number: US16499816

Application date: 2019-02-15

Applicant: LG ELECTRONICS INC.

Inventor: Jonghoon Chae, Sungmin Han

IPC: G10L13/047

Abstract: A speech synthesizer includes a memory configured to store a plurality of sentences and prior information of a word classified into a minor class among a plurality of classes with respect to each sentence, and a processor configured to determine an oversampling rate of the word based on the prior information, determine the number of times of oversampling of the word using the determined oversampling rate and generate sentences including the word by the determined number of times of oversampling. The plurality of classes includes a first class corresponding to first reading break, a second class corresponding to second reading break greater than the first break and a third class corresponding to third reading break greater than the second break, and the minor class has a smallest count among the first to third classes in one sentence.
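
Note: the abstract defines the minor class as the reading-break class with the smallest count in a sentence. A minimal Python sketch of that rule follows. The integer labels 1 to 3, the restriction to classes that actually occur in the sentence, and the tie-break toward the lower class number are all assumptions, and `minor_class` is a hypothetical name. The oversampling-rate computation is not sketched because the abstract does not describe the prior information it depends on.

```python
from collections import Counter
from typing import Sequence

# Assumed labels for the first, second and third reading-break classes.
READING_BREAK_CLASSES = (1, 2, 3)


def minor_class(word_classes: Sequence[int]) -> int:
    """Return the reading-break class with the smallest count in one sentence.

    Only classes that occur at least once are considered, and ties go to
    the lower class number; both choices are assumptions.
    """
    counts = Counter(word_classes)
    present = [c for c in READING_BREAK_CLASSES if counts[c] > 0]
    if not present:
        raise ValueError("sentence contains no labeled reading breaks")
    return min(present, key=lambda c: (counts[c], c))
```

For a sentence whose words are labeled `[1, 1, 2, 1, 3, 3]`, class 2 occurs once and is therefore the minor class under these assumptions.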

9.

Granted invention patent
Voice synthesis device (In force)

Publication number: US11120785B2

Publication date: 2021-09-14

Application number: US16547323

Application date: 2019-08-21

Applicant: LG ELECTRONICS INC.

Inventor: Jonghoon Chae, Yongchul Park, Siyoung Yang, Juyeong Jang, Sungmin Han

IPC: G10L13/08, G10L25/63, G10L25/90, G10L13/033, G06F40/58, G10L13/00

Abstract: A voice synthesis device which includes a database configured to store a voice and a text corresponding to the voice and a processor configured to extract characteristic information and a tone of a first-language voice stored in the database, classify an utterance style of an utterer on basis of the extracted characteristic information, generate utterer analysis information including the utterance style and the tone, translate a text corresponding to the first-language voice into a second language, and synthesize the text, translated into the second language, in a second-language voice by using the utterer analysis information.

10.

Granted invention patent
Speech synthesis method and apparatus based on emotion information (In force)

Publication number: US11074904B2

Publication date: 2021-07-27

Application number: US16593161

Application date: 2019-10-04

Applicant: LG Electronics Inc.

Inventor: Siyoung Yang, Minook Kim, Sangki Kim, Yongchul Park, Juyeong Jang, Sungmin Han

IPC: G10L13/00, G10L13/02, G10L25/63, G06F17/16, G10L15/30, G10L25/30

Abstract: A speech synthesis method and apparatus based on emotion information are disclosed. A speech synthesis method based on emotion information extracts speech synthesis target text from received data and determines whether the received data includes situation explanation information. First metadata corresponding to first emotion information is generated on the basis of the situation explanation information. When the extracted data does not include situation explanation information, second metadata corresponding to second emotion information generated on the basis of semantic analysis and context analysis is generated. One of the first metadata and the second metadata is added to the speech synthesis target text to synthesize speech corresponding to the extracted data. A speech synthesis apparatus of this disclosure may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.
