专利检索 ap:("Samsung Electronics Co., Ltd.") AND inv:"Chanwoo KIM" 第 1 页

1.

发明公开
ELECTRONIC DEVICE AND CONTROLLING METHOD OF ELECTRONIC DEVICE 审中-公开

公开(公告)号：US20240312457A1

公开(公告)日：2024-09-19

申请号：US18669069

申请日：2024-05-20

申请人： Samsung Electronics Co., Ltd.

发明人： Dhananjaya Nagaraja GOWDA , Jiyeon KIM , Abhinav GARG , Chanwoo KIM

IPC分类号： G10L15/22 , G10L15/06

CPC分类号： G10L15/22 , G10L15/063

摘要： Provided are an electronic device and a method of controlling an electronic device. The electronic device includes: a memory storing at least one instruction; and at least one processor configured to execute the at least one instruction, wherein one or more of the at least one processor is configured to: acquire a first vector corresponding to each of a plurality of sections of a voice signal by inputting the voice signal to a common encoder based on acquiring the voice signal; acquire a second vector corresponding to each of the plurality of sections and independent on a context of the voice signal by inputting the first vector into a first individual encoder; acquire a phoneme sequence corresponding to the second vector by inputting the second vector into a first decoder; acquire a third vector corresponding to at least two sections among the plurality of sections and dependent on the context of the voice signal by inputting the first vectors into a second individual encoder; acquire a sub-word sequence corresponding to the third vector by inputting the third vector into a second decoder; and acquire text information corresponding to the plurality of sections by correcting the sub-word sequence based on the phoneme sequence, through a text information acquisition module.

2.

发明申请
ELECTRONIC DEVICE AND CONTROLLING METHOD OF ELECTRONIC DEVICE 有权

公开(公告)号：US20210304737A1

公开(公告)日：2021-09-30

申请号：US17189710

申请日：2021-03-02

申请人： SAMSUNG ELECTRONICS CO., LTD.

发明人： Changwoo HAN , Kwangyoun KIM , Chanwoo KIM , Kyungmin LEE , Youngho HAN

IPC分类号： G10L15/18 , G10L15/06 , G10L15/26

摘要： Disclosed are an electronic device and a method of controlling the electronic device. An electronic device according to an embodiment may perform a method comprising: performing natural language understanding for a first text included in learning data, obtaining first information associated with a speech corresponding to the first text being uttered based on a result of the natural language understanding, obtain second information associated with an acoustic feature corresponding to the speech corresponding to the first text being uttered based on the first information, obtaining a plurality of speech signals corresponding to the first text by converting a first speech signal corresponding to the first text based on the first information and the second information, and training a speech recognition model based on the plurality of obtained speech signals and the first text.

3.

发明公开
ELECTRONIC DEVICE FOR TRAINING SPEECH RECOGNITION MODEL AND CONTROL METHOD THEREOF 审中-公开

公开(公告)号：US20240078391A1

公开(公告)日：2024-03-07

申请号：US18225991

申请日：2023-07-25

申请人： SAMSUNG ELECTRONICS CO., LTD.

发明人： Chanwoo KIM

IPC分类号： G06F40/40

CPC分类号： G06F40/40

摘要： Provided is an electronic device for training a speech recognition model and a method for controlling thereof. The method of controlling the electronic device includes obtaining a first loss value by inputting a first learning speech sequence comprising an end-of-sentence (EOS) label to the speech recognition model; and training the speech recognition model based on the first loss value. Here, the first loss value is a loss value obtained from an output of an encoder included in the speech recognition model.

4.

发明申请
SYSTEM AND METHOD FOR MODIFYING SPEECH RECOGNITION RESULT 有权

公开(公告)号：US20210050017A1

公开(公告)日：2021-02-18

申请号：US16990343

申请日：2020-08-11

申请人： SAMSUNG ELECTRONICS CO., LTD.

发明人： Chanwoo KIM , Dhananjaya N. GOWDA , Abhinav GARG , Kyungmin LEE

IPC分类号： G10L15/30 , G10L15/22 , G10L19/008 , G10L19/06 , G10L15/06

摘要： Provided are a system and method for modifying a speech recognition result. The method includes: receiving, from a device, text output from an automatic speech recognition (ASR) model of the device; identifying at least one domain related to the received text; selecting, from among a plurality of text modification models included in the server, at least one text modification model corresponding to the identified at least one domain; and modifying the received text by using the selected at least one text modification model.

5.

发明申请
SERVER THAT SUPPORTS SPEECH RECOGNITION OF DEVICE, AND OPERATION METHOD OF THE SERVER 有权

公开(公告)号：US20210050018A1

公开(公告)日：2021-02-18

申请号：US16992943

申请日：2020-08-13

申请人： SAMSUNG ELECTRONICS CO., LTD.

发明人： Chanwoo KIM , Sichen JIN , Kyungmin LEE , Dhananjaya N. GOWDA , Kwangyoun KIM

IPC分类号： G10L15/30 , G10L15/02 , G10L15/16

摘要： A server for supporting speech recognition of a device and an operation method of the server. The server and method identify a plurality of estimated character strings from the first character string and obtain a second character string, based on the plurality of estimated character strings, and transmit the second character string to the device. The first character string is output from a speech signal input to the device, via speech recognition.

6.

发明申请
VOICE RECOGNITION DEVICE AND METHOD 有权

公开(公告)号：US20220005481A1

公开(公告)日：2022-01-06

申请号：US17296806

申请日：2019-11-22

申请人： Samsung Electronics Co., Ltd.

发明人： Chanwoo KIM , Dhananjaya N. GOWDA , Sungsoo KIM , Minkyu SHIN , Larry Paul HECK , Abhinav GARG , Kwangyoun KIM , Mehul KUMAR

IPC分类号： G10L17/02 , G10L25/21 , G10L17/04

摘要： The disclosure relates to an electronic apparatus for recognizing user voice and a method of recognizing, by the electronic apparatus, the user voice. According to an embodiment, the method of recognizing the user voice includes obtaining an audio signal segmented into a plurality of frame units, determining an energy component for each filter bank by applying a filter bank distributed according to a preset scale to a frequency spectrum of the audio signal segmented into the frame units, smoothing the determined energy component for each filter bank, extracting a feature vector of the audio signal based on the smoothed energy component for each filter bank, and recognizing the user voice in the audio signal by inputting the extracted feature vector to a voice recognition model.

7.

发明申请
SYSTEM AND METHOD FOR RECOGNIZING USER'S SPEECH 有权

公开(公告)号：US20210050016A1

公开(公告)日：2021-02-18

申请号：US16988929

申请日：2020-08-10

申请人： Samsung Electronics Co., Ltd.

发明人： Chanwoo KIM , Dhananjaya N. GOWDA , Kwangyoun KIM , Kyungmin LEE

IPC分类号： G10L15/30 , G10L15/22 , G10L19/008 , G10L19/09

摘要： Provided is a system and method for recognizing a user's speech. A method, performed by a server, of providing a text string for a speech signal input to a device includes: receiving, from the device, an encoder output value derived from an encoder of an end-to-end automatic speech recognition (ASR) model included in the device; identifying a domain corresponding to the received encoder output value; selecting a decoder corresponding to the identified domain from among a plurality of decoders of an end-to-end ASR model included in the server; obtaining a text string from the received encoder output value using the selected decoder; and providing the obtained text string to the device.

8.

发明申请
METHOD AND DEVICE FOR SPEECH RECOGNITION 审中-公开

公开(公告)号：US20200234713A1

公开(公告)日：2020-07-23

申请号：US16750274

申请日：2020-01-23

申请人： Samsung Electronics Co., Ltd.

发明人： Dhananjaya N. GOWDA , Kwangyoun KIM , Abhinav GARG , Chanwoo KIM

IPC分类号： G10L15/26 , G10L15/02 , G10L15/28 , G10L17/04 , G10L17/20

摘要： Provided are an electronic device for recognizing speech of a user, and a method, performed by the electronic device, of recognizing speech. The method includes obtaining an audio signal based on a speech input based on the audio signal being input, obtaining an output value of a first automatic speech recognition (ASR) model that outputs a character string at a first level; obtaining an output value of a second ASR model that outputs a character string at a second level corresponding to the audio signal based on the output value of the first ASR model based on the audio signal being input; and recognizing the speech from the output value of the second ASR model.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类