-
公开(公告)号:US11961522B2
公开(公告)日:2024-04-16
申请号:US17296806
申请日:2019-11-22
Applicant: Samsung Electronics Co., Ltd.
Inventor: Chanwoo Kim , Dhananjaya N. Gowda , Sungsoo Kim , Minkyu Shin , Larry Paul Heck , Abhinav Garg , Kwangyoun Kim , Mehul Kumar
Abstract: The disclosure relates to an electronic apparatus for recognizing user voice and a method of recognizing, by the electronic apparatus, the user voice. According to an embodiment, the method of recognizing the user voice includes obtaining an audio signal segmented into a plurality of frame units, determining an energy component for each filter bank by applying a filter bank distributed according to a preset scale to a frequency spectrum of the audio signal segmented into the frame units, smoothing the determined energy component for each filter bank, extracting a feature vector of the audio signal based on the smoothed energy component for each filter bank, and recognizing the user voice in the audio signal by inputting the extracted feature vector to a voice recognition model.
-
公开(公告)号:US11521619B2
公开(公告)日:2022-12-06
申请号:US16990343
申请日:2020-08-11
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Chanwoo Kim , Dhananjaya N. Gowda , Abhinav Garg , Kyungmin Lee
IPC: G10L15/30 , G10L15/06 , G10L15/22 , G10L19/008 , G10L19/06
Abstract: Provided are a system and method for modifying a speech recognition result. The method includes: receiving, from a device, text output from an automatic speech recognition (ASR) model of the device; identifying at least one domain related to the received text; selecting, from among a plurality of text modification models included in the server, at least one text modification model corresponding to the identified at least one domain; and modifying the received text by using the selected at least one text modification model.
-
公开(公告)号:US11302331B2
公开(公告)日:2022-04-12
申请号:US16750274
申请日:2020-01-23
Applicant: Samsung Electronics Co., Ltd.
Inventor: Dhananjaya N. Gowda , Kwangyoun Kim , Abhinav Garg , Chanwoo Kim
Abstract: Provided are an electronic device for recognizing speech of a user, and a method, performed by the electronic device, of recognizing speech. The method includes obtaining an audio signal based on a speech input based on the audio signal being input, obtaining an output value of a first automatic speech recognition (ASR) model that outputs a character string at a first level; obtaining an output value of a second ASR model that outputs a character string at a second level corresponding to the audio signal based on the output value of the first ASR model based on the audio signal being input; and recognizing the speech from the output value of the second ASR model.
-
-