-
公开(公告)号:US20230017927A1
公开(公告)日:2023-01-19
申请号:US17944401
申请日:2022-09-14
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jaeyoung ROH , Hejung YANG , Hojun JIN , Donghan JANG
Abstract: An electronic apparatus is disclosed. The electronic apparatus may include a microphone; a communication interface; a memory configured to store at least one instruction; and a processor configured to execute the at least one instruction to: obtain a user voice input for registering a wake-up voice input via the microphone; input the user voice input into a trained neural network model to obtain a first feature vector corresponding to text included in the user voice input; receive a verification data set determined based on information related to the text included in the user voice input from an external server via the communication interface; input a verification voice input included in the verification data set into the trained neural network model to obtain a second feature vector corresponding to the verification voice input; and identify whether to register the user voice input as the wake-up voice input based on a similarity between the first feature vector and the second feature vector.
-
公开(公告)号:US20250149044A1
公开(公告)日:2025-05-08
申请号:US19013349
申请日:2025-01-08
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jonguk YOO , Dokyun LEE , Jaeyoung ROH , Youngmoon JUNG , Changwoo HAN , Jungwook HWANG
IPC: G10L17/06
Abstract: An electronic device is provided. The electronic device includes: a voice reception unit comprising circuitry, memory storing an artificial intelligence model configured to acquire a voice signal of a user from an audio signal and information on characteristics of a plurality of users, and at least one processor, comprising processing circuitry, individually and/or collectively, configured to: based on an audio signal being received through the voice reception unit, obtain a first audio signal by inputting information on a characteristic of a first user set as a target speaker among the plurality of users and the received audio signal to the artificial intelligence model, based on voice recognition based on the first audio signal failing, identify a similarity between information on a characteristic of a second audio signal excluding the first audio signal among the received audio signals and information on characteristics of remaining users excluding the first user among the plurality of users, and change the target speaker to a second user among the plurality of users.
-
公开(公告)号:US20220392456A1
公开(公告)日:2022-12-08
申请号:US17888051
申请日:2022-08-15
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Keunseok CHO , Jaeyoung ROH , Donghan JANG , Jiwon HYUNG , Jaewon LEE
Abstract: A method and apparatus for authenticating a user based on an utterance input includes obtaining an input audio signal based on the utterance input of the user; obtaining, from the input audio signal, at least one audio signal of an utterance section and at least one audio signal of a non-utterance section; generating environment information indicating an environment in which the utterance input is received, based on the at least one audio signal of the non-utterance section; obtaining a result of a comparison between the generated environment information and registration environment information indicating an environment in which a registration utterance input corresponding to a previously registered registration audio signal corresponding to the user is received; adjusting an authentication criterion for authenticating the user based on the result of the comparison; and authenticating the user based on the adjusted authentication criterion and the input audio signal.
-
公开(公告)号:US20240274128A1
公开(公告)日:2024-08-15
申请号:US18420338
申请日:2024-01-23
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Changwoo HAN , Dokyun LEE , Jungwook HWANG , Jaeyoung ROH , Jonguk YOO , Youngmoon JUNG , Youngo HAN
CPC classification number: G10L15/20 , G06F3/167 , G10L15/083 , G10L15/22 , G10L2015/088 , G10L2015/223
Abstract: An electronic device includes a microphone; at least one memory storing a wake-up word detection model; and at least one processor configured to: obtain a sound signal received through the microphone, input the sound signal into the wake-up word detection model, obtain, as an output of the wake-up word detection model, one or more first probability scores corresponding to one or more sections of the sound signal, wherein each first probability score of the one or more first probability scores represents a probability that a corresponding section of the one or more sections of the sound signal corresponds to a wake-up word, identify a first section of the sound signal, among the one or more sections of the sound signal, that corresponds to a first probability score, among the one or more first probability scores, that exceeds a first threshold value, and based on identifying a predetermined acoustic signal in the sound signal, reduce the first threshold value.
-
公开(公告)号:US20230154470A1
公开(公告)日:2023-05-18
申请号:US17424412
申请日:2021-06-22
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Hejung YANG , Hyungjun LIM , Jaeyoung ROH , Yeaseul SONG , Hojun JIN , Jubum HAN
IPC: G10L17/24 , G06F3/04847 , G06F40/279 , G10L17/14 , G06F3/16
CPC classification number: G10L17/24 , G06F3/04847 , G06F40/279 , G10L17/14 , G06F3/167
Abstract: An electronic apparatus is disclosed. The electronic apparatus may include a microphone; a memory configured to store a wakeup word; and a processor configured to: identify, based on context information of the electronic apparatus, an occurrence of a pre-determined event; change, based on the occurrence of the pre-determined event, a first threshold value for recognizing the wakeup word; obtain, based on a first user voice input received via the microphone, a similarity value between first text information corresponding to the first user voice input and the wakeup word; and perform, based on the similarity value being greater than or equal to the first threshold value, a voice recognition function on second text information corresponding to a second user voice input received via the microphone after the first user voice input.
-
6.
公开(公告)号:US20230162739A1
公开(公告)日:2023-05-25
申请号:US17425169
申请日:2021-06-30
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jubum HAN , Jaewon LEE , Jaeyoung ROH
CPC classification number: G10L17/20 , G10L17/24 , G10L17/18 , G10L21/0224 , G10L17/06 , G06F3/162 , G10L2021/02166
Abstract: An electronic apparatus is provided. The electronic apparatus may include a communication interface; and a processor configured to: control the communication interface to output an audio content signal to a sound input/output device including a speaker and a microphone; based on receiving a sound signal collected via the microphone from the sound input/output device via the communication interface, identify whether the sound signal includes a scene noise signal corresponding to a regular noise generated in a location in which the sound input/output device is located or an event noise signal corresponding to an irregular noise generated in the location in which the sound input/output device is located; based on identifying that the sound signal includes the scene noise signal, perform noise cancelling for the sound signal; and based on identifying that the sound signal includes the event noise signal, control the output of the audio content signal
-
7.
公开(公告)号:US20190244612A1
公开(公告)日:2019-08-08
申请号:US16265237
申请日:2019-02-01
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Youngho HAN , Keunseok CHO , Jaeyoung ROH , Namhoon KIM , Chiyoun PARK , Jongyoub RYU
IPC: G10L15/22 , G10L15/02 , G10L25/84 , G10L15/187
CPC classification number: G10L15/22 , G10L15/02 , G10L15/187 , G10L25/84 , G10L2015/025
Abstract: A method of processing a speech signal for speaker recognition in an electronic apparatus includes: obtaining a speech signal of a first user; extracting a speech feature comprising a feature value from the speech signal; comparing the speech feature extracted from the speech signal of the first user with a predetermined reference value; selecting a first user feature that corresponds to the speech feature of the first user compared with the reference value; generating a recommended phrase used for speaker recognition based on the first user feature; and outputting the recommended phrase.
-
公开(公告)号:US20230079163A1
公开(公告)日:2023-03-16
申请号:US17835590
申请日:2022-06-08
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jeongrae NOH , Jiwoo LEE , Jaeyoung ROH , Seokgyu BAN , Dongjo WOO , Intae JUN
Abstract: An electronic device includes a processor; a first camera module comprising a camera including a first lens assembly having a first field of view (FOV) and a second camera module comprising a camera spaced apart a first camera module and including a second lens assembly having a second FOV narrower than the first FOV; wherein the first camera module includes an image sensor and a filter including a glass plate spaced apart the image sensor and disposed on the image sensor, and a layer disposed on the glass plate and configured to absorb a portion of infrared light among the light transmitted through the first lens assembly, wherein the processor is configured to obtain depth information about on the subject located within the second FOV based on data about the light passing through the filter, which is obtained through the image sensor.
-
公开(公告)号:US20200175993A1
公开(公告)日:2020-06-04
申请号:US16700264
申请日:2019-12-02
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Keunseok CHO , Jaeyoung ROH , Donghan JANG , Jiwon HYUNG , Jaewon LEE
Abstract: A method and apparatus for authenticating a user based on an utterance input includes obtaining an input audio signal based on the utterance input of the user; obtaining, from the input audio signal, at least one audio signal of an utterance section and at least one audio signal of a non-utterance section; generating environment information indicating an environment in which the utterance input is received, based on the at least one audio signal of the non-utterance section; obtaining a result of a comparison between the generated environment information and registration environment information indicating an environment in which a registration utterance input corresponding to a previously registered registration audio signal corresponding to the user is received; adjusting an authentication criterion for authenticating the user based on the result of the comparison; and authenticating the user based on the adjusted authentication criterion and the input audio signal.
-
公开(公告)号:US20200168230A1
公开(公告)日:2020-05-28
申请号:US16692696
申请日:2019-11-22
Applicant: Samsung Electronics Co., Ltd.
Inventor: Jaeyoung ROH , Keunseok CHO , Jiwon HYUNG , Donghan JANG , Jaewon LEE
Abstract: A method and apparatus for processing voice data of a speech received from a speaker are provided. The method includes extracting a speaker feature vector from the voice data of the speech received from a speaker, generating a speaker feature map by positioning the extracted speaker feature vector at a specific position on a multi-dimensional vector space, forming a plurality of clusters indicating features of voices of a plurality of speakers by grouping at least one speaker feature vector positioned on the speaker feature map, and classifying the plurality of speakers according to the plurality of clusters.
-
-
-
-
-
-
-
-
-