-
Publication No.: US12027173B2
Publication date: 2024-07-02
Application No.: US17888051
Filing date: 2022-08-15
Inventors: Keunseok Cho, Jaeyoung Roh, Donghan Jang, Jiwon Hyung, Jaewon Lee
Abstract: A method and apparatus for authenticating a user based on an utterance input includes obtaining an input audio signal based on the utterance input of the user; obtaining, from the input audio signal, at least one audio signal of an utterance section and at least one audio signal of a non-utterance section; generating environment information indicating an environment in which the utterance input is received, based on the at least one audio signal of the non-utterance section; obtaining a result of a comparison between the generated environment information and registration environment information indicating an environment in which a registration utterance input corresponding to a previously registered registration audio signal corresponding to the user is received; adjusting an authentication criterion for authenticating the user based on the result of the comparison; and authenticating the user based on the adjusted authentication criterion and the input audio signal.
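
The steps in this abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the energy-gate split into utterance and non-utterance frames, the use of mean noise level as the "environment information", the choice to relax (rather than tighten) the threshold on mismatch, and all names and constants.

```python
import math

def rms(samples):
    """Root-mean-square level of a list of float samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def split_sections(frames, energy_gate=0.1):
    """Split frames into (utterance, non_utterance) with a simple energy gate.
    The gate is a hypothetical stand-in for a real voice activity detector."""
    utterance = [f for f in frames if rms(f) >= energy_gate]
    non_utterance = [f for f in frames if rms(f) < energy_gate]
    return utterance, non_utterance

def environment_info(non_utterance_frames):
    """Assumed environment descriptor: mean noise level of non-utterance frames."""
    if not non_utterance_frames:
        return 0.0
    return sum(rms(f) for f in non_utterance_frames) / len(non_utterance_frames)

def adjusted_threshold(base, env_now, env_registered, relax_per_unit=0.5, floor=0.5):
    """Lower the similarity threshold as the two environments diverge,
    but never below `floor`. The direction of adjustment is an assumption."""
    mismatch = abs(env_now - env_registered)
    return max(floor, base - relax_per_unit * mismatch)

def authenticate(score, frames, env_registered, base=0.8):
    """Accept when the speaker-similarity score meets the adjusted threshold.
    `score` is assumed to come from a separate speaker-verification model."""
    _, non_utterance = split_sections(frames)
    threshold = adjusted_threshold(base, environment_info(non_utterance), env_registered)
    return score >= threshold
```

With a score of 0.75 and a base threshold of 0.8, this sketch rejects the user when the current noise level matches the registered one and accepts when the environments differ enough to relax the threshold to about 0.7.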
-
Publication No.: US11990140B2
Publication date: 2024-05-21
Application No.: US17888051
Filing date: 2022-08-15
Inventors: Keunseok Cho, Jaeyoung Roh, Donghan Jang, Jiwon Hyung, Jaewon Lee
Abstract: A method and apparatus for authenticating a user based on an utterance input includes obtaining an input audio signal based on the utterance input of the user; obtaining, from the input audio signal, at least one audio signal of an utterance section and at least one audio signal of a non-utterance section; generating environment information indicating an environment in which the utterance input is received, based on the at least one audio signal of the non-utterance section; obtaining a result of a comparison between the generated environment information and registration environment information indicating an environment in which a registration utterance input corresponding to a previously registered registration audio signal corresponding to the user is received; adjusting an authentication criterion for authenticating the user based on the result of the comparison; and authenticating the user based on the adjusted authentication criterion and the input audio signal.
-
Publication No.: US11238871B2
Publication date: 2022-02-01
Application No.: US16665532
Filing date: 2019-10-28
Inventors: Chanwoo Kim, Kyungmin Lee, Jaeyoung Roh, Donghan Jang, Keunseok Cho, Jiwon Hyung
Abstract: An electronic apparatus and a control method are provided, including an input interface, a communication interface, a memory including at least one command, and at least one processor configured to control the electronic device and execute the at least one command to receive a user speech through the input interface, determine whether or not the user speech is a speech related to a task requiring user confirmation by analyzing the user speech, generate a question for the user confirmation when it is determined that the user speech is the speech related to the task requiring the user confirmation, and perform a task corresponding to the user speech when a user response corresponding to the question is input through the input interface. Embodiments may use an artificial intelligence model learned according to at least one of machine learning, a neural network, and a deep learning algorithm.
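
The confirm-before-acting flow in this abstract might be sketched as follows. The keyword set stands in for the analysis model the abstract mentions; it, the function names, the question wording, and the "yes" check are all hypothetical and not drawn from the patent.

```python
# Hypothetical set of verbs that mark a task as needing confirmation.
SENSITIVE_VERBS = {"pay", "delete", "send", "unlock"}

def requires_confirmation(speech):
    """Stand-in for the speech analysis step: flag speech containing a sensitive verb."""
    return any(word in SENSITIVE_VERBS for word in speech.lower().split())

def confirmation_question(speech):
    """Generate the question put to the user before the task runs."""
    return f'Do you want me to "{speech}"? (yes/no)'

def handle(speech, ask):
    """Run the task, asking first when confirmation is required.
    `ask` is a caller-supplied function that poses a question and returns the reply."""
    if requires_confirmation(speech):
        reply = ask(confirmation_question(speech))
        if reply.strip().lower() != "yes":
            return "cancelled"
    return f"performed: {speech}"
```

Passing `ask` as a parameter keeps the sketch independent of any particular input interface; a console version could pass the built-in `input`.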
-
Publication No.: US12131738B2
Publication date: 2024-10-29
Application No.: US17944401
Filing date: 2022-09-14
Inventors: Jaeyoung Roh, Hejung Yang, Hojun Jin, Donghan Jang
CPC classification: G10L15/22, G10L15/02, G10L15/26, G10L2015/223
Abstract: An electronic apparatus is disclosed. The electronic apparatus may include a microphone; a communication interface; a memory configured to store at least one instruction; and a processor configured to execute the at least one instruction to: obtain a user voice input for registering a wake-up voice input via the microphone; input the user voice input into a trained neural network model to obtain a first feature vector corresponding to text included in the user voice input; receive a verification data set determined based on information related to the text included in the user voice input from an external server via the communication interface; input a verification voice input included in the verification data set into the trained neural network model to obtain a second feature vector corresponding to the verification voice input; and identify whether to register the user voice input as the wake-up voice input based on a similarity between the first feature vector and the second feature vector.
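
The final decision step, comparing the first and second feature vectors, could look roughly like the sketch below. The abstract does not say how similarity is measured or which direction the decision goes, so cosine similarity, the reject-when-too-similar rule, the 0.9 limit, and the function names are all assumptions; the feature vectors are taken as already produced by the trained model.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def should_register(first_vector, verification_vectors, max_similarity=0.9):
    """Assumed rule: refuse registration when the candidate wake-up input is
    too similar to any verification voice input (i.e. easily confused with it)."""
    return all(
        cosine_similarity(first_vector, second_vector) < max_similarity
        for second_vector in verification_vectors
    )
```

If the intended rule were the opposite (register only when the vectors agree), the comparison in `should_register` would simply be inverted.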
-
Publication No.: US11443750B2
Publication date: 2022-09-13
Application No.: US16700264
Filing date: 2019-12-02
Inventors: Keunseok Cho, Jaeyoung Roh, Donghan Jang, Jiwon Hyung, Jaewon Lee
Abstract: A method and apparatus for authenticating a user based on an utterance input includes obtaining an input audio signal based on the utterance input of the user; obtaining, from the input audio signal, at least one audio signal of an utterance section and at least one audio signal of a non-utterance section; generating environment information indicating an environment in which the utterance input is received, based on the at least one audio signal of the non-utterance section; obtaining a result of a comparison between the generated environment information and registration environment information indicating an environment in which a registration utterance input corresponding to a previously registered registration audio signal corresponding to the user is received; adjusting an authentication criterion for authenticating the user based on the result of the comparison; and authenticating the user based on the adjusted authentication criterion and the input audio signal.
-
Publication No.: US11430448B2
Publication date: 2022-08-30
Application No.: US16692696
Filing date: 2019-11-22
Inventors: Jaeyoung Roh, Keunseok Cho, Jiwon Hyung, Donghan Jang, Jaewon Lee
Abstract: A method and apparatus for processing voice data of a speech received from a speaker are provided. The method includes extracting a speaker feature vector from the voice data of the speech received from a speaker, generating a speaker feature map by positioning the extracted speaker feature vector at a specific position on a multi-dimensional vector space, forming a plurality of clusters indicating features of voices of a plurality of speakers by grouping at least one speaker feature vector positioned on the speaker feature map, and classifying the plurality of speakers according to the plurality of clusters.
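
Grouping speaker feature vectors into clusters can be sketched with a simple greedy scheme. The abstract does not name a clustering algorithm, so the nearest-centroid rule, the Euclidean distance, the `radius` parameter, and the function names below are assumptions chosen only to keep the example short.

```python
import math

def distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def cluster_speakers(vectors, radius=1.0):
    """Assign each vector to the nearest cluster whose centroid lies within
    `radius`; otherwise start a new cluster. Returns one label per vector."""
    clusters = []  # each entry is the list of member vectors of one cluster
    labels = []
    for v in vectors:
        best, best_d = None, radius
        for i, members in enumerate(clusters):
            d = distance(v, centroid(members))
            if d <= best_d:
                best, best_d = i, d
        if best is None:
            clusters.append([v])
            labels.append(len(clusters) - 1)
        else:
            clusters[best].append(v)
            labels.append(best)
    return labels
```

Each label then plays the role of a speaker class: vectors sharing a label are treated as coming from the same speaker.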