-
Publication No.: US11508378B2
Publication date: 2022-11-22
Application No.: US16661658
Application date: 2019-10-23
Inventors: Kwangyoun Kim, Kyungmin Lee, Youngho Han, Sungsoo Kim, Sichen Jin, Jisun Park, Yeaseul Song, Jaewon Lee
Abstract: An electronic device is provided. The electronic device includes a microphone to receive audio, a communicator, a memory configured to store computer-executable instructions, and a processor configured to execute the computer-executable instructions. The processor is configured to: determine whether the received audio includes a predetermined trigger word; based on determining that the predetermined trigger word is included in the received audio, activate a speech recognition function of the electronic device; detect a movement of a user while the speech recognition function is activated; and, based on detecting the movement of the user, transmit a control signal to a second electronic device to activate a speech recognition function of the second electronic device.
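The hand-off flow in this abstract can be illustrated with a minimal sketch. Everything named here (the `Device` class, the `TRIGGER_WORD` value, the method names) is hypothetical and chosen only for illustration; the patent does not specify an implementation, and a plain substring match stands in for real trigger-word detection.

```python
from dataclasses import dataclass

# Hypothetical trigger phrase; the abstract only says "predetermined trigger word".
TRIGGER_WORD = "hello assistant"


@dataclass
class Device:
    """Toy stand-in for an electronic device with a speech recognition function."""
    name: str
    recognition_active: bool = False

    def on_audio(self, transcript: str) -> None:
        # Activate speech recognition only when the trigger word is present.
        if TRIGGER_WORD in transcript.lower():
            self.recognition_active = True

    def receive_control_signal(self) -> None:
        # A control signal from another device activates recognition here.
        self.recognition_active = True

    def on_user_movement(self, toward: "Device") -> bool:
        # Hand off only while this device's recognition is active.
        if not self.recognition_active:
            return False
        toward.receive_control_signal()
        return True


kitchen = Device("kitchen speaker")
living_room = Device("living room TV")
kitchen.on_audio("Hello assistant, what is the weather?")
handed_off = kitchen.on_user_movement(toward=living_room)
```

In this sketch the second device is activated directly by a method call; an actual system would send the control signal over the communicator mentioned in the abstract.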
-
Publication No.: US11830502B2
Publication date: 2023-11-28
Application No.: US18057491
Application date: 2022-11-21
Inventors: Kwangyoun Kim, Kyungmin Lee, Youngho Han, Sungsoo Kim, Sichen Jin, Jisun Park, Yeaseul Song, Jaewon Lee
Abstract: An electronic device is provided. The electronic device includes a microphone to receive audio, a communicator, a memory configured to store computer-executable instructions, and a processor configured to execute the computer-executable instructions. The processor is configured to: determine whether the received audio includes a predetermined trigger word; based on determining that the predetermined trigger word is included in the received audio, activate a speech recognition function of the electronic device; detect a movement of a user while the speech recognition function is activated; and, based on detecting the movement of the user, transmit a control signal to a second electronic device to activate a speech recognition function of the second electronic device.
-
Publication No.: US20220165291A1
Publication date: 2022-05-26
Application No.: US17574214
Application date: 2022-01-12
Inventors: Sichen Jin, Kwangyoun Kim, Sungsoo Kim, Junmo Park, Dhairya Sandhyana, Changwoo Han
Abstract: An electronic apparatus including a processor connected with a microphone, a memory, and a communication interface. The processor is configured to: based on receiving a user voice through the microphone, acquire an operation result by inputting the user voice into a first neural network model; identify at least one device corresponding to the user voice by inputting the operation result into a second neural network model; and control the communication interface to transmit the operation result to the at least one device. The first neural network model includes only those layers of a third neural network model, trained to identify text from a voice, that were additionally trained, and the second neural network model is trained to identify a device corresponding to a voice.
-
Publication No.: US11893980B2
Publication date: 2024-02-06
Application No.: US17430614
Application date: 2021-06-22
Inventors: Sichen Jin, Kwangyoun Kim, Sungsoo Kim, Junmo Park, Dhairya Sandhyana, Changwoo Han
IPC classification: G10L15/183, H04N21/488, G06V10/20, G10L15/26
CPC classification: G10L15/183, G06V10/255, G10L15/26, H04N21/4884
Abstract: An electronic apparatus and a control method thereof are provided. The electronic apparatus includes a communication interface configured to receive content comprising image data and speech data; a memory configured to store a language contextual model trained with relevance between words; a display; and a processor configured to: extract an object and a character included in the image data, identify an object name of the object and the character, generate a bias keyword list comprising an image-related word that is associated with the image data, based on the identified object name and the identified character, convert the speech data to text based on the bias keyword list and the language contextual model, and control the display to display the text converted from the speech data as a caption.
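As a rough illustration of how a bias keyword list might influence speech-to-text output, the sketch below rescores candidate transcriptions so that hypotheses containing image-related words are preferred. The function names, the additive `boost` parameter, and the scoring scheme are all assumptions made for this example; the abstract does not describe how the bias keyword list and the language contextual model are actually combined.

```python
def build_bias_keywords(object_names, characters):
    """Merge detected object names and on-screen text into a deduplicated list."""
    keywords = []
    for word in list(object_names) + list(characters):
        word = word.lower().strip()
        if word and word not in keywords:
            keywords.append(word)
    return keywords


def pick_caption(hypotheses, bias_keywords, boost=1.0):
    """Choose the hypothesis with the best score after keyword boosting.

    hypotheses: list of (text, score) pairs, where a higher score is better.
    boost: hypothetical bonus added per word that appears in bias_keywords.
    """
    def biased_score(hypothesis):
        text, score = hypothesis
        hits = sum(1 for word in text.lower().split() if word in bias_keywords)
        return score + boost * hits

    return max(hypotheses, key=biased_score)[0]


keywords = build_bias_keywords(object_names=["Sea", "boat"], characters=["CALM"])
candidates = [("the see is calm", -1.0), ("the sea is calm", -1.2)]
caption = pick_caption(candidates, keywords)
```

Here the acoustically stronger but wrong hypothesis ("see") loses to the one that matches the image-derived keyword "sea", which is the general effect a bias list is meant to have.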
-
Publication No.: US11514916B2
Publication date: 2022-11-29
Application No.: US16992943
Application date: 2020-08-13
Inventors: Chanwoo Kim, Sichen Jin, Kyungmin Lee, Dhananjaya N. Gowda, Kwangyoun Kim
Abstract: A server for supporting speech recognition of a device and an operation method of the server are provided. The server and method identify a plurality of estimated character strings from a first character string, obtain a second character string based on the plurality of estimated character strings, and transmit the second character string to the device. The first character string is output, via speech recognition, from a speech signal input to the device.
-
Publication No.: US11783838B2
Publication date: 2023-10-10
Application No.: US18057491
Application date: 2022-11-21
Inventors: Kwangyoun Kim, Kyungmin Lee, Youngho Han, Sungsoo Kim, Sichen Jin, Jisun Park, Yeaseul Song, Jaewon Lee
Abstract: An electronic device is provided. The electronic device includes a microphone to receive audio, a communicator, a memory configured to store computer-executable instructions, and a processor configured to execute the computer-executable instructions. The processor is configured to: determine whether the received audio includes a predetermined trigger word; based on determining that the predetermined trigger word is included in the received audio, activate a speech recognition function of the electronic device; detect a movement of a user while the speech recognition function is activated; and, based on detecting the movement of the user, transmit a control signal to a second electronic device to activate a speech recognition function of the second electronic device.
-
Publication No.: US20230130396A1
Publication date: 2023-04-27
Application No.: US17968517
Application date: 2022-10-18
Inventors: Jinhwan PARK, Sungsoo Kim, Sichen Jin, Junmo Park, Dhairya Sandhyana, Changwoo Han
Abstract: An electronic apparatus includes a memory storing a speech recognition model and first recognition information corresponding to a first user voice obtained through the speech recognition model, the speech recognition model including a first network, a second network, and a third network; and a processor configured to: obtain a first vector by inputting voice data corresponding to a second user voice to the first network, obtain a second vector by inputting the first recognition information to the second network, which generates a vector based on first weight information, and obtain second recognition information corresponding to the second user voice by inputting the first vector and the second vector to the third network, which generates recognition information based on second weight information, wherein at least a part of the second weight information is the same as the first weight information.