Patent search ap:("ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE") AND inv:"Seung Hi KIM" Page 1

1.

发明申请
VOICE RECOGNITION SYSTEM FOR REPLACING SPECIFIC DOMAIN, MOBILE DEVICE AND METHOD THEREOF 有权
Title translation: 用于替换特定域，移动设备的语音识别系统及其方法

公开(公告)号：US20150340028A1

公开(公告)日：2015-11-26

申请号：US14602586

申请日：2015-01-22

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung Hi KIM , Sang Hun Kim , Ki Hyun Kim , Sang Kyu Park , Soo Jong Lee

IPC: G10L15/08

CPC classification number: G10L15/08 , G10L15/30 , G10L2015/088 , G10L2015/228

Abstract: The present invention relates to a voice recognition system for replacing a specific domain, a mobile device, and a method thereof, and more particularly, to a technology that divides a search space for voice recognition into a general domain search space and a specific domain search space.A voice recognition system according to an exemplary embodiment of the present invention includes: a mobile terminal receiving a voice recognition target word from a user; and a voice recognition server dividing a search space for voice recognition into a general domain search space and a specific domain search space and storing the spaces, and performing voice recognition for the voice recognition target word through linkage of the general domain search space and the specific domain search space.

Abstract translation: 本发明涉及用于替换特定域的语音识别系统，移动装置及其方法，更具体地说，涉及一种将用于语音识别的搜索空间划分成一般域搜索空间和特定域搜索的技术空间。根据本发明的示例性实施例的语音识别系统包括：移动终端，从用户接收语音识别目标词; 以及语音识别服务器，用于将用于语音识别的搜索空间划分为通用域搜索空间和特定域搜索空间并存储空间，并且通过一般域搜索空间和特定的域搜索空间的链接来对语音识别目标词进行语音识别域搜索空间。

2.

发明申请
EMERGENCY REPORTING SYSTEM AND METHOD FOR THE SOCIALLY DISADVANTAGED 有权

公开(公告)号：US20220141328A1

公开(公告)日：2022-05-05

申请号：US17517437

申请日：2021-11-02

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Myung Sun BAEK , Won Joo PARK , Seung Hi KIM , Yong Jin KIM , Young Soo PARK , Jun Seong BANG , Sang Yun LEE , Yong Tae LEE , Eui Suk JUNG

IPC: H04M1/72424 , H04M1/72421 , H04W4/90 , G06F40/58 , G06N20/00

Abstract: The present invention relates to an emergency reporting system and method for the socially disadvantaged. The emergency reporting system for the socially disadvantaged according to the present invention includes a user terminal configured to receive an emergency report input in a preset manner according to environment information set by the socially disadvantaged, generate an emergency report message, and transmit the emergency report message, and a server configured to receive the emergency report message and transmit a dispatch notification signal.

3.

发明公开
VOICE RECOGNITION DEVICE HAVING BARGE-IN FUNCTION AND METHOD THEREOF 审中-公开

公开(公告)号：US20240212681A1

公开(公告)日：2024-06-27

申请号：US18498241

申请日：2023-10-31

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Min Kyu LEE , Seung Hi KIM , Sanghun KIM , Jeonguk BANG , Seung YUN

IPC: G10L15/22 , G06V40/16 , G10L13/02 , G10L17/00 , H04N23/611

CPC classification number: G10L15/22 , G06V40/172 , G10L13/02 , G10L17/00 , H04N23/611

Abstract: A voice recognition device having a barge-in function and a method thereof are proposed.
In an exemplary embodiment, there are disclosed an intelligent robot and a method for operating the intelligent robot, including an input unit for receiving a user's voice data, one or more processors, and an output unit for outputting a response generated on a basis of the user's voice data, wherein the processors generate the response corresponding to the users' voice data while maintaining a listening mode for identifying a dialogue partner by using the user's face image data and the user's voice data, and perform a speaking mode for control so as to perform an operation corresponding to the response.

4.

发明申请
APPARATUS AND METHOD FOR SPEECH RECOGNITION 有权
Title translation: 用于语音识别的装置和方法

公开(公告)号：US20130297304A1

公开(公告)日：2013-11-07

申请号：US13803141

申请日：2013-03-14

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung Hi KIM , Sang Hun KIM

IPC: G10L15/22

CPC classification number: G10L15/22 , G10L2015/228

Abstract: Disclosed is an apparatus for speech recognition and automatic translation operated in a PC or a mobile device. The apparatus for speech recognition according to the present invention includes a display unit that displays a screen for selecting a domain as a unit for a speech recognition region previously sorted for speech recognition to a user; a user input unit that receives a selection of a domain from the user; and a communication unit that transmits the user selection information for the domain. According to the present invention, the apparatus for speech recognition using an intuitive and simple user interface is provided to a user to enable the user to easily select/correct a designation domain of a speech recognition system and improve accuracy and performance of speech recognition and automatic translation by the designated system for speech recognition.

Abstract translation: 公开了一种用于在PC或移动设备中操作的语音识别和自动翻译的装置。根据本发明的用于语音识别的装置包括显示单元，其将用于选择域的屏幕显示为用于对用户进行语音识别的先前排序的语音识别区域的单元; 用户输入单元，其从用户接收域的选择; 以及发送域的用户选择信息的通信单元。根据本发明，使用直观且简单的用户界面的用于语音识别的装置被提供给用户，以使得用户能够容易地选择/校正语音识别系统的指定域，并提高语音识别的精度和性能并且自动地由指定的语音识别系统翻译。

5.

发明申请
METHOD AND APPARATUS FOR IMPROVING PERFORMANCE OF ARTIFICIAL INTELLIGENCE MODEL USING SPEECH RECOGNITION RESULTS AS TEXT INPUT 有权

公开(公告)号：US20240420682A1

公开(公告)日：2024-12-19

申请号：US18585204

申请日：2024-02-23

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung Hi KIM , Jeong Uk BANG , Seung YUN

IPC: G10L15/06 , G10L15/26

Abstract: The present disclosure relates to a method and device for improving the performance of an AI model that uses voice recognition results as text input. A method of training an AI model according to an embodiment of the present disclosure may include: generating first time information on a plurality of words included in a voice and transcription, using a first learning sample including the voice and the transcription; generating second time information by adding a pre-configured delay time to the first time information; generating a modified transcription based on an end time of a last word among the plurality of words and the second time information; and performing training of the AI model based on a second training sample including the voice and the modified transcription.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification