Abstract:
The present invention relates to a voice recognition system for replacing a specific domain, a mobile device, and a method thereof, and more particularly, to a technology that divides a search space for voice recognition into a general domain search space and a specific domain search space. A voice recognition system according to an exemplary embodiment of the present invention includes: a mobile terminal that receives a voice recognition target word from a user; and a voice recognition server that divides a search space for voice recognition into a general domain search space and a specific domain search space, stores the two spaces, and performs voice recognition for the voice recognition target word by linking the general domain search space and the specific domain search space.
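The split between a general and a replaceable specific domain search space can be illustrated with a toy sketch. The abstract does not state how the two spaces are linked or what they contain, so everything below is an assumption made for illustration: the class name LinkedSearchSpace, the methods replace_domain and lookup, the use of plain word sets in place of real recognition search graphs, and the choice to consult the specific domain first.

```python
class LinkedSearchSpace:
    """Toy stand-in for a search space split into two parts (hypothetical)."""

    def __init__(self, general_words, domain_words):
        # Assumption: each search space is modeled as a plain set of words.
        self.general = set(general_words)
        self.domain = set(domain_words)

    def replace_domain(self, domain_words):
        # Swap only the specific domain part; the general part is untouched.
        self.domain = set(domain_words)

    def lookup(self, word):
        # Assumed linkage order: specific domain first, then general domain.
        if word in self.domain:
            return "specific"
        if word in self.general:
            return "general"
        return None


space = LinkedSearchSpace({"call", "play"}, {"aspirin"})
before = space.lookup("aspirin")   # found in the specific domain
space.replace_domain({"torque"})   # swap in a different specific domain
after = space.lookup("aspirin")    # no longer in either space
```

In this sketch, replacing the specific domain leaves the general vocabulary untouched, which is one plausible reason to store the two spaces separately.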
Abstract:
The present invention relates to an emergency reporting system and method for the socially disadvantaged. The emergency reporting system for the socially disadvantaged according to the present invention includes a user terminal configured to receive an emergency report input in a preset manner according to environment information set by the socially disadvantaged, generate an emergency report message, and transmit the emergency report message, and a server configured to receive the emergency report message and transmit a dispatch notification signal.
Abstract:
A voice recognition device having a barge-in function and a method thereof are proposed. In an exemplary embodiment, there are disclosed an intelligent robot and a method for operating the intelligent robot, including an input unit for receiving a user's voice data, one or more processors, and an output unit for outputting a response generated on the basis of the user's voice data, wherein the processors generate the response corresponding to the user's voice data while maintaining a listening mode for identifying a dialogue partner by using the user's face image data and the user's voice data, and operate in a speaking mode that controls the robot to perform an operation corresponding to the response.
Abstract:
Disclosed is an apparatus for speech recognition and automatic translation operated in a PC or a mobile device. The apparatus for speech recognition according to the present invention includes a display unit that displays to a user a screen for selecting a domain, the domain being a unit of a speech recognition region classified in advance for speech recognition; a user input unit that receives a selection of a domain from the user; and a communication unit that transmits the user selection information for the domain. According to the present invention, an apparatus for speech recognition with an intuitive and simple user interface is provided, enabling the user to easily select or correct the designated domain of a speech recognition system and improving the accuracy and performance of speech recognition and automatic translation by the designated speech recognition system.
Abstract:
The present disclosure relates to a method and device for improving the performance of an AI model that uses voice recognition results as text input. A method of training an AI model according to an embodiment of the present disclosure may include: generating first time information on a plurality of words included in a voice and a transcription, using a first training sample including the voice and the transcription; generating second time information by adding a pre-configured delay time to the first time information; generating a modified transcription based on an end time of a last word among the plurality of words and the second time information; and performing training of the AI model based on a second training sample including the voice and the modified transcription.
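The transcription-modification steps can be sketched as follows. This is only one possible reading of the abstract, which does not say how the end time of the last word and the delayed times are combined. The assumption here is that a word is kept only if its delayed end time does not exceed the original end time of the last word, which mimics a recognizer that lags behind the audio. The names WordTiming and modified_transcription are hypothetical, as are the example timings and the 0.5 second delay.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WordTiming:
    word: str
    end_time: float  # seconds; stands in for the "first time information"


def modified_transcription(timings, delay):
    """Drop words whose delayed end time falls after the utterance ends.

    Assumption (not stated in the abstract): the cutoff is the original
    end time of the last word, and a word survives only if
    end_time + delay <= cutoff.
    """
    if not timings:
        return ""
    cutoff = timings[-1].end_time
    # "Second time information": each first-time value plus the fixed delay.
    kept = [t.word for t in timings if t.end_time + delay <= cutoff]
    return " ".join(kept)


timings = [
    WordTiming("turn", 0.4),
    WordTiming("on", 0.7),
    WordTiming("the", 0.9),
    WordTiming("lights", 1.5),
]
result = modified_transcription(timings, delay=0.5)
# result == "turn on the"  (the delayed end of "lights" is 2.0 > 1.5)
```

Under this reading, the second training sample would pair the original voice with the shortened transcription, so the model sees text resembling what a delayed recognizer would have produced by the end of the utterance.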