Abstract:
A speech signal processing method of a user terminal includes: receiving a speech signal, detecting a personalized information section including personal information in the speech signal, performing data processing on the personalized information section of the speech signal by using a personalized model generated based on the personal information, and receiving, from a server, a result of the data processing performed by the server on a general information section of the speech signal that is different than the personalized information section of the speech signal.
Abstract:
A speech recognition method includes: storing at least one acoustic model (AM); obtaining, from a device located outside the ASR server, a device ID for identifying the device; obtaining speech data from the device; selecting an AM based on the device ID; performing speech recognition on the speech data by using the selected AM; and outputting a result of the speech recognition.
Abstract:
A speech recognition device is provided. The speech recognition device includes at least one microphone configured to receive a sound signal from a first sound source, and at least one processor configured to determine a direction of the first sound source based on the sound signal, determine whether the direction of the first sound source is in a registered direction, and based on whether the direction of the first sound source is in the registered direction, recognize a speech from the sound signal regardless of whether the sound signal comprises a wake-up keyword.
Abstract:
A device detects a wake-up keyword from a received speech signal of a user by using a wake-up keyword model, and transmits a wake-up keyword detection/non-detection signal and the received speech signal of the user to a speech recognition server. The speech recognition server performs a recognition process on the speech signal of the user by setting a speech recognition model according to the detection or non-detection of the wake-up keyword.