Abstract:
A speech enhancement method that includes: estimating the direction of a speaker from an input signal; generating direction information indicating the estimated direction; detecting the speaker's speech based on the result of the direction estimation; and, based on the result of the speech detection, enhancing the speaker's speech by using the direction information.
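The estimate, detect, enhance sequence can be illustrated with a minimal two-microphone sketch. Everything concrete here is an assumption rather than part of the abstract: the cross-correlation lag search, the energy threshold used as the speech detector, the delay-and-sum averaging used as the enhancement step, and the names `estimate_lag`, `lag_to_angle`, and `enhance_frame`.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value


def estimate_lag(left, right, max_lag):
    """Return the lag (in samples) at which `right` best matches `left`."""
    best_lag, best_score = 0, float("-inf")
    n = len(left)
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += left[i] * right[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def lag_to_angle(lag, sample_rate, mic_spacing):
    """Convert an inter-microphone lag into a broadside angle in degrees."""
    x = SPEED_OF_SOUND * lag / (sample_rate * mic_spacing)
    return math.degrees(math.asin(max(-1.0, min(1.0, x))))


def enhance_frame(left, right, sample_rate=16000, mic_spacing=0.1,
                  max_lag=4, energy_threshold=0.01):
    """Estimate direction, detect speech from the steered signal, then enhance."""
    # Step 1: direction estimation and direction information.
    lag = estimate_lag(left, right, max_lag)
    angle = lag_to_angle(lag, sample_rate, mic_spacing)

    # Align the second channel using the estimated lag (delay-and-sum).
    n = len(left)
    steered = []
    for i in range(n):
        j = i + lag
        r = right[j] if 0 <= j < n else 0.0
        steered.append(0.5 * (left[i] + r))

    # Step 2: speech detection that depends on the direction estimate
    # (hypothetical detector: mean energy of the steered signal).
    energy = sum(x * x for x in steered) / n
    speech = energy > energy_threshold

    # Step 3: enhance only when speech was detected; otherwise mute the frame.
    output = steered if speech else [0.0] * n
    return angle, speech, output
```

A real system would likely replace the energy threshold with a trained voice activity detector and the two-channel average with a multi-microphone beamformer or a spatial filter; the sketch only shows how each stage consumes the result of the previous one.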
Abstract:
Provided are a method and an apparatus for accurately recognizing the start and end of speech based on video recognition. The method includes: determining whether speech starts based on at least one of first video and audio data obtained before conversion into a voice recognition mode; when it is determined that speech has started, converting into the voice recognition mode and generating second audio data that includes a voice command; and determining whether the speech has ended based on at least one of second video and audio data obtained after conversion into the voice recognition mode.
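The start and end logic described above can be sketched as a small state machine. The per-frame cues `lips_moving` and `audio_active`, the `hangover` count of quiet frames that ends a command, and the function name `segment_commands` are all hypothetical; the abstract does not specify how the video and audio cues are computed or combined.

```python
def segment_commands(frames, hangover=2):
    """Split a stream into voice commands using video and audio cues.

    frames: iterable of (lips_moving, audio_active, samples) tuples.
    hangover: number of consecutive inactive frames (>= 1) that ends a command.
    Returns a list of commands, each a list of the `samples` recorded for it.
    """
    commands = []
    current = None  # None means idle; a list means voice recognition mode
    quiet = 0
    for lips_moving, audio_active, samples in frames:
        active = lips_moving or audio_active  # "at least one of" video and audio
        if current is None:
            if active:
                # Speech start detected: enter voice recognition mode.
                current = [samples]
                quiet = 0
        else:
            current.append(samples)
            quiet = 0 if active else quiet + 1
            if quiet >= hangover:
                # Speech end detected: drop the trailing quiet frames.
                commands.append(current[:len(current) - quiet])
                current = None
    if current is not None:
        # Stream ended while still recording; keep what was captured.
        commands.append(current[:len(current) - quiet])
    return commands
```

For example, with `hangover=2`, a stream whose cues are inactive, active, active, inactive, inactive, active yields two commands: the second and third frames, then the last frame.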
Abstract:
A method and system for speech recognition that uses a microphone array directed at the face of a person speaking. The output of the microphone array is read and scanned to determine which part of the face the sound is emitted from, and this information is used as input to a speech recognition system to improve speech recognition.
Abstract:
A voice talk function-enabled terminal and voice talk control method. The terminal comprises a display unit; an audio processing unit; and a control unit configured to: select content corresponding to a first criterion associated with a user in response to a user input; determine a content output scheme based on a second criterion associated with the user; and output the selected content through at least one of the display unit and the audio processing unit according to the content output scheme.
Abstract:
A method and apparatus for performing microphone beamforming. The method includes recognizing a speech of a speaker, searching for a previously stored image associated with the speaker, searching for the speaker through a camera based on the image, recognizing a position of the speaker, and performing microphone beamforming according to the position of the speaker.
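The final step, beamforming according to the speaker's position, could look like the following delay-and-sum sketch. It assumes the camera-based search has already produced a 2-D speaker position in the same coordinate frame as the microphones; the function names, the coordinate convention, and the choice of delay-and-sum itself are illustrative assumptions, not details given in the abstract.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value


def steering_delays(mic_positions, speaker_position, sample_rate):
    """Per-microphone delays (in samples) that time-align sound from the speaker.

    The farthest microphone gets delay 0; closer microphones are delayed so
    that all channels line up with the farthest one.
    """
    distances = [math.dist(m, speaker_position) for m in mic_positions]
    farthest = max(distances)
    return [round((farthest - d) / SPEED_OF_SOUND * sample_rate)
            for d in distances]


def delay_and_sum(channels, delays):
    """Delay each channel by its steering delay and average the results."""
    n = len(channels[0])
    output = []
    for i in range(n):
        total = 0.0
        for channel, delay in zip(channels, delays):
            j = i - delay
            if 0 <= j < n:
                total += channel[j]
        output.append(total / len(channels))
    return output
```

With microphones at `(0.0, 0.0)` and `(0.343, 0.0)` metres and the speaker at `(-1.0, 0.0)`, the nearer microphone is delayed by about 1 ms, which is 16 samples at 16 kHz.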
Abstract:
A household appliance with a functional unit (10), a control unit (14), and an image capture unit (18) for capturing an image signal (PS), which is further developed over conventional household appliances with regard to accessibility, wherein the appliance is equipped with a recognition unit (22) for processing and evaluating an image signal (PS) in order to recognize from it a sound sequence spoken by the operator.