摘要:
According to one embodiment, a voice recognition apparatus includes a determination unit, an estimating unit, and a voice recognition unit. The determination unit determines whether a component with a frequency of not less than 1000 Hz and with a level not lower than a predetermined level is included in a sound input from a plurality of microphones. The estimating unit estimates a sound source direction of the sound when the determination unit determines that the component is included in the sound. The voice recognition unit recognizes whether the sound obtained in the sound source direction coincides with a voice model registered beforehand.
摘要:
According to one embodiment, a voice recognition apparatus includes a determination unit, an estimating unit, and a voice recognition unit. The determination unit determines whether a component with a frequency of not less than 1000 Hz and with a level not lower than a predetermined level is included in a sound input from a plurality of microphones. The estimating unit estimates a sound source direction of the sound when the determination unit determines that the component is included in the sound. The voice recognition unit recognizes whether the sound obtained in the sound source direction coincides with a voice model registered beforehand.
摘要:
A device for speech input includes a speech input unit configured to convert a speech of a user to a speech signal; an angle detection unit configured to detect an angle of the speech input unit; a distance detection unit configured to detect a distance between the speech input unit and the user; and an input switch unit configured to control on and off of the speech input unit based on the angle and the distance.
摘要:
A device for speech input includes a speech input unit configured to convert a speech of a user to a speech signal; an angle detection unit configured to detect an angle of the speech input unit; a distance detection unit configured to detect a distance between the speech input unit and the user; and an input switch unit configured to control on and off of the speech input unit based on the angle and the distance.
摘要:
A dialog generation apparatus includes a reception unit configured to receive a first text from a dialog partner, an information storage unit configured to store profile information specific to a person who can be the dialog partner and a fixed-pattern text associated with the person, a presentation unit configured to present the first text to a user, a speech recognition unit configured to perform speech recognition on speech the user has uttered about the first text presented to the user, and generate a speech recognition result showing the content of the speech, a generation unit configured to generate a second text from the profile information about the dialog partner, fixed-pattern text about the dialog partner, and the speech recognition result, and a transmission unit configured to transmit the second text to the dialog partner.
摘要:
An interface apparatus of an embodiment of the present invention is configured to perform a device operation in response to a voice instruction from a user. The interface apparatus detects a state change or state continuation of a device or the vicinity of the device; queries a user by voice about the meaning of the detected state change or state continuation; has a speech recognition unit recognize a teaching speech uttered by the user in response to the query; associates a recognition result for the teaching speech with a detection result for the state change or state continuation, and accumulate a correspondence between the recognition result for the teaching speech and the detection result for the state change or state continuation; has a speech recognition unit recognize an instructing speech uttered by a user for a device operation; compares a recognition result for the instructing speech with accumulated correspondences between recognition results for teaching speeches and detection results for state changes or state continuations, and select a device operation specified by a detection result for a state change or state continuation that corresponds to the recognition result for the instructing speech; and performs the selected device operation.
摘要:
A signal acquisition unit acquires a status signal from a home appliance. An expression unit expresses the status signal to a user. A reaction acquisition unit acquires a reaction signal from the user in response to the status signal expressed. A registration unit registers relativity in the status signal and the reaction signal in a storage unit. A first comparison unit compares an acquired status signal by the signal acquisition unit to the stored status signal in the storage unit. If the acquired status signal matches the stored status signal, the expression unit expresses the reaction signal related to the stored status signal. If the acquired status signal does not match the stored status signal, the expression unit expresses the acquired status signal, and the registration unit registers relativity in the acquired status signal and a reaction signal from the user in response to the acquired status signal expressed.
摘要:
A voice recognition apparatus includes: a voice recognition module that performs a voice recognition for an audio signal during a voice period; a distance measurement module that measures a current distance between the user and an voice input module; a calculation module that calculates a recommended distance range, in which being estimated that an S/N ratio exceeds a first threshold, based on the voice characteristic; and a display module that displays the recommended distance range and the current distance.
摘要:
A voice recognition apparatus includes: a voice recognition module that performs a voice recognition for an audio signal during a voice period; a distance measurement module that measures a current distance between the user and an voice input module; a calculation module that calculates a recommended distance range, in which being estimated that an S/N ratio exceeds a first threshold, based on the voice characteristic; and a display module that displays the recommended distance range and the current distance.
摘要:
According to one aspect of the invention, a speech recognizer includes: an audio data acquiring portion configured to acquire audio data via a microphone; a speech section detecting portion configured to detect a talking start time and a talking end time based on the audio data; a spoken word identifying portion configured to identify the audio in a speech section from the talking start time to the talking end time; and a noise suppressing portion configured to suppress a generation of a noise from an electrical noise source for the speech section.