摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.
摘要:
A parameter prediction device includes: an environmental characteristic acquirer that acquires an environmental characteristic quantity set which quantifies one or more characteristics of a sound collection environment for an acoustic signal; a target setter that sets a target evaluation value set which provides one or more values obtained by quantifying one or more performances of processing of the acoustic signal, or one or more evaluation values of a processed acoustic signal; and a first predictor that inputs the environmental characteristic quantity set and the target evaluation value set as independent variables to a first prediction model, and predicts a control parameter set for controlling the acoustic signal processing.
摘要:
In one aspect, the present application is directed to a device for providing different levels of sound quality in an audio entertainment system. The device includes a speech enhancement system with a reference signal modification unit and a plurality of acoustic echo cancellation filters. Each acoustic echo cancellation filter is coupled to a playback channel. The device includes an audio playback system with loudspeakers. Each loudspeaker is coupled to a playback channel. At least one of the speech enhancement system and the audio playback system operates according to a full sound quality mode and a reduced sound quality mode. In the full sound quality mode, all of the playback channels contain non-zero output signals. In the reduced sound quality mode, a first subset of the playback channels contains non-zero output signals and a second subset of the playback channels contains zero output signals.
摘要:
There is provided an information processing device including: a communication determination unit configured to determine, on the basis of a feature value extracted from speech data including at least a sound of speech of a user, whether communication occurs between users including the user, the feature value indicating an interaction between the users.
摘要:
A guiding device, a guiding method, a program, and an information storage medium are provided which can perform output control of a guidance related to a volume at which to input voice using the recognition ranking of a received voice. A voice receiving section (46) receives a voice. When given information is identified as a result of recognition of the voice, an output control section (58) performs control so as to output a guidance related to a volume at which to input voice in a mode corresponding to the recognition ranking of the information.
摘要:
Example implementations disclosed herein can be used to generate a local sound signal corresponding to utterances of a user and other sounds detected by a microphone array coupled to a communication device and to condition the local sound signals to separate the utterances of the user from the other sounds to generate a conditioned sound signal. The conditioned sound signals are evaluated to generate a local quality score for the conditioned sound signals, and when the local quality score of the conditioned sound signals is below a threshold associated with the communication device, a local feedback message indicating a local user position change can be generated. The local feedback message can include instructions for the user to move to another location to improve the quality of the condition sound signals.
摘要:
An automated communication system with an associated method for presenting customized voices is disclosed. The system which performs a predetermined task accepts information regarding an intended user indicating the intended user's identity, preferences, etc. Next, the system customizes one or more voices for the intended user based on the accepted information. The system then presents to the intended user one or more audible communications converted from text associated with a predetermined task performed by the system using the one or more customized voices.