摘要:
A method and apparatus for speaker recognition is provided. One embodiment of a method for determining whether a given speech signal is produced by an alleged speaker, where a plurality of statistical models (including at least one support vector machine) have been produced for the alleged speaker based on a previous speech signal received from the alleged speaker, includes receiving the given speech signal, the speech signal representing an utterance made by a speaker claiming to be the alleged speaker, scoring the given speech signal using at least two modeling systems, where at least one of the modeling systems is a support vector machine, combining scores produced by the modeling systems, with equal weights, to produce a final score, and determining, in accordance with the final score, whether the speaker is likely the alleged speaker.
摘要:
The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the user, making a set of inferences about the user in accordance with the data, and tailoring an output to be delivered to the user in accordance with the set of inferences.
摘要:
The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the user, making a set of inferences about the user in accordance with the data, and tailoring an output to be delivered to the user in accordance with the set of inferences.
摘要:
The present invention relates to a method and apparatus for speaker-calibrated speaker detection. One embodiment of a method for generating a speaker model for use in detecting a speaker of interest includes identifying one or more speech features that best distinguish the speaker of interest from a plurality of impostor speakers and then incorporating the speech features in the speaker model.
摘要:
The present invention relates to a method and apparatus for speaker-calibrated speaker detection. One embodiment of a method for generating a speaker model for use in detecting a speaker of interest includes identifying one or more speech features that best distinguish the speaker of interest from a plurality of impostor speakers and then incorporating the speech features in the speaker model.
摘要:
In one embodiment, the present invention is a method and apparatus for active noise cancellation. In one embodiment, a method for recognizing user speech in an audio signal received by a media system (where the audio signal includes user speech and ambient audio output produced by the media system and/or other devices) includes canceling portions of the audio signal associated with the ambient audio output and applying speech recognition processing to an uncancelled remainder of the audio signal.
摘要:
In one embodiment, the present invention is a method and apparatus for active noise cancellation. In one embodiment, a method for recognizing user speech in an audio signal received by a media system (where the audio signal includes user speech and ambient audio output produced by the media system and/or other devices) includes canceling portions of the audio signal associated with the ambient audio output and applying speech recognition processing to an uncancelled remainder of the audio signal.
摘要:
An apparatus and a concomitant method for recognizing speech in a noisy environment are provided. The present method includes applying a first interpolation weight to a clean speech model to produce a weighted clean speech model, applying a second interpolation weight to a noise model to produce a weighted noise model, and deriving a noisy speech model directly from the weighted clean speech model and the weighted noise model. At least one of the first interpolation weight and the second interpolation weight is computed in a maximum likelihood framework.
摘要:
An apparatus and a concomitant method for recognizing speech in a noisy environment are provided. The present method includes applying a first interpolation weight to a clean speech model to produce a weighted clean speech model, applying a second interpolation weight to a noise model to produce a weighted noise model, and deriving a noisy speech model directly from the weighted clean speech model and the weighted noise model. At least one of the first interpolation weight and the second interpolation weight is computed in a maximum likelihood framework.