Abstract:
Systems and methods for applying feature-space maximum likelihood linear regression (fMLLR) to correlated features are provided. A method for applying fMLLR to correlated features, comprises mapping the correlated features into an uncorrelated feature space, applying fMLLR in the uncorrelated feature space to obtain fMLLR transformed features, and mapping the fMLLR transformed features back to a correlated feature space.
Abstract:
A method includes providing a deep neural network acoustic model, receiving audio data including one or more utterances of a speaker, extracting a plurality of speech recognition features from the one or more utterances of the speaker, creating a speaker identity vector for the speaker based on the extracted speech recognition features, and adapting the deep neural network acoustic model for automatic speech recognition using the extracted speech recognition features and the speaker identity vector.
Abstract:
A method for spoken term detection, comprising generating a time-marked word list, wherein the time-marked word list is an output of an automatic speech recognition system, generating an index from the time-marked word list, wherein generating the index comprises creating a word loop weighted finite state transducer for each utterance, i, receiving a plurality of keyword queries, and searching the index for a plurality of keyword hits.
Abstract:
A method for spoken term detection, comprising generating a time-marked word list, wherein the time-marked word list is an output of an automatic speech recognition system, generating an index from the time-marked word list, wherein generating the index comprises creating a word loop weighted finite state transducer for each utterance, i, receiving a plurality of keyword queries, and searching the index for a plurality of keyword hits.
Abstract:
A method for spoken term detection, comprising generating a time-marked word list, wherein the time-marked word list is an output of an automatic speech recognition system, generating an index from the time-marked word list, wherein generating the index comprises creating a word loop weighted finite state transducer for each utterance, i, receiving a plurality of keyword queries, and searching the index for a plurality of keyword hits.
Abstract:
Systems and methods for applying feature-space maximum likelihood linear regression (fMLLR) to correlated features are provided. A method for applying fMLLR to correlated features, comprises mapping the correlated features into an uncorrelated feature space, applying fMLLR in the uncorrelated feature space to obtain fMLLR transformed features, and mapping the fMLLR transformed features back to a correlated feature space.