摘要:
A spoken language system (100) includes a recognition component (120) that generates (220) a recognized sequence of words from a sequence of received spoken words, and assigns (225) a confidence score to each word in the recognized sequence of words. A presentation component (140) of the spoken language system adjusts (240) nominal acoustical properties of words in a presentation (142) of the recognized sequence of words, the adjustment performed according to the confidence score of each word. The adjustments include adjustments to acoustical features and acoustical contexts of words and groups of words in the presented sequence of words. The presentation component presents (245) the adjusted sequence of words.
摘要:
A method, system and communication device for enabling voice-to-voice searching and ordered content retrieval via audio tags assigned to individual content, which tags generate uniterms that are matched against components of a voice query. The method includes storing content and tagging at least one of the content with an audio tag. The method further includes receiving a voice query to retrieve content stored on the device. When the voice query is received, the method completes a voice-to-voice search utilizing uniterms of the audio tag, scored against the phoneme latent lattice model generated by the voice query to identify matching terms within the audio tags and corresponding stored content. The retrieved content(s) associated with the identified audio tags having uniterms that score within the phoneme lattice model are outputted in an order corresponding to an order in which the uniterms are structured within the voice query.