摘要:
An electronic device (300) for speech dialog includes functions that receive (305, 105) a speech phrase that comprises a request phrase that includes an instantiated variable (215), generate (335, 115) pitch and voicing characteristics (315) of the instantiated variable, and performs speech recognition (319, 125) of the instantiated variable to determine a most likely set of acoustic states (235). The electronic device may generate (335, 140) a synthesized value of the instantiated variable using the most likely set of acoustic states and the pitch and voicing characteristics of the instantiated variable. The electronic device may use a table of previously entered values of variables that have been determined to be unique, and in which the values are associated with a most likely set of acoustic states and the pitch and voicing characteristics determined at the receipt of each value to disambiguate (425, 430) a newly received instantiated variable.
摘要:
A method, a system and a computer program product for interpreting a verbal input in a multimodal dialog system are provided. The method includes assigning (302) a confidence value to at least one word generated by a verbal recognition component. The method further includes generating (304) a semantic unit confidence score for the verbal input. The generation of a semantic unit confidence score is based on the confidence value of at least one word and at least one semantic confidence operator.
摘要:
A tailored speaker-independent voice recognition system has a speech recognition dictionary (360) with at least one word (371). That word (371) has at least two transcriptions (373), each transcription (373) having a probability factor (375) and an indicator (377) of whether the transcription is active. When a speech utterance is received (510), the voice recognition system determines (520, 530) the word signified by the speech utterance, evaluates (540) the speech utterance against the transcriptions of the correct word, updates (550) the probability factors for each transcription, and inactivates (570) any transcription that has an updated probability factor that is less than a threshold.
摘要:
Disclosed are a method and wireless device for selecting a content file using speech recognition. The method includes establishing a set of tagged text items wherein each tagged text item is uniquely associated with one content file of the set of content files. At least one audible utterance (226) is received (804) from a user. A phoneme lattice (302) is generated (808) based on the audible utterance (226). A phoneme lattice statistical model is generated (810) based on the phoneme lattice (302). A score is assigned (1008) to the tagged text items based on probabilistic estimates in the phoneme lattice statistical model. A list of high scoring tagged text items is presented (1014) so that a selection of a content file may be made. A word lattice (402) and a word lattice statistical model are also used in some embodiments
摘要:
A method (100) or system (600) of facilitating goal based calendar management can include creating a calendar item (106) from a user entry (102) and an external entry (104), determining (108) if the calendar item is a policy related item, extracting policy attributes (110) for the calendar item, determining (112) if an action is required based on the policy attributes, the user entry, and the external entry, and executing the action based on the policy attributes. The method can further include presenting (114) a suggested action based on the policy attributes and accepting an entry (116) corresponding to the suggested action. The method can further present a modification of the external entry based on the policy attributes and accept an entry corresponding to the modification of the external entry. The method can also present an action related to a common context as a result of an analysis of the attributes.
摘要:
A selective call communication system (100), such as a paging system (100), and a method therefor, for using a lexicon (134) for voice data entry in the selective call communication system (100) is shown. The system (100) associates a Voice User ID (206) to a subscriber ID (202) for one or more subscribers of the system (100). The system (100) accumulates user system statistics (208). Then, based on the accumulated user system statistics (208), the system (100) defines a lexicon (134) to include at least one Voice User ID (206) for the one or more subscribers for voice data entry for the system (100).