Abstract:
A method (40) and system (10 or 200) for sharing a cellular phone include sending (41) a request from a first cellular phone (11) to use a second cellular phone (12) as a server, exchanging (43) audio streams between the cellular phones, receiving (44) a dialing signal at the first cellular phone from the second cellular phone, and forming (45) a call connection between the first cellular phone and a third party (13) via the second cellular phone. The step of sending the request can include sending an SMS message, sending a phone number, or sending a push-to-share request for nearby cellular phones having stronger signal strength. The push-to-share request can be a Bluetooth search of nearby cellular phones having stronger signal strength for their cellular network connection. The method can also include automatically (42) sending the push-to-share request upon detection of a signal strength below a predetermined threshold.
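The automatic push-to-share trigger might be sketched as follows. This is only an illustration under stated assumptions: the threshold value, the `NearbyPhone` type, and the `pick_server` helper are hypothetical names not found in the abstract, and the Bluetooth discovery step is replaced by a plain list of already-discovered phones.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical value; the abstract only says "predetermined threshold".
SIGNAL_THRESHOLD_DBM = -100.0


@dataclass
class NearbyPhone:
    name: str
    signal_dbm: float  # strength of that phone's own cellular connection


def pick_server(own_signal_dbm: float,
                nearby: List[NearbyPhone],
                threshold_dbm: float = SIGNAL_THRESHOLD_DBM) -> Optional[NearbyPhone]:
    """Return the nearby phone to ask to act as server, or None."""
    if own_signal_dbm >= threshold_dbm:
        return None  # own signal is adequate, so no request is sent
    # Only phones with a stronger cellular connection are candidates.
    stronger = [p for p in nearby if p.signal_dbm > own_signal_dbm]
    # Assumption: prefer the strongest candidate; the abstract does not say.
    return max(stronger, key=lambda p: p.signal_dbm, default=None)
```

Choosing the strongest candidate is one possible policy; a real device might instead send the request to every stronger phone and use the first that accepts.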
Abstract:
A method (10) and system (200) for personalized voice dialogue can include tracking (12) a user's use of voice dialogue states or transitions and progressively offering (16) the user more efficient voice dialogue transitions or states, such as voice dialogue transitions or states having fewer and fewer words. The tracking of dialogue states or transitions can include tracking (14) of repeated use of the dialogue states or transitions. A user can be prompted to create a new transition or state. The prompting (18) and the confirmation and verification (20) by the user of a new transition or state can be done using the SCXML language. The method can further include instantiating (21) the new transition or state with voice tags or words and performing (22) speech recognition using the new transition or state. The method can again determine (23) if the new transition or state is a repeat transition or state.
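The tracking of repeated transitions could look roughly like the sketch below. The `TransitionTracker` class and its `repeat_threshold` parameter are hypothetical; the abstract does not say how many repetitions trigger an offer, and the SCXML prompting step is left out.

```python
from collections import Counter


class TransitionTracker:
    """Counts uses of dialogue transitions and flags repeated ones."""

    def __init__(self, repeat_threshold: int = 3):
        # Assumed number of uses after which a shortcut is offered.
        self.repeat_threshold = repeat_threshold
        self.counts: Counter = Counter()

    def record(self, source: str, target: str) -> bool:
        """Count one use of source -> target.

        Returns True exactly once, on the use that reaches the threshold,
        so the user is prompted for a shortcut a single time.
        """
        key = (source, target)
        self.counts[key] += 1
        return self.counts[key] == self.repeat_threshold
```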
Abstract:
A method and apparatus for intention based communications in a mobile communication device are disclosed. The method may include receiving an input from a user of the mobile communication device, converting speech portions in the user's input into linguistic representations, generating a phoneme lattice based on the linguistic representations, scoring stored intention n-grams against the generated phoneme lattice, scoring intentions from the intention n-grams, determining the highest scoring intention, determining whether the highest scoring intention is above a predetermined threshold, and, if the highest scoring intention is above the predetermined threshold, executing the determined intention.
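The scoring and threshold steps can be illustrated with a small sketch. It assumes the phoneme lattice has already been reduced to a mapping from phoneme n-grams to scores, and it scores each intention as the mean of its n-gram scores; both choices, and the `best_intention` name, are assumptions made for illustration rather than details from the abstract.

```python
from typing import Dict, List, Optional, Tuple

Ngram = Tuple[str, ...]


def best_intention(lattice_scores: Dict[Ngram, float],
                   intention_ngrams: Dict[str, List[Ngram]],
                   threshold: float) -> Optional[str]:
    """Return the highest scoring intention if it beats the threshold."""
    scores: Dict[str, float] = {}
    for intention, ngrams in intention_ngrams.items():
        if not ngrams:
            continue
        # Score each stored n-gram against the lattice (0.0 if absent),
        # then combine the n-gram scores into one intention score.
        total = sum(lattice_scores.get(g, 0.0) for g in ngrams)
        scores[intention] = total / len(ngrams)
    if not scores:
        return None
    best = max(scores, key=scores.get)
    # Only an intention strictly above the threshold is executed.
    return best if scores[best] > threshold else None
```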
Abstract:
A method, a system and a computer program product for interpreting a verbal input in a multimodal dialog system are provided. The method includes assigning (302) a confidence value to at least one word generated by a verbal recognition component. The method further includes generating (304) a semantic unit confidence score for the verbal input. The generation of a semantic unit confidence score is based on the confidence value of at least one word and at least one semantic confidence operator.
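One way to read the combination of word confidence values with a semantic confidence operator is sketched below. The operator set (`min`, `mean`, `product`) and the function name are assumptions; the abstract does not name the operators it uses.

```python
from math import prod
from statistics import mean
from typing import Callable, Dict, Sequence

# Assumed operator set; the abstract does not enumerate the operators.
OPERATORS: Dict[str, Callable[[Sequence[float]], float]] = {
    "min": min,
    "mean": mean,
    "product": prod,
}


def semantic_unit_confidence(word_confidences: Sequence[float],
                             operator: str = "min") -> float:
    """Combine per-word confidence values into one semantic unit score."""
    if not word_confidences:
        raise ValueError("at least one word confidence is required")
    return OPERATORS[operator](word_confidences)
```

For example, a semantic unit built from two words with confidences 0.9 and 0.5 is limited by its weakest word under `min`, while `product` penalizes every uncertain word.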
Abstract:
A method and apparatus for performing a voice search in a mobile communication device are disclosed. The method may include receiving a search query from a user of the mobile communication device, converting speech parts in the search query into linguistic representations, comparing the query linguistic representations to the linguistic representations of all items in the voice search database to find matches, wherein the voice search database has indexed all items that are associated with the device, displaying the matches to the user, receiving the user's selection from the displayed matches, and retrieving and executing the user's selection.
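The comparison step might be approximated as in the sketch below, which treats each linguistic representation as a list of phoneme symbols and uses `difflib.SequenceMatcher` from the Python standard library as a stand-in similarity measure. The abstract does not specify the matching technique, so the measure, the `min_ratio` cutoff, and the `voice_search` name are all assumptions.

```python
from difflib import SequenceMatcher
from typing import Dict, List, Tuple


def voice_search(query_phonemes: List[str],
                 index: Dict[str, List[str]],
                 min_ratio: float = 0.6) -> List[Tuple[str, float]]:
    """Return (item, similarity) pairs for indexed items, best match first."""
    matches = []
    for item, phonemes in index.items():
        # Similarity of the two phoneme sequences, between 0.0 and 1.0.
        ratio = SequenceMatcher(None, query_phonemes, phonemes).ratio()
        if ratio >= min_ratio:
            matches.append((item, ratio))
    # The sorted list is what would be displayed for the user to select from.
    return sorted(matches, key=lambda m: m[1], reverse=True)
```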
Abstract:
A wireless transmitter (201) transmits (102) a message intended for at least one wireless personal communications device (202). That message comprises content (203) configured and arranged to at least attempt to prompt a particular operability configuration for the wireless personal communications device that conforms to social standards that correspond to a given local venue (204). Such content can vary with the application setting, with some relevant examples comprising, but not being limited to, information indicative of a degree to which the operability configuration comprises a required operability configuration (as opposed to a voluntary or merely suggested configuration), information indicative of at least one particular capability of the wireless personal communications device to which the operability configuration pertains, and/or information corresponding to a time frame during which the operability configuration is applicable, to note but a few.
Abstract:
Disclosed are a method and wireless device for selecting a content file using speech recognition. The method includes establishing a set of tagged text items wherein each tagged text item is uniquely associated with one content file of a set of content files. At least one audible utterance (226) is received (804) from a user. A phoneme lattice (302) is generated (808) based on the audible utterance (226). A phoneme lattice statistical model is generated (810) based on the phoneme lattice (302). A score is assigned (1008) to the tagged text items based on probabilistic estimates in the phoneme lattice statistical model. A list of high scoring tagged text items is presented (1014) so that a selection of a content file may be made. A word lattice (402) and a word lattice statistical model are also used in some embodiments.
Abstract:
An electronic device (300) for speech dialog includes functions that receive (305, 105) a speech phrase that comprises a request phrase that includes an instantiated variable (215), generate (335, 115) pitch and voicing characteristics (315) of the instantiated variable, and perform speech recognition (319, 125) of the instantiated variable to determine a most likely set of acoustic states (235). The electronic device may generate (335, 140) a synthesized value of the instantiated variable using the most likely set of acoustic states and the pitch and voicing characteristics of the instantiated variable. The electronic device may use a table of previously entered values of variables that have been determined to be unique, and in which the values are associated with a most likely set of acoustic states and the pitch and voicing characteristics determined at the receipt of each value to disambiguate (425, 430) a newly received instantiated variable.
Abstract:
A tailored speaker-independent voice recognition system has a speech recognition dictionary (360) with at least one word (371). That word (371) has at least two transcriptions (373), each transcription (373) having a probability factor (375) and an indicator (377) of whether the transcription is active. When a speech utterance is received (510), the voice recognition system determines (520, 530) the word signified by the speech utterance, evaluates (540) the speech utterance against the transcriptions of the correct word, updates (550) the probability factors for each transcription, and inactivates (570) any transcription that has an updated probability factor that is less than a threshold.
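The update and inactivation steps could be sketched as below. The abstract does not give the update rule, so the exponential moving average used here, the `rate` and `threshold` defaults, and the `Transcription` and `update_transcriptions` names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Transcription:
    phonemes: str
    probability: float
    active: bool = True


def update_transcriptions(transcriptions: List[Transcription],
                          match_scores: Dict[str, float],
                          rate: float = 0.5,
                          threshold: float = 0.1) -> None:
    """Update probability factors in place and inactivate weak transcriptions.

    match_scores maps a transcription's phoneme string to how well the
    received utterance matched it (assumed to lie between 0.0 and 1.0).
    """
    for t in transcriptions:
        if not t.active:
            continue  # inactive transcriptions are no longer evaluated
        score = match_scores.get(t.phonemes, 0.0)
        # Assumed update rule: blend the old factor with the new evidence.
        t.probability = (1.0 - rate) * t.probability + rate * score
        if t.probability < threshold:
            t.active = False
```

With this rule, a transcription that a given speaker never produces decays toward zero and drops out of the active set, which is one way the dictionary could become tailored to that speaker.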