摘要:
Described is a technology by which a user's telephone-related data is aggregated from various sources for use in assisting the user with making telephone calls. For example, call history data corresponding to a landline telephone, a mobile telephone and/or an office telephone of the user may be combined. Other sources include a landline telephone service, a mobile telephone service, an enterprise telephone system or server, a computing device, voice mail data, web page data, electronic message content, instant message content, a contacts list, and/or an information data store. The telephone-related data can be processed (e.g., based on frequency and calling patterns) to determine corresponding probability data to help determine a user's intent in locating a particular recipient to call. The user may access the telephone-related data via voice commands input at one of the user's telephones, or by receiving a visible list of at least part of the telephone-related data.
摘要:
Methods and systems for recognizing a spoken alias are disclosed. The present invention includes generating a plurality of alias variations based on a discoverable name and creating a phonetic representation for each of the alias variations. The present invention also includes capturing a phonetic pronunciation of the spoken alias. At least one of the created alias variations that has a phonetic representation that corresponds to the captured phonetic pronunciation is selected.
摘要:
In a method of entering text into a device a first character input is provided that is indicative of a first character of a text entry. Next, a vocalization of the text entry is captured. A probable word candidate is then identified for a first word of the vocalization based upon the first character input and an analysis of the vocalization. Finally, the probable word candidate is displayed for a user.
摘要:
The present invention relates to methods and systems for handling interactions between a user and a computer. In particular, the present invention relates to methods and systems for handling communication messages from different types of communication interfaces.
摘要:
The present invention relates to establishing a media channel and a signaling channel between a client and a server. The media channel uses a chosen codec and protocol for communication. Through the media channel and signaling channel, an application on the client can utilize speech services on the server.
摘要:
The present invention provides a system and method for combining VoiceXML with an speech application development tool such as SALT. In one aspect of the present invention, a VoiceXML module includes VoiceXML executable instructions. A SALT module includes speech application language tags to execute instructions associated with the VoiceXML module.
摘要:
A speech recognition method and system utilize an acoustic model that is capable of providing probabilities for both a large acoustic unit and an acoustic sub-unit. Each of these probabilities describes the likelihood of a set of feature vectors from a series of feature vectors representing a speech signal. The large acoustic unit is formed from a plurality of acoustic sub-units. At least one sub-unit probability and at least on large unit probability from the acoustic model are used by a decoder to generate a score for a sequence of hypothesized words. When combined, the acoustic sub-units associated with all of the sub-unit probabilities used to determine the score span fewer than all of the feature vectors in the series of feature vectors. An overlapping decoding technique is also provided.
摘要:
Systems, methods and computer-storage media are provided for identifying low-match search queries and determining comparable item matches to suggest to the user in response to a low-match query. “Low-match queries” are queries for which an insufficient number of exact item matches are available. In embodiments, exact and/or comparable item matches may be determined via semantic analysis. Also provided are systems, methods and computer-storage media for informing the user, by way of a presented indicator, or the like, that a presented item was selected for presentation based upon a similarity metric rather than being determined an exact match for the input query.
摘要:
This patent application pertains to answer model comparison. One implementation can determine a first frequency at which an individual answer category appears in an individual slot on a query results page when utilizing a first model. The method can ascertain a second frequency at which the individual answer category appears in the individual slot on the query results page when utilizing a second model. The method can calibrate the second model so that the second frequency approaches the first frequency.
摘要:
Methods and system for authenticating a user are disclosed. The present invention includes accessing a collection of personal information related to the user. The present invention also includes performing an authentication operation that is based on the collection of personal information. The authentication operation incorporates at least one dynamic component and prompts the user to give an audible utterance. The audible utterance is compared to a stored voiceprint.