摘要:
The claimed subject matter relates to an architecture that can mitigate privacy concerns in connection with ad targeting or data collection. In particular, architecture can be included in a personal mobile communication device such as a cell phone. During communication transactions between the host device and a peer device, shared information can be extracted either from content included in the communication or from metadata. Based upon the shared information, a social graph maintained on the host device can be updated. In addition, the host device can receive a large set of ads and select or tailor a custom ad from the set based upon the social graph.
摘要:
The claimed subject matter relates to an architecture that can mitigate privacy concerns in connection with ad targeting or data collection. In particular, architecture can be included in a personal mobile communication device such as a cell phone. During communication transactions between the host device and a peer device, shared information can be extracted either from content included in the communication or from metadata. Based upon the shared information, a social graph maintained on the host device can be updated. In addition, the host device can receive a large set of ads and select or tailor a custom ad from the set based upon the social graph.
摘要:
The claimed subject matter provides systems and/or methods that effectuate distribution of advertising as real-time video calls. The system can include devices that detect whether or not a mobile device associated with a user is in the vicinity of a retail establishment, ascertains whether or not the proximate mobile device is receptive to receipt of advertising from the retail establishment, negotiates with the mobile device to determine at least one user preference with respect to a real-time video call, and based on the ascertained user preferences, downloads the appropriate real-time video call to the mobile device and/or establishes a live audio-visual connection with a representative of the retail establishment, wherein during the live audio-visual connection negotiations between the user and the representative is effectuated and the user comprehends that they are communicating with a famous personality.
摘要:
A method and apparatus are provided for training parameters in a hidden conditional random field model for use in speech recognition and phonetic classification. The hidden conditional random field model uses parameterized features that are determined from a segment of speech, and those values are used to identify a phonetic unit for the segment of speech. The parameters are updated after processing of individual training samples.
摘要:
An unsupervised adaptation method and apparatus are provided that reduce the storage and time requirements associated with adaptation. Under the invention, utterances are converted into feature vectors, which are decoded to produce a transcript and alignment unit boundaries for the utterance. Individual alignment units and the feature vectors associated with those alignment units are then provided to an alignment function, which aligns the feature vectors with the states of each alignment unit. Because the alignment is performed within alignment unit boundaries, fewer feature vectors are used and the time for alignment is reduced. After alignment, the feature vector dimensions aligned to a state are added to dimension sums that are kept for that state. After all the states in an utterance have had their sums updated, the speech signal and the alignment units are deleted. Once sufficient frames of data have been received to perform adaptive training, the acoustic model is adapted.
摘要:
A language processing system includes a unified language model. The unified language model comprises a plurality of context-free grammars having non-terminal tokens representing semantic or syntactic concepts and terminals, and an N-gram language model having non-terminal tokens. A language processing module capable of receiving an input signal indicative of language accesses the unified language model to recognize the language. The language processing module generates hypotheses for the received language as a function of words of the unified language model and/or provides an output signal indicative of the language and at least some of the semantic or syntactic concepts contained therein.
摘要:
Embodiments of a computer-implemented advertisement display system are disclosed. In one embodiment, the system includes a speech recognition component that processes a video clip and produces a corresponding collection of speech recognition data indicative of an audio portion of the video clip. The system also includes a collection of advertising material. An advertisement selection component selects an advertisement from the collection of advertising material based on the corresponding collection of speech recognition data. The system also includes a display. An advertisement presentation component displays an indication of the selected advertisement on the display during a simultaneous display of the video clip.
摘要:
A method of entering information into a mobile device includes receiving a multi-word speech input from a user, performing speech recognition on the speech input to obtain a multi-word speech recognition result, and sequentially displaying, in a display, words in the speech recognition result for user confirmation or correction, by adding one word at a time to the display. A next word is only displayed after user confirmation or correct has been received for a previously displayed word that is immediately preceding the next word in the speech recognition result. The method also includes calculating a hypothesis lattice indicative of a plurality of speech recognition hypotheses based on the speech input and, prior to finishing calculating the hypothesis lattice and while continuing to calculate the hypothesis lattice, calculating a preliminary hypothesis lattice indicative of only partial speech recognition hypotheses based on the speech input and outputting the preliminary hypotheses lattice.
摘要:
Described is a technology by which a communications identifier such as a telephone number is used in selecting relevant content (e.g., an advertisement) for outputting as audible and/or visible information to a destination device. The communications identifier is used like a search term/keyword to determine content relevant to that identifier. A telephone number may be the identifier, selected based on a partially-dialed or actually dialed number, or a user's current scrolling position among listed telephone numbers. A telephone number of an incoming call may be used as the identifier. Profile data and/or state data may be used in selecting relevant content. Some of the profile data may be mined based on previous user activities. In one example, the content may be provided via a remote source of the content, and used immediately or cached for later use (e.g., output).
摘要:
Embodiments of a computer-implemented advertisement display system are disclosed. In one embodiment, the system includes a speech recognition component that processes a video clip and produces a corresponding collection of speech recognition data indicative of an audio portion of the video clip. The system also includes a collection of advertising material. An advertisement selection component selects an advertisement from the collection of advertising material based on the corresponding collection of speech recognition data. The system also includes a display. An advertisement presentation component displays an indication of the selected advertisement on the display during a simultaneous display of the video clip.