Abstract:
A speech recognition system is extensible in that new terms may be added to a list of terms that are recognized by the speech recognition system. The speech recognition system provides audio feedback when new terms are added so that a user may hear how the system expects the word to be pronounced. The user may then accept the pronunciation or provide his own pronunciation. The user may also selectively change the pronunciation of words to avoid misrecognitions by the system. The system may provide appropriate user interface elements for enabling a user to change the pronunciation of words. The system may also include intelligence for automatically changing the pronunciation of words used in recognition based upon empirically derived information.
Abstract:
The claimed subject matter provides a system and/or a method that facilitates replicating a telepresence session with a real world physical meeting. A telepresence session can be initiated within a communication framework that includes two or more virtually represented users that communicate therein. A trigger component can monitor the telepresence session in real time to identify a participant interaction with an object, wherein the object is at least one of a real world physical object or a virtually represented object within the telepresence session. A feedback component can implement a force feedback to at least one participant within the telepresence session based upon the identified participant interaction with the object, wherein the force feedback is employed via a device associated with at least one participant.
Abstract:
The present invention combines a conventional audio microphone with an additional speech sensor that provides a speech sensor signal based on an input. The speech sensor signal is generated based on an action undertaken by a speaker during speech, such as facial movement, bone vibration, throat vibration, throat impedance changes, etc. A speech detector component receives an input from the speech sensor and outputs a speech detection signal indicative of whether a user is speaking. The speech detector generates the speech detection signal based on the microphone signal and the speech sensor signal.
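A minimal sketch of how a detector might combine the two signals. The abstract does not say how they are combined, so the frame-energy measure, the thresholds, and the "both must be active" rule below are illustrative assumptions, as are all function and parameter names.

```python
from typing import List, Sequence


def frame_energy(frame: Sequence[float]) -> float:
    """Mean squared amplitude of one frame; an empty frame has zero energy."""
    if not frame:
        return 0.0
    return sum(sample * sample for sample in frame) / len(frame)


def detect_speech(
    mic_frames: Sequence[Sequence[float]],
    sensor_frames: Sequence[Sequence[float]],
    mic_threshold: float = 0.01,      # assumed value, not from the source
    sensor_threshold: float = 0.005,  # assumed value, not from the source
) -> List[bool]:
    """Return one speech/non-speech flag per pair of time-aligned frames.

    Assumed rule: a frame counts as speech only when both the microphone
    and the additional speech sensor (e.g. a bone or throat vibration
    sensor) show energy above their thresholds. Background noise that
    reaches only the microphone is therefore rejected.
    """
    return [
        frame_energy(mic) > mic_threshold and frame_energy(sensor) > sensor_threshold
        for mic, sensor in zip(mic_frames, sensor_frames)
    ]
```

Under this rule, a loud microphone frame paired with a silent sensor frame is treated as ambient noise rather than as the user speaking.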
Abstract:
A system that incorporates a user context into a computer-based search is provided. To establish the context, the innovation can identify information about a user state or context via a variety of sources and sensors. The state/context information can be used to filter, arrange and/or rank search results so as to facilitate converging on meaningful searches and results. Machine learning systems (implicitly and/or explicitly trained) can be employed to infer a current and/or future context related to the user. An identified or inferred user context can be employed to modify an automated or user-defined search input/query. Contextual cues can be considered directly in the construction and use of context-sensitive retrieval algorithms that are optimized for identifying and/or ranking informational items of potential interest or value in different contexts. As well, the context can be employed to intelligently render results of a query (e.g., user/application defined, context-modified query).
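The ranking step can be illustrated with a small sketch. It assumes each result carries a base relevance score and a set of descriptive tags, and that the user context is also a set of tags. The additive boost and every name here are hypothetical, not taken from the abstract.

```python
from typing import List, Set, Tuple

# (title, base relevance score, descriptive tags): a hypothetical result shape
Result = Tuple[str, float, Set[str]]


def rerank(results: List[Result], context: Set[str], boost: float = 0.5) -> List[str]:
    """Order result titles by base score plus a boost per tag shared with the context."""
    scored = [
        (base + boost * len(tags & context), title)
        for title, base, tags in results
    ]
    # Highest combined score first; sort() is stable, so ties keep input order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [title for _, title in scored]


results = [
    ("Pizza recipes", 1.2, {"food", "home"}),
    ("Pizza downtown", 1.0, {"food", "downtown"}),
    ("Airport parking", 0.9, {"travel"}),
]
# Context inferred from, say, location sensors: the user is downtown and hungry.
print(rerank(results, {"food", "downtown"}))
```

In this example the downtown result overtakes the recipe page because it shares two tags with the context instead of one.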
Abstract:
Architecture for presenting advertisements in real time in retail establishments. A sensor component includes sensors for collecting information about a customer or group of customers as they move through the store. The sensors can include capability for image processing, audio processing, light sensing, velocity sensing, direction sensing, proximity sensing, face recognition, pose recognition, transaction recognition, and biometric sensing, for example. A customer component analyzes the information and generates a profile about the customer. Advertisements are selected for presentation that target the customers as they walk in proximity of a presentation system of the store. An advertisement component facilitates dynamic presentation of a targeted advertisement to the individual as a function of the profile. The customer component can infer information during analysis using machine learning and reasoning.
Abstract:
Prosodic databases hold fundamental frequency templates for use in a speech synthesis system. Prosodic database templates may hold fundamental frequency values for syllables in a given sentence. These fundamental frequency values may be applied in synthesizing a sentence of speech. The templates are indexed by tonal pattern markings. A predicted tonal marking pattern is generated for each sentence of text that is to be synthesized, and this predicted pattern of tonal markings is used to locate a best-matching template. The templates are derived by calculating fundamental frequencies on a per-syllable basis for sentences that are spoken by a human trainer for a given unlabeled corpus.
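The template lookup can be sketched as a nearest-match search over tonal patterns. The marking alphabet ("H", "L", "-"), the mismatch-count distance, and the frequency values below are all assumptions made for illustration. The abstract does not specify how the best match is computed.

```python
from typing import Dict, List, Tuple

# One marking per syllable: "H" (high), "L" (low), "-" (unmarked). Hypothetical alphabet.
TonalPattern = Tuple[str, ...]

# Templates indexed by tonal pattern; values are per-syllable F0 in Hz (made-up numbers).
TEMPLATES: Dict[TonalPattern, List[float]] = {
    ("H", "L", "-", "L"): [210.0, 180.0, 175.0, 150.0],
    ("L", "H", "-", "L"): [170.0, 220.0, 190.0, 155.0],
    ("H", "-", "H", "L"): [215.0, 195.0, 225.0, 160.0],
}


def mismatch_count(a: TonalPattern, b: TonalPattern) -> int:
    """Positions where the markings differ; extra syllables count as mismatches."""
    return sum(x != y for x, y in zip(a, b)) + abs(len(a) - len(b))


def best_template(predicted: TonalPattern) -> Tuple[TonalPattern, List[float]]:
    """Return the stored pattern closest to the predicted one, with its F0 values."""
    best = min(TEMPLATES, key=lambda pattern: mismatch_count(pattern, predicted))
    return best, TEMPLATES[best]


pattern, f0_values = best_template(("H", "L", "L", "L"))
print(pattern, f0_values)
```

The returned per-syllable values would then be applied to the sentence being synthesized.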
Abstract:
Inbound calls to an enterprise are handled by a call-handling component, by identifying the calling party from intrinsic information about the call, by identifying a potential recipient from non-intrinsic information about the calling party, and by automatically routing the inbound call to a potential recipient.
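A rough sketch of the two-step lookup described above: intrinsic call information (here, the caller's number) identifies the caller, and enterprise data about that caller selects a recipient. The tables, the names, and the fallback to an operator are hypothetical; the abstract does not say what happens when no match is found.

```python
from typing import Dict

# Intrinsic information: the calling number maps to a known caller (hypothetical data).
CALLER_DIRECTORY: Dict[str, str] = {"+15550100": "acme-corp"}

# Non-intrinsic information: enterprise records about who handles each caller.
ACCOUNT_OWNERS: Dict[str, str] = {"acme-corp": "alice"}

EXTENSIONS: Dict[str, str] = {"alice": "x201", "operator": "x000"}


def route_call(caller_number: str) -> str:
    """Return the extension an inbound call should be routed to."""
    caller = CALLER_DIRECTORY.get(caller_number)
    if caller is None:
        # Assumed fallback: unknown callers go to the operator.
        return EXTENSIONS["operator"]
    recipient = ACCOUNT_OWNERS.get(caller, "operator")
    return EXTENSIONS[recipient]


print(route_call("+15550100"))
print(route_call("+15550199"))
```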
Abstract:
Content management architecture for a portable wireless device. Caching and fetching techniques are provided to improve content handling for portable devices such as cellular telephones and portable computers. A search component automatically performs searches as a background process, and potentially desired content is received and cached by a content storing component to be available in the future when and if needed, mitigating latency associated with slow download speeds, refresh rates, and other system and/or network impediments. Content from background search results can be trickled into the device as part of the background process so as not to burden system resources for other processes. As part of memory management, aged and/or low priority or low interest content can be selectively removed or archived to increase available cache or memory space, as well as to maintain relevant content within the device. A presentation component facilitates presentation of the pre-stored content.
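The memory-management step might look like the following sketch, which evicts the lowest-priority and then the oldest content when a newly fetched item would not otherwise fit. The eviction order, the class and field names, and the capacity model are assumptions. The abstract only states that aged and/or low-priority content can be removed or archived.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class CachedItem:
    key: str
    size: int          # bytes
    priority: int      # higher means more interesting to the user
    fetched_at: float  # timestamp in seconds; smaller means older


class ContentCache:
    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.items: Dict[str, CachedItem] = {}

    def used(self) -> int:
        return sum(item.size for item in self.items.values())

    def store(self, item: CachedItem) -> List[str]:
        """Cache an item and return the keys evicted to make room for it."""
        if item.size > self.capacity:
            return []  # can never fit; leave the cache untouched
        self.items.pop(item.key, None)  # replace any stale copy of the same key
        evicted: List[str] = []
        # Assumed policy: lowest priority goes first, ties broken by age.
        victims = sorted(self.items.values(), key=lambda i: (i.priority, i.fetched_at))
        for victim in victims:
            if self.used() + item.size <= self.capacity:
                break
            del self.items[victim.key]
            evicted.append(victim.key)
        self.items[item.key] = item
        return evicted
```

A background fetcher could call `store` for each prefetched result, so that relevant content stays on the device while stale or uninteresting content is dropped.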
Abstract:
A system that employs an explicitly and/or implicitly trained model in order to return entity-specific computer-based search results is provided. The innovation can provide for a customized search model that focuses search in connection with achieving information that is meaningful with respect to goals of an entity. The model can be used to modify a search query in accordance with a goal of the entity or to generate the search query thereby returning meaningful and/or targeted results to the user. The system can automatically gather entity-related data thereafter determining or inferring a goal as well as training the model. Moreover, the system can selectively configure (e.g., order, rank, filter) and render results to a user based upon the model.