Abstract:
Systems and methods for multi-modal messaging that enable a user to compose, send and retrieve messages, such as SMS, MMS, IM or ordinary e-mail messages, using one or more I/O (input/output) modalities (e.g., speech I/O and/or GUI I/O). A method for composing messages combines the advantages of a multi-modal interface (e.g., grammar-based speech and touchscreen or similar input devices) with message templates, allowing a user to construct a message with significantly less effort and in a fraction of the time required by conventional methods. The user can dictate messages using speech and/or GUI input, based on a library of message templates that the user can personalize to fit his or her social interaction needs.
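The template-based composition described above can be illustrated with a minimal sketch. It assumes a template is a string with named slots, and that each slot may be filled from either speech or GUI input. The class name `MessageTemplate`, the `compose` helper, the slot syntax and the sample template are all hypothetical and do not come from the abstract.

```python
import re
from dataclasses import dataclass, field

# Hypothetical slot syntax: {slot_name} inside the template text.
SLOT = re.compile(r"\{(\w+)\}")


@dataclass
class MessageTemplate:
    """A message with named slots that the user fills by speech or GUI."""
    text: str
    values: dict = field(default_factory=dict)

    def slots(self):
        return SLOT.findall(self.text)

    def fill(self, slot, value, modality):
        # Record which modality supplied the value (e.g. "speech" or "gui").
        if slot not in self.slots():
            raise KeyError(f"unknown slot: {slot}")
        self.values[slot] = (value, modality)

    def missing(self):
        return [s for s in self.slots() if s not in self.values]

    def render(self):
        if self.missing():
            raise ValueError(f"unfilled slots: {self.missing()}")
        return SLOT.sub(lambda m: self.values[m.group(1)][0], self.text)


# A user-personalized template library (sample content is an assumption).
library = {"late": MessageTemplate("Running {delay} late, see you at {place}.")}


def compose(template_key, inputs):
    """Fill a fresh copy of a library template from (slot, value, modality) triples."""
    template = MessageTemplate(library[template_key].text)
    for slot, value, modality in inputs:
        template.fill(slot, value, modality)
    return template.render()
```

For instance, `compose("late", [("delay", "ten minutes", "speech"), ("place", "the cafe", "gui")])` mixes one spoken slot value with one picked on a touchscreen.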
Abstract:
Systems and methods for building speech-based applications using reusable dialog components based on VoiceXML (Voice eXtensible Markup Language). VoiceXML reusable dialog components can be used for building a voice interface for use with multi-modal, multi-channel and conversational applications that offer universal access to information anytime, from any location, using any pervasive computing device regardless of its I/O modality. In one embodiment, a framework for reusable dialog components built within the VoiceXML specifications is based on the tag and ECMAScript parameter objects to pass parameters, configuration and results. This solution is interpreted at the client side (VoiceXML browser). In another embodiment, a framework for reusable dialog components is based on JSP (Java Server Pages) and beans that generate VoiceXML subdialogs. This solution can be evaluated at the server side. These frameworks can be mixed and matched depending on the application.
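As a rough illustration of the server-side embodiment, the sketch below generates a small VoiceXML document that can serve as a reusable subdialog. Python stands in for the JSP and beans named in the abstract, which is a substitution made only for this example. The function name `build_subdialog` and the grammar URL are hypothetical; the elements used (`form`, `field`, `prompt`, `grammar`, `filled`, `return`) are standard VoiceXML 2.0.

```python
import xml.etree.ElementTree as ET

VXML_NS = "http://www.w3.org/2001/vxml"


def build_subdialog(field_name, prompt_text, grammar_src):
    """Return a VoiceXML document that collects one field and returns it to the caller."""
    vxml = ET.Element("vxml", {"version": "2.0", "xmlns": VXML_NS})
    form = ET.SubElement(vxml, "form", {"id": f"get_{field_name}"})
    fld = ET.SubElement(form, "field", {"name": field_name})
    ET.SubElement(fld, "prompt").text = prompt_text
    ET.SubElement(fld, "grammar", {"src": grammar_src, "type": "application/srgs+xml"})
    # Once the field is filled, hand the result back to the calling dialog.
    filled = ET.SubElement(fld, "filled")
    ET.SubElement(filled, "return", {"namelist": field_name})
    return ET.tostring(vxml, encoding="unicode")


# Hypothetical usage: the grammar URL is a placeholder, not a real resource.
document = build_subdialog("city", "Which city?", "http://example.com/city.grxml")
```

A calling dialog would reference the generated document and read the `city` result from the returned object; configuration such as the prompt text is supplied on the server before the document is sent.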
Abstract:
Clients connecting to a VoiceXML browser obtain a control channel. Using this channel, a client may initialize a new VoiceXML session or attach to an existing one. After obtaining a session, the client may perform a range of actions, including control and monitoring actions.
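The control-channel flow above can be sketched as follows. This is an in-memory model under stated assumptions, not an actual browser protocol: the classes `VoiceXMLBrowser` and `ControlChannel`, the method names and the action names ("pause", "resume", "stop") are all hypothetical.

```python
import itertools


class VoiceXMLBrowser:
    """Stand-in for a browser that hands out control channels (hypothetical API)."""

    def __init__(self):
        self._ids = itertools.count(1)
        self.sessions = {}

    def connect(self):
        return ControlChannel(self)


class ControlChannel:
    def __init__(self, browser):
        self._browser = browser
        self.session_id = None

    def init_session(self, document_url):
        """Start a new session and bind this channel to it."""
        sid = next(self._browser._ids)
        self._browser.sessions[sid] = {"document": document_url, "state": "running"}
        self.session_id = sid
        return sid

    def attach(self, session_id):
        """Bind this channel to a session that already exists."""
        if session_id not in self._browser.sessions:
            raise LookupError(f"no such session: {session_id}")
        self.session_id = session_id

    def _session(self):
        if self.session_id is None:
            raise RuntimeError("channel has no session yet")
        return self._browser.sessions[self.session_id]

    def control(self, action):
        """Controlling action: change the session state."""
        states = {"pause": "paused", "resume": "running", "stop": "stopped"}
        self._session()["state"] = states[action]

    def monitor(self):
        """Monitoring action: return a snapshot of the session."""
        return dict(self._session())
```

In this model, one client can call `init_session` while a second client calls `attach` with the same session id, so both channels observe and control the same session.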
Abstract:
Systems and methods for providing conversational computing in a distributed multi-modal environment and, in particular, systems and methods for enabling a user to communicate conversationally with entities in a distributed network using a portable access device. The interaction dialog for communicating with entities in the network is described in a manner that is independent of, e.g., user language, access channel(s), I/O modalities of the access device, and regional/international settings. The user access device and/or a remote server comprise mechanisms for adapting the interaction dialog based on the user language, user preferences, access device characteristics and/or regional/international settings, thereby enabling the user to interact with one or more entities in the network.
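One way to picture a dialog description that is independent of language and modality is sketched below. The abstract dialog holds only message keys, and an adaptation step resolves them for a given language and I/O modality. The data layout, the `adapt` function and the sample strings are assumptions made for illustration only.

```python
# An abstract dialog step: no language- or modality-specific content.
DIALOG = {"id": "confirm_order", "type": "yes_no", "message_key": "confirm"}

# Hypothetical per-language string catalog.
CATALOG = {
    "en": {"confirm": "Confirm your order?", "yes": "Yes", "no": "No"},
    "fr": {"confirm": "Confirmer votre commande ?", "yes": "Oui", "no": "Non"},
}


def adapt(dialog, language, modality):
    """Render an abstract dialog step for one language and one I/O modality."""
    strings = CATALOG.get(language, CATALOG["en"])  # fall back to English
    text = strings[dialog["message_key"]]
    if modality == "speech":
        # Spoken prompt plus the phrases a recognizer should accept.
        return {"prompt": text, "grammar": [strings["yes"].lower(), strings["no"].lower()]}
    if modality == "gui":
        # On-screen label plus buttons.
        return {"label": text, "buttons": [strings["yes"], strings["no"]]}
    raise ValueError(f"unsupported modality: {modality}")
```

The same `DIALOG` entry yields a spoken English prompt on a voice channel and a French button dialog on a touchscreen, and the adaptation could run on the device or on a server.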
Abstract:
An audio splitting system for sharing speech data associated with the same utterance between multiple speech technologies (consumers). In one aspect, the system comprises one or more queues for storing data, a plurality of consumers that each share the data stored in the one or more queues, and a scheduler for managing both the storage of the data in the one or more queues and the consumption of that data by each of the plurality of consumers. The consumers register their data requirements and priority requests with the scheduler. The scheduler assigns each of the plurality of consumers to one or more of the queues based on the registered data requirements.
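The queue, consumer and scheduler roles described above can be sketched as follows. This is a simplified single-threaded model under stated assumptions: a "data requirement" is reduced to a string key, each consumer keeps a read cursor into a shared queue, and frames are discarded once every consumer has read them. The class names and the `dispatch` policy (highest priority served first) are hypothetical.

```python
class SharedQueue:
    """One buffer of frames read independently by several consumers."""

    def __init__(self):
        self.frames = []
        self.base = 0        # absolute index of frames[0]
        self.cursors = {}    # consumer name -> absolute index of next unread frame

    def put(self, frame):
        self.frames.append(frame)

    def read(self, consumer):
        pos = self.cursors[consumer]
        data = self.frames[pos - self.base:]
        self.cursors[consumer] = self.base + len(self.frames)
        self._trim()
        return data

    def _trim(self):
        # Drop frames that every registered consumer has already read.
        done = min(self.cursors.values())
        del self.frames[:done - self.base]
        self.base = done


class Scheduler:
    def __init__(self):
        self.queues = {}      # data requirement -> SharedQueue
        self.consumers = []   # (priority, name, requirement)

    def register(self, name, requirement, priority=0):
        # Consumers with the same requirement are assigned to the same queue.
        queue = self.queues.setdefault(requirement, SharedQueue())
        queue.cursors[name] = queue.base + len(queue.frames)
        self.consumers.append((priority, name, requirement))

    def publish(self, requirement, frame):
        if requirement in self.queues:
            self.queues[requirement].put(frame)

    def dispatch(self):
        """Serve consumers in descending priority; return (name, frames) pairs."""
        ordered = sorted(self.consumers, key=lambda c: -c[0])
        return [(name, self.queues[req].read(name)) for _, name, req in ordered]
```

With a recognizer and a speaker-identification engine both registered for the same requirement, each frame is stored once and delivered to both, which is the sharing the abstract describes.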