摘要:
A voice processing system includes a real-time voice server, which is arranged to process real-time voice processing tasks for clients of the system. A gateway processor is arranged to accept from a client a request to perform an off-line voice processing task, to convert the off-line voice processing task into an equivalent real-time voice processing task, to invoke the voice server to process the equivalent real-time voice processing task, and to output a result of the equivalent real-time voice processing task.
摘要:
A communication system can include a telephony application server and at least one speech engine, where the system uses a standardized messaging protocol that specifies a standard for media resources. The telephony application server can have at least one voice server component. The speech engines can be allocated to handle requests for the voice server component on a per turn basis. The standardized messaging protocol can define a message format for request messages sent from the voice server component to a selected speech engine, a message format for response messages sent from the speech engine to the voice server component, and a message format for event messages sent from the speech engine to the voice server component. Each message format can include a field for a call identifier.
摘要:
A method for providing voice telephony services can include the step of receiving a call via a telephone gateway. The telephone gateway can convey call identifying data to a resource connector. A media port can be responsively established within a media converter that is communicatively linked to the telephone gateway through a port associated with the call. A call description object can be constructed that includes the call identifying data and an identifier for the media port. The call description object can be conveyed to a telephony application server that provides at least one speech service for the call. The telephony application server can initiate at least one programmatic action of a communicatively linked speech engine. The speech engine can convey results of the programmatic action to the media converter through the media port. The media converter can stream speech signals for the call based upon the results.
摘要:
A telephone capable of placing or receiving calls over the PSTN or a packet network. The telephone can store multiple telephone numbers for each potential called party along with preferences that govern the order of selecting telephone numbers to service any given outgoing call. Some or all of the telephone numbers can be associated with a presence service. Presence indicators stored in the telephone are dynamically updated via the packet network connection and are used as part of the telephone number selection algorithm. The preferred embodiment is a wireless system having a base station and a remote device.
摘要:
A system for implementing voice services can include at least one virtual machine, such as a Java 2 Enterprise Edition (J2EE) virtual machine. The virtual machine can include a bean container for handling software beans, such as Enterprise Java Beans. The bean container can include a voice browser bean. The voice browser bean can include a VoiceXML browser.
摘要:
A method for detecting speech utterances within a telephone call can include the steps of initializing a componentized voice server having at least one software-based speech detection routine. At least one previously established parameter can be used to discern a speech detection methodology for handling an incoming call. The software-based speech detection routine can be set in accordance with a select one of the parameters. An indicator of particular one of the parameters can be conveyed to an external speech detection component so that the external speech detection component is set to detect speech for the call in accordance with the conveyed indication. The software-based speech detection routine and/or the external speech detection component can detect a speech utterance for the call. The voice server can perform at least one programmatic action responsive to the detecting of the speech utterance.
摘要:
A method for interfacing an application server with a resource can include the step of associating a plurality of Enterprise Java Beans (EJBs) to a plurality of resources, where a one-to-one correspondence exists between EJB and resource. An application server can receive an application request and can determine a resource for handling the request. An EJB associated with the determined resource can interface the application server to the determined resource. The request can be handled with the determined resource.
摘要:
A method for barging-in a call can include the step of establishing a component as a communication intermediary between at least one speech engine and a telephone gateway. The component can include at least one input media port and at least one output media port, where the input media port and the output media port can interface with selective ones of the speech engines. A barge-in command can be received while an automated prompt is being audibly presented. The signals for the prompt can be conveyed to the component via the output media port. Once the barge-in command has been received, the output media port can be closed to immediately stop the prompt from being presented. A halt command can be conveyed to a prompt generating source so that the source stops generating the prompt.
摘要:
A method of interfacing a telephone application server and a speech engine can include the step of establishing one or more audio sockets in a media converting component of the telephone application server. The audio socket can remain available for approximately a duration of a call. A work unit that requires processing by a speech engine can be detected for the call. An identifier for the audio socket and a data for the work unit can be conveyed to a selected speech engine. Work unit results from the selected speech engine can be received by the media converting component via the previously established audio socket.
摘要:
A method of accessing an Extensible Hypertext Markup Language Plus Voice Web page can include the step of receiving a request for an Extensible Hypertext Markup Language Plus Voice Web page from a source that lacks the ability to directly render the Web page. The Web page can be segmented into a multitude of fragments, where each fragment can consist of a single interface category, such as a graphical user interface category or a voice category. These fragments can be used to provide the requesting source with information from the Web page.