Abstract:
The present invention discloses a solution for customizing synthetic voice characteristics in a user specific fashion. The solution can establish a communication between a user and a voice response system. A data store can be searched for a speech profile associated with the user. When a speech profile is found, a set of speech output characteristics established for the user from the profile can be determined. Parameters and settings of a text-to-speech engine can be adjusted in accordance with the determined set of speech output characteristics. During the established communication, synthetic speech can be generated using the adjusted text-to-speech engine. Thus, each detected user can hear a synthetic speech generated by a different voice specifically selected for that user. When no user profile is detected, a default voice or a voice based upon a user's speech or communication details can be used.
Abstract:
A list presentation method. The list presentation method can include the steps of: dynamically grouping selected items in a list based on sequentially positioned symbols in the items which are common to one another; labeling each group of selected items; audibly presenting each group label through a speech user interface; and, responsive to a selection of one of the presented group labels, presenting through the speech user interface items in a group corresponding to the selected group label.
Abstract:
A method for constructing speech elements within an interface can include a step of identifying a visual interface having multiple visual elements. Visual selectors can be presented proximate each of the visual elements. The visual selectors can permit a user to input a speech control type for the associated visual element. For each presented visual selector, a speech element having a speech control type specified in the visual selector can be automatically generated.
Abstract:
Embodiments in accordance with the invention can include a new method (500) and system (100) for testing code within a speech application. A test file (101) can be automatically generated to verify the functionality of a new section of code (172) presented within a graphical call flow builder application (156). In one arrangement, a user can specify through a wizard two points on a path identifying the code section to be tested. The wizard can generate a test file (101) and can configure a path (151) to a new subpath (152) and automatically assign predetermined values to graphical call flow prompts along the path. In this manner, the new code section is reached under the same path conditions for allowing repeatable testing. The system can include a test harness (110) configured to test a new code section from within a context of the speech application, and a test controller (120) for transitioning to the new code section. The test controller can run the test harness within the speech application to evaluate a functionality of the new code section.
Abstract:
The present invention addresses problems associated with accessing and using automated systems for scheduled travel events and provides a method and system for communicating information to a user interacting with an automated system for accessing travel schedule data. In a first embodiment of the present invention, a user request can be processed for an event scheduled on an automated system. An occurrence time for the scheduled event can then be loaded. If the occurrence time is within a preconfigured time interval including midnight, a disambiguation process can be triggered for the scheduled event. The scheduled event can be an airline flight and the occurrence time can be an arrival or departure time. The disambiguation process can include prompting a user with a disambiguation message, wherein the disambiguation message makes reference to either an arrival date for the airline flight or a departure date for the airline flight.
Abstract:
A method (10) in a speech recognition application callflow can include the steps of assigning (11) an individual option and a pre-built grammar to a same prompt, treating (15) the individual option as a valid output of the pre-built grammar if the individual option is a potential valid match to a recognition phrase (12) or an annotation (13) in the pre-built grammar, and treating (14) the individual option as an independent grammar from the pre-built grammar if the individual option fails to be a potential valid match to the recognition phrase or the annotation in the pre-built grammar.
Abstract:
A method and system for testing code within a speech application. A test file can be automatically generated to verify the functionality of a new section of code within a graphical call flow builder application. A user can specify through a wizard two points on a path identifying the code section to be tested. The wizard can generate a test file and configure a path to a new subpath. Values are assigned to graphical call flow prompts along the path. Thus, the new code section is reached under the same path conditions for allowing repeatable testing. The system can include a test harness to test a new code section from within a context of the speech application, and a test controller for transitioning to the new code section. The test controller can run the test harness within the speech application to evaluate a functionality of the new code section.
Abstract:
A method, system and computer program for assessing the quality of a call recipient response during an interactive voice dialog. Embodiments of the present invention address deficiencies of the art in respect to biometric analysis and provide a novel and non-obvious method, system and computer program product for call center agent quality assurance using biometric technologies. A solution for automated monitoring of call center agents' skill, mood, professionalism and behavior using biometric technologies and for providing appropriate action to improve customer handling and satisfaction is provided. The solution provides an automated method for detecting potential problems and preemptively taking action to provide consistent, quality customer service.
Abstract:
Some example embodiments include a method of dynamically publishing directory information for a plurality of interactive voice response (‘IVR’) systems. The method includes receiving, by the IVR directory service on behalf of one of the IVR systems, a web services update request. The method includes determining, by the IVR directory service in response to the web services update request, updated directory information for the IVR system. The method includes updating the IVR system directory with the updated directory information for the IVR system. The method includes generating an updated voice mode user interface to reflect the updated IVR system directory with the updated directory information for the IVR system. The generating includes creating one more voice dialogs in accordance with the directory information, the one or more voice dialogs specifying a call flow defining the interaction between a caller and the IVR directory service.
Abstract:
Methods, apparatus, and computer program products are described for automatic speech recognition (‘ASR’) that include accepting by the multimodal application speech input and visual input for selecting or deselecting items in a selection list, the speech input enabled by a speech recognition grammar; providing, from the multimodal application to the grammar interpreter, the speech input and the speech recognition grammar; receiving, by the multimodal application from the grammar interpreter, interpretation results including matched words from the grammar that correspond to items in the selection list and a semantic interpretation token that specifies whether to select or deselect items in the selection list; and determining, by the multimodal application in dependence upon the value of the semantic interpretation token, whether to select or deselect items in the selection list that correspond to the matched words.