摘要:
A method for constructing speech elements within an interface can include a step of identifying a visual interface having multiple visual elements. Visual selectors can be presented proximate each of the visual elements. The visual selectors can permit a user to input a speech control type for the associated visual element. For each presented visual selector, a speech element having a speech control type specified in the visual selector can be automatically generated.
摘要:
A method and system of identifying and optimizing audio segments in a speech application program. Audio segments are identified and extracted from a speech application program. The audio segments containing audio text to be recorded are then optimized in order to facilitate the recording of the audio text. The optimization of the extracted audio segments may include accounting for programmed pauses and variables in the speech application code, identifying multi-sentence segments and the presense of duplicate audio segments, and accounting for the effects of coarticulation.
摘要:
A method (10) in a speech recognition application callflow can include the steps of assigning (11) an individual option and a pre-built grammar to a same prompt, treating (15) the individual option as a valid output of the pre-built grammar if the individual option is a potential valid match to a recognition phrase (12) or an annotation (13) in the pre-built grammar, and treating (14) the individual option as an independent grammar from the pre-built grammar if the individual option fails to be a potential valid match to the recognition phrase or the annotation in the pre-built grammar.
摘要:
A system, apparatus, and method for creating alternate-mode interactive applications is provided. A system for creating an alternate-mode interactive application includes a selection module for selecting a voice-mode element from a set of voice-mode elements defining a voice-mode interactive application for accomplishing a predetermined user-directed task The system also includes a generation module for generating an alternate-mode element corresponding to the selected voice-mode element, the alternate-mode element having a modality different than the voice-mode element. The system further includes a construction module for constructing an alternate-mode interactive application based upon the generated alternate-mode element.
摘要:
A method and system for automated code generation in a call flow builder (10) can include a display coupled to a processor. The processor can be programmed to select a real code (database connection) or a prototype code using a graphical interface (20) to provide a selected code and develop a call flow using the selected code. The processor can be programmed to select the prototype code as the selected code, test the call flow in a local development environment and further enable the switching of the selected code from the prototype to the real code to complete a database connection. The processor can be further programmed to enable specification of a default or range of values. Additionally, the processor can be programmed to use a database connection code that replaces a prototype assignment of values to variables when the real code is the selected code.
摘要:
A method (10) for arranging user-modified variable names in a presentation list such as a drop-down list can include the steps of receiving (12) a system request to display the variables in the drop-down list, and sorting (14) the variables by giving user named variables greater priority over system named variables and then sorting by a second criteria. The method can further include the step of displaying (16) the variables when a user selects the variables using a drop-down control.
摘要:
A wizard that from a fixed design can create various audio interfaces. The generated interfaces can be speech only, DTMF only, or various mixed speech and DTMF UIs. When specifying both speech and DTMF prompts, a number of combinations of these interfaces could be automatically generated. Robust speech recognition systems can be built by automatically generating a “shadow” DTMF application. The DTMF application will perform the same task as the primary speech application; however the transfer to a DTMF application could be done explicitly by the user, or could be transferred automatically (either a temporary or permanent transition) at a point in the call flow where there was a problem with the speech recognition.
摘要:
A method and system for defining standard catch styles used in generating speech application code for managing catch events, in which a style-selection menu that allows for selection of one or more catch styles is presented. Each catch style represents a system response to a catch event. A catch style can be selected from the style-selection menu. For each selected catch style, the system can prepare a response for each catch event. If the selected catch style requires playing a new audio message in response to a particular catch event, a contextual message can be entered in one or more text fields. The contextual message entered in each text field corresponds to the new audio message that will be played in response to the particular catch event. In certain catch styles, the entered contextual message is different for each catch event, while in other catch styles, the entered contextual message is the same for each catch event. Finally, if the selected catch style does not require playing of a new audio message in response to a particular catch event, the system can replay the system prompt.
摘要:
A method (10) of developing call flows can simply include a determination (12) whether an alternative speech field is filled. If the alternative speech field is not filled, then the description text is used (16) in a description field as a default for text for speech output. The description field can be presented graphically and in a properties sheet for speech output objects. If an optional speech text field is filled in the properties sheet, then the description text in the description field can be replaced (14) with the contents of the optional speech text field for text to speech output. The contents of the optional speech text field (32) can be represented as a flyover (23) graphically when pointing to the graphical object. Optionally, the description field (34) and the optional speech text field can be edited on a single graphical user interface (20).
摘要:
A method, system and apparatus for automatically capturing intonation cues in audio segments in speech applications. The method can include identifying planned audio segments in the speech application program, the audio segments containing audio text to be recorded and associated file names. The method further can include extracting the audio segments from the speech application program and processing the extracted audio segments to create an audio text recordation plan. Finally, the method can include further processing the audio text recordation plan to account for intonation cues.