摘要:
A method (10) in a speech recognition application callflow can include the steps of assigning (11) an individual option and a pre-built grammar to a same prompt, treating (15) the individual option as a valid output of the pre-built grammar if the individual option is a potential valid match to a recognition phrase (12) or an annotation (13) in the pre-built grammar, and treating (14) the individual option as an independent grammar from the pre-built grammar if the individual option fails to be a potential valid match to the recognition phrase or the annotation in the pre-built grammar.
摘要:
A method and system of identifying and optimizing audio segments in a speech application program. Audio segments are identified and extracted from a speech application program. The audio segments containing audio text to be recorded are then optimized in order to facilitate the recording of the audio text. The optimization of the extracted audio segments may include accounting for programmed pauses and variables in the speech application code, identifying multi-sentence segments and the presense of duplicate audio segments, and accounting for the effects of coarticulation.
摘要:
A method (10) in a speech recognition application callflow can include the steps of assigning (11) an individual option and a pre-built grammar to a same prompt, treating (15) the individual option as a valid output of the pre-built grammar if the individual option is a potential valid match to a recognition phrase (12) or an annotation (13) in the pre-built grammar, and treating (14) the individual option as an independent grammar from the pre-built grammar if the individual option fails to be a potential valid match to the recognition phrase or the annotation in the pre-built grammar.
摘要:
A system, apparatus, and method for creating alternate-mode interactive applications is provided. A system for creating an alternate-mode interactive application includes a selection module for selecting a voice-mode element from a set of voice-mode elements defining a voice-mode interactive application for accomplishing a predetermined user-directed task The system also includes a generation module for generating an alternate-mode element corresponding to the selected voice-mode element, the alternate-mode element having a modality different than the voice-mode element. The system further includes a construction module for constructing an alternate-mode interactive application based upon the generated alternate-mode element.
摘要:
A wizard that from a fixed design can create various audio interfaces. The generated interfaces can be speech only, DTMF only, or various mixed speech and DTMF UIs. When specifying both speech and DTMF prompts, a number of combinations of these interfaces could be automatically generated. Robust speech recognition systems can be built by automatically generating a “shadow” DTMF application. The DTMF application will perform the same task as the primary speech application; however the transfer to a DTMF application could be done explicitly by the user, or could be transferred automatically (either a temporary or permanent transition) at a point in the call flow where there was a problem with the speech recognition.
摘要:
The present invention provides a method of dynamically displaying speech recognition system information. The method can include providing a single floating window for displaying frames of speech recognition system state information to a user. The frames can be varied according to trigger events detected in the speech recognition system. Each frame can differ from others of the frames according to the speech recognition system state information.
摘要:
A method (10) of developing call flows can simply include a determination (12) whether an alternative speech field is filled. If the alternative speech field is not filled, then the description text is used (16) in a description field as a default for text for speech output. The description field can be presented graphically and in a properties sheet for speech output objects. If an optional speech text field is filled in the properties sheet, then the description text in the description field can be replaced (14) with the contents of the optional speech text field for text to speech output. The contents of the optional speech text field (32) can be represented as a flyover (23) graphically when pointing to the graphical object. Optionally, the description field (34) and the optional speech text field can be edited on a single graphical user interface (20).
摘要:
A method, system and apparatus for automatically capturing intonation cues in audio segments in speech applications. The method can include identifying planned audio segments in the speech application program, the audio segments containing audio text to be recorded and associated file names. The method further can include extracting the audio segments from the speech application program and processing the extracted audio segments to create an audio text recordation plan. Finally, the method can include further processing the audio text recordation plan to account for intonation cues.
摘要:
A method (10) of arranging grammar files in a presentation list such as a drop-down list can include the steps of receiving (12) a system request to display the grammar files in the presentation list, and sorting (14) the grammar files by giving user defined grammar files greater priority over built-in grammar files and then sorting by a second criteria. The method can further include the step of displaying (16) the grammar files when a user selects the grammar files and distinguishing between a user defined grammar and a built-in grammar.
摘要:
A method and system for defining standard catch styles used in generating speech application code for managing catch events, in which a style-selection menu that allows for selection of one or more catch styles is presented. Each catch style represents a system response to a catch event. A catch style can be selected from the style-selection menu. For each selected catch style, the system can prepare a response for each catch event. If the selected catch style requires playing a new audio message in response to a particular catch event, a contextual message can be entered in one or more text fields. The contextual message entered in each text field corresponds to the new audio message that will be played in response to the particular catch event. In certain catch styles, the entered contextual message is different for each catch event, while in other catch styles, the entered contextual message is the same for each catch event. Finally, if the selected catch style does not require playing of a new audio message in response to a particular catch event, the system can replay the system prompt.