摘要:
A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.
摘要:
A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.
摘要:
A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default speech receiver can previously define behavior for the event occurrence and where default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.
摘要:
A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.
摘要:
An in-vehicle system that shares speech processing resources among multiple applications located within a vehicle. The system can include one or more software applications, each associated with different functionally independent in-vehicle consoles. Each application can have a console specific user interface. The system can also include a single in-vehicle speech processing system implemented separately from the in-vehicle consoles. The speech processing system can execute speech processing tasks responsive to requests received from the applications. That is, the in-vehicle speech processing system can provide speech processing capabilities for the applications. The provided speech processing capabilities can include text-to-speech capabilities and speech recognition capabilities.
摘要:
A speech recognition system (10) or method (20) can include a speech input device and a processor (14) coupled to the speech input. The processor can be programmed to identify (22) a plurality of words that are members of confusable pairs of words where each pair includes a target word and a substituted word. The processor can degrade (24) a pronunciation of the substituted word to provide a worse pronunciation of the substituted word. The processor can further compare (28) the pronunciation of the target word with the worse pronunciation to the substituted word. The processor can be further programmed to reduce (26) confusion between the substituted word and other words in a recognition grammar of the speech recognition engine and can also narrow the scope within which the substituted word is recognized.
摘要:
A method for connecting and optimizing audio input devices, comprises the steps of: determining an audio input type; generating a first GUI display screen for prompting and enabling user selection of an audio input device; generating a second GUI display screen for prompting and enabling user connection of the audio input device; testing the connected audio input device; configuring audio settings of the connected audio input device; and, storing for later retrieval an association of the connected audio input device and the configured audio settings. The audio settings are configured and the association is stored only if the testing step is successful. The second GUI display screen can include a device specific image and device specific instructions.
摘要:
A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.
摘要:
A method for maintaining input device identity in a speech application, comprising the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and, retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.
摘要:
A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.