Abstract:
A method of tuning a decision network can include a series of steps. The steps can include identifying a deviation between a correct interpretation of a data item and an incorrect interpretation of the data item. In a decision network including a hierarchical set of nodes and leaves, a path of traversed nodes in the decision network resulting in the deviation can be determined. The nodes can correspond to queries. A measure of goodness at a node in the path can be calculated using at least one new query as a replacement for an existing query to determine whether the measure of goodness improves using the new query compared to the old query. If the measure of goodness improves, at least one new query can be selected and the decision network can be regrown from the node down through the leaves using the at least one new query at the node.
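The comparison of an existing query against candidate replacements at one node can be sketched as follows. This is a minimal illustration, not the claimed method: it assumes yes/no queries over feature dictionaries and uses entropy reduction as the measure of goodness, which the abstract does not specify. The names `goodness` and `pick_query` and the toy data are hypothetical, and regrowing the subtree below the node is left out.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (bits) of a list of labels."""
    n = len(labels)
    return -sum(c / n * math.log2(c / n) for c in Counter(labels).values())

def goodness(query, items):
    """Assumed measure of goodness: entropy reduction from a yes/no split."""
    yes = [label for feats, label in items if query(feats)]
    no = [label for feats, label in items if not query(feats)]
    if not yes or not no:
        return 0.0  # the query does not split the data at this node
    n = len(items)
    after = len(yes) / n * entropy(yes) + len(no) / n * entropy(no)
    return entropy([label for _, label in items]) - after

def pick_query(old_query, new_queries, items):
    """Keep the old query unless some new query scores strictly better."""
    best = max(new_queries, key=lambda q: goodness(q, items))
    return best if goodness(best, items) > goodness(old_query, items) else old_query

# Toy data reaching the node on the path that produced the deviation.
items = [
    ({"word": "to", "pos": 1}, "DEST"),
    ({"word": "to", "pos": 3}, "DEST"),
    ({"word": "from", "pos": 1}, "ORIGIN"),
    ({"word": "from", "pos": 3}, "ORIGIN"),
]
old_query = lambda f: f["pos"] > 2        # existing query at the node
new_query = lambda f: f["word"] == "to"   # candidate replacement
chosen = pick_query(old_query, [new_query], items)
```

In this toy case the candidate separates the two labels perfectly, so it would be selected and the network would then be regrown from that node down.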
Abstract:
A method of developing natural language understanding (NLU) applications can include determining NLU interpretation information from an NLU training corpus of text using a multi-pass processing technique. Altering one pass can automatically alter the input for a subsequent pass. The NLU interpretation information can specify an interpretation of at least part of the NLU training corpus of text. The NLU interpretation information can be stored in a database, and selected items of the NLU interpretation information can be presented in a graphical editor. User-specified edits can also be received in the graphical editor.
Abstract:
A Monte Carlo method for use with natural language understanding and speech recognition language models can include a series of steps. The steps can include identifying at least one phrase embedded in a body of text wherein the phrase can belong to a phrase class. An additional attribute corresponding to the identified phrase can be determined. The body of text can be copied and the identified phrase can be replaced with a different phrase selected from a plurality of phrases. The different phrase can belong to the phrase class and correspond to the attribute.
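The copy-and-replace step can be sketched as below. This is an illustrative sketch under stated assumptions: the phrase inventory `PHRASES`, the class name `CITY`, the attribute values, and the function `resample` are all hypothetical, and the phrase span is assumed to have been identified already.

```python
import random

# Hypothetical inventory: phrase class -> attribute -> phrases.
PHRASES = {
    "CITY": {
        "domestic": ["Boston", "Denver", "Austin"],
        "international": ["Paris", "Tokyo"],
    }
}

def resample(tokens, span, phrase_class, attribute, rng):
    """Copy the text and swap the phrase at span=(start, end) for a
    different phrase with the same class and attribute."""
    start, end = span
    original = " ".join(tokens[start:end])
    candidates = [p for p in PHRASES[phrase_class][attribute] if p != original]
    replacement = rng.choice(candidates)  # the random (Monte Carlo) draw
    return tokens[:start] + replacement.split() + tokens[end:]

rng = random.Random(0)
sentence = "book a flight to Boston tomorrow".split()
variant = resample(sentence, (4, 5), "CITY", "domestic", rng)
```

Repeating the draw would yield additional variants of the same sentence, each still consistent with the phrase class and attribute of the original.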
Abstract:
A system and method for servicing natural language (NL) requests with a plurality of remote host systems. The system utilizes a computer program that comprises: (1) an input system for inputting an NL command; (2) a translation system that extracts a request from the NL command and stores the request in a host-independent format; and (3) a routing system for servicing the request, wherein the routing system comprises a mechanism for selecting a host, for converting the request into a host-dependent directive, and for forwarding the directive to the selected host. The system may further include a speech recognition system, a local data source for servicing the NL command, templates for converting the request into the host-dependent directive, a heuristic for selecting the host, and an output system for obtaining and outputting the response.
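The routing stage might look roughly like the following sketch. Everything named here is an assumption for illustration: the host table `HOSTS`, the directive formats, the request layout (`topic` plus `slots`), and the first-match heuristic are hypothetical, and the transport is replaced by a caller-supplied `send` function.

```python
from string import Template

# Hypothetical host table: topics each host serves, plus a template that
# turns a host-independent request into that host's directive format.
HOSTS = {
    "weather-host": {
        "topics": {"weather"},
        "template": Template("GET /forecast?city=$city"),
    },
    "stock-host": {
        "topics": {"quote"},
        "template": Template("QUOTE $symbol"),
    },
}

def select_host(request):
    """Toy heuristic: the first host that serves the request's topic."""
    for name, host in HOSTS.items():
        if request["topic"] in host["topics"]:
            return name
    raise LookupError("no host serves topic " + request["topic"])

def route(request, send):
    """Select a host, build its directive, and forward it via send()."""
    host = select_host(request)
    directive = HOSTS[host]["template"].substitute(request["slots"])
    return send(host, directive)

def fake_send(host, directive):
    """Stand-in for the network call; echoes what would be sent."""
    return host + " <- " + directive

# Host-independent request, as the translation system might store it.
request = {"topic": "quote", "slots": {"symbol": "IBM"}}
reply = route(request, fake_send)
```

Keeping the request host-independent until this point means a new host can be added by extending the table rather than changing the translation step.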
Abstract:
A method for processing dual tone multi-frequency (DTMF) signals for use with a natural language understanding system can include several steps. The method can include determining whether an audio input signal is a dual tone multi-frequency signal or a human speech signal. If the audio input signal is determined to be a dual tone multi-frequency signal, the audio input signal can be converted to at least one text equivalent. The method can also include providing the at least one text equivalent to a natural language understanding system, which can determine a meaning from the text equivalent.
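One way to sketch the tone-versus-speech decision and the conversion to text is with the Goertzel algorithm, a standard single-frequency detector. The abstract does not say how detection is performed, so this is a swapped-in technique. The 0.8 energy-fraction threshold, the 8 kHz sample rate, the digit-to-word table, and the function names are assumptions.

```python
import math

LOW = (697, 770, 852, 941)                 # standard DTMF row frequencies (Hz)
HIGH = (1209, 1336, 1477, 1633)            # standard DTMF column frequencies (Hz)
KEYPAD = ("123A", "456B", "789C", "*0#D")
WORDS = dict(zip("0123456789",
                 "zero one two three four five six seven eight nine".split()))

def goertzel_power(samples, freq, rate):
    """Squared magnitude of the signal's spectrum at one frequency."""
    coeff = 2 * math.cos(2 * math.pi * freq / rate)
    s1 = s2 = 0.0
    for x in samples:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    return s1 * s1 + s2 * s2 - coeff * s1 * s2

def detect_key(samples, rate=8000, min_fraction=0.8):
    """Return the DTMF key, or None if the frame looks like non-DTMF audio."""
    total = sum(x * x for x in samples)
    if total == 0:
        return None
    low = max(LOW, key=lambda f: goertzel_power(samples, f, rate))
    high = max(HIGH, key=lambda f: goertzel_power(samples, f, rate))
    tone = 2 * (goertzel_power(samples, low, rate)
                + goertzel_power(samples, high, rate)) / len(samples)
    if tone / total < min_fraction:        # energy is not concentrated in two tones
        return None
    return KEYPAD[LOW.index(low)][HIGH.index(high)]

def dtmf_to_text(samples, rate=8000):
    """Text equivalent to hand to the NLU system, or None for speech."""
    key = detect_key(samples, rate)
    return None if key is None else WORDS.get(key, key)

def tone(freqs, n=400, rate=8000):
    """Synthesize n samples containing the given sine components."""
    return [sum(0.5 * math.sin(2 * math.pi * f * t / rate) for f in freqs)
            for t in range(n)]

key_five = tone((770, 1336))   # the "5" key
hum = tone((300,))             # stand-in for non-DTMF audio
```

A frame classified as speech would go to the speech recognizer instead, so both input types reach the NLU system as text.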
Abstract:
A speech coding apparatus and method uses classification rules to code an utterance while consuming fewer computing resources. The value of at least one feature of an utterance is measured during each of a series of successive time intervals to produce a series of feature vector signals representing the feature values. The classification rules comprise at least first and second sets of classification rules. The first set of classification rules maps each feature vector signal from a set of all possible feature vector signals to exactly one of at least two disjoint subsets of feature vector signals. The second set of classification rules maps each feature vector signal in a subset of feature vector signals to exactly one of at least two different classes of prototype vector signals. Each class contains a plurality of prototype vector signals. According to the classification rules, a first feature vector signal is mapped to a first class of prototype vector signals. The closeness of the feature value of the first feature vector signal is compared to the parameter values of only the prototype vector signals in the first class of prototype vector signals to obtain prototype match scores for the first feature vector signal and each prototype vector signal in the first class. At least the identification value of at least the prototype vector signal having the best prototype match score is output as a coded utterance representation signal of the first feature vector signal.
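The two-stage classification followed by a restricted prototype search can be sketched as below. The 2-D feature vectors, the sign-based rules, the prototype table, and the squared-distance match score are all assumptions chosen to keep the example small; the abstract does not define them.

```python
# Hypothetical prototype classes, each holding several prototype vectors.
PROTOTYPES = {
    ("left", "down"): {"p1": (-1.0, -1.0), "p2": (-2.0, -0.5)},
    ("left", "up"): {"p3": (-1.0, 1.0), "p4": (-2.0, 2.0)},
    ("right", "down"): {"p5": (1.0, -1.0), "p6": (2.0, -2.0)},
    ("right", "up"): {"p7": (1.0, 1.0), "p8": (2.0, 0.5)},
}

def first_rules(v):
    """First rule set: map any vector to exactly one disjoint subset."""
    return "left" if v[0] < 0 else "right"

def second_rules(subset, v):
    """Second rule set: map a vector in a subset to one prototype class."""
    return (subset, "down" if v[1] < 0 else "up")

def match_score(v, prototype):
    """Assumed closeness measure: squared Euclidean distance (lower is better)."""
    return sum((a - b) ** 2 for a, b in zip(v, prototype))

def code_vector(v):
    """Return the ID of the best prototype, searching only one class."""
    prototypes = PROTOTYPES[second_rules(first_rules(v), v)]
    return min(prototypes, key=lambda pid: match_score(v, prototypes[pid]))

coded = [code_vector(v) for v in [(-1.1, -0.9), (1.9, 0.4), (-1.8, 2.2)]]
```

Here each vector is scored against two prototypes instead of all eight, which is where the saving in computation comes from.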
Abstract:
A method and apparatus for modeling words based on match scores representing (a) the closeness of a match between probabilistic word models and the acoustic features of at least two utterances, and (b) the closeness of a match between word models and the spelling of the word. A match score is calculated for a selection set of one or more probabilistic word models. A match score is also calculated for an expansion set comprising the probabilistic word models in the selection set and one probabilistic word model from a candidate set. If the expansion set match score improves the selection set match score by a selected nonzero threshold value, the word is modeled with the word models in the expansion set. If the expansion set match score does not improve the selection set match score by the selected nonzero threshold value, the word is modeled with the word models in the selection set.
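The threshold test between the selection set and the expansion set can be sketched as follows. The scoring function is left abstract in the source, so the lookup table `TOY_SCORES`, the model names, and the reading of "improves by" as "at least the threshold" are assumptions.

```python
def choose_word_models(selection, candidate, score, threshold):
    """Return the expansion set if it beats the selection set's match
    score by at least `threshold`; otherwise keep the selection set."""
    expansion = selection + [candidate]
    if score(expansion) - score(selection) >= threshold:
        return expansion
    return selection

# Hypothetical match scores (higher is better) for sets of word models.
TOY_SCORES = {
    frozenset({"m1"}): 0.60,
    frozenset({"m1", "m2"}): 0.75,
    frozenset({"m1", "m3"}): 0.62,
}

def toy_score(models):
    return TOY_SCORES[frozenset(models)]

kept = choose_word_models(["m1"], "m3", toy_score, threshold=0.1)
grown = choose_word_models(["m1"], "m2", toy_score, threshold=0.1)
```

The nonzero threshold keeps the model set from growing when an extra model yields only a marginal gain.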
Abstract:
A method and system for use in a natural language understanding system for including grammars within a statistical parser. The method involves a series of steps. The invention receives a text input. The invention applies a first context free grammar (CFG) to the text input to determine substrings and corresponding parse trees, wherein the substrings and corresponding parse trees further correspond to the first CFG. Additionally, the invention can examine each possible substring using an inventory of queries corresponding to the CFG.
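Finding the substrings that a CFG covers, along with their parse trees, can be illustrated with a small chart parser. This is a sketch only: the toy grammar (`LEXICAL`, `BINARY`) is hypothetical and restricted to binary and lexical rules, and the integration with the statistical parser's queries is not shown.

```python
# Hypothetical toy grammar: TIME -> HOUR AMPM, with lexical rules below.
LEXICAL = {"five": "HOUR", "six": "HOUR", "am": "AMPM", "pm": "AMPM"}
BINARY = {("HOUR", "AMPM"): "TIME"}

def cfg_spans(tokens):
    """Return {(start, end, symbol): parse_tree} for every substring
    of `tokens` that the grammar can derive (CYK-style chart)."""
    chart = {}
    for i, tok in enumerate(tokens):
        if tok in LEXICAL:
            chart[(i, i + 1, LEXICAL[tok])] = (LEXICAL[tok], tok)
    n = len(tokens)
    for width in range(2, n + 1):
        for start in range(n - width + 1):
            end = start + width
            for mid in range(start + 1, end):
                for (left_sym, right_sym), parent in BINARY.items():
                    left = chart.get((start, mid, left_sym))
                    right = chart.get((mid, end, right_sym))
                    if left and right:
                        chart[(start, end, parent)] = (parent, left, right)
    return chart

tokens = "call me at five pm".split()
chart = cfg_spans(tokens)
time_tree = chart[(3, 5, "TIME")]
```

A query in the parser's inventory could then ask, for example, whether a given span appears in the chart under the symbol `TIME`.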
Abstract:
Methods and apparatus for providing multiple output channels in a microphone. More particularly, provision is made for an arrangement wherein a single microphone is adapted to produce one or more different audio outputs depending upon characteristics of a speaker or user of the microphone while facilitating a high degree of accuracy in the recognition of the user or speaker by the arrangement. The microphone is adapted to produce one or more different audio streams or outputs depending upon the speaker presently using the microphone. In effect, this can be readily implemented by a main user or speaker, such as an interviewer on a radio or TV talk show, or any speaker in a conference room, intending to control the audio output streams by suitably activating a button or switch.
Abstract:
The invention disclosed herein concerns a method of converting speech to text using a hierarchy of contextual models. The hierarchy of contextual models can be statistically smoothed into a language model. The method can include processing text with a plurality of contextual models. Each one of the plurality of contextual models can correspond to a node in a hierarchy of the plurality of contextual models. Also included can be identifying at least one of the contextual models relating to the text and processing subsequent user spoken utterances with the identified at least one contextual model.
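Identifying which contextual model in a hierarchy best relates to a piece of text can be sketched as below. The sketch assumes add-one-smoothed unigram models in which each parent pools its children's counts. That is a simple stand-in, not the statistical smoothing the abstract refers to, and the class `ContextModel`, the function `best_model`, and the toy training text are hypothetical.

```python
import math
from collections import Counter

class ContextModel:
    """A node in the hierarchy holding unigram counts."""

    def __init__(self, name, text="", children=()):
        self.name = name
        self.children = list(children)
        self.counts = Counter(text.split())
        for child in self.children:
            self.counts += child.counts   # parent pools its children's data

    def log_prob(self, words, vocab_size):
        """Add-one-smoothed unigram log-probability of `words`."""
        total = sum(self.counts.values())
        return sum(math.log((self.counts[w] + 1) / (total + vocab_size))
                   for w in words)

    def nodes(self):
        """Yield this node and all of its descendants."""
        yield self
        for child in self.children:
            yield from child.nodes()

def best_model(root, text):
    """Pick the node whose model assigns the text the highest probability."""
    words = text.split()
    vocab_size = len(root.counts)
    return max(root.nodes(), key=lambda m: m.log_prob(words, vocab_size))

finance = ContextModel("finance", "buy stock sell stock shares price")
travel = ContextModel("travel", "book flight hotel flight ticket trip")
root = ContextModel("root", children=[finance, travel])

chosen = best_model(root, "sell shares")
```

Subsequent utterances would then be decoded with the chosen model, falling back toward the root when no specific context fits well.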