Abstract:
A method and system for expanding a word graph into a phone graph. An unknown speech signal is received. A word graph is generated based on an application task or on information extracted from the unknown speech signal. The word graph is expanded into a phone graph, and the unknown speech signal is recognized using the phone graph. The phone graph can be based on a cross-word acoustic model to improve continuous speech recognition. The expanded phone graph can consume less memory than the word graph and can greatly reduce the computational cost of decoding relative to the word graph, thus improving system performance. Furthermore, the phone graph provides a more accurate graph for continuous speech recognition, which can reduce the recognition error rate.
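The expansion step can be illustrated with a minimal sketch. It assumes a word graph stored as a list of labeled edges and a small pronunciation lexicon; both the lexicon entries and the node-naming scheme are hypothetical and are not taken from the abstract. The sketch only shows the basic word-to-phone substitution and does not model cross-word context or the memory savings described above.

```python
from typing import Dict, List, Tuple

# Hypothetical pronunciation lexicon (illustrative only, not from the source).
LEXICON: Dict[str, List[str]] = {
    "call": ["k", "ao", "l"],
    "home": ["hh", "ow", "m"],
    "john": ["jh", "aa", "n"],
}

Edge = Tuple[str, str, str]  # (from_node, to_node, label)


def expand_word_graph(word_edges: List[Edge], lexicon: Dict[str, List[str]]) -> List[Edge]:
    """Replace each word edge with a chain of phone edges."""
    phone_edges: List[Edge] = []
    for idx, (src, dst, word) in enumerate(word_edges):
        phones = lexicon[word]
        prev = src
        for i, phone in enumerate(phones):
            # Intermediate nodes get a name unique to this word edge;
            # the last phone of the word ends at the original target node.
            nxt = dst if i == len(phones) - 1 else f"e{idx}_{i}"
            phone_edges.append((prev, nxt, phone))
            prev = nxt
    return phone_edges


# A tiny word graph: "call" followed by either "home" or "john".
word_graph = [("s", "n1", "call"), ("n1", "f", "home"), ("n1", "f", "john")]
phone_graph = expand_word_graph(word_graph, LEXICON)
```

Here the three word edges become nine phone edges, and the branch point `n1` is preserved, so both continuations still share the phones of "call".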
Abstract:
A method and system for providing a class-based statistical language model representation from rule-based knowledge is disclosed. The class-based language model is generated from a statistical representation of a class-based rule net. The class-based rule net is generated from domain-related rules in which words are replaced with their corresponding, manually defined class tags. The class-based statistical representation from the class-based rule net is combined with a class-based statistical representation from a statistical language model to generate a language model. The language model is enhanced by smoothing and/or adapting it with a general-purpose and/or domain-related corpus for use as the final language model. A two-pass search algorithm is applied for speech decoding.
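The class-tag substitution can be sketched as follows. This is an illustration under stated assumptions: the word-to-class mapping, the example sentences, and the use of bigram counts as the "statistical representation" are all hypothetical choices, since the abstract does not specify them.

```python
from collections import Counter
from typing import Dict, List, Tuple

# Hypothetical, manually defined word-to-class mapping (illustrative only).
CLASS_TAGS: Dict[str, str] = {
    "boston": "CITY",
    "denver": "CITY",
    "monday": "DAY",
    "friday": "DAY",
}


def to_class_sequence(words: List[str], tags: Dict[str, str]) -> List[str]:
    """Replace each word that has a class tag with that tag; keep other words."""
    return [tags.get(w, w) for w in words]


def class_bigram_counts(
    sentences: List[List[str]], tags: Dict[str, str]
) -> Counter:
    """Count bigrams over class-tagged sentences with sentence boundary markers."""
    counts: Counter = Counter()
    for sentence in sentences:
        seq = ["<s>"] + to_class_sequence(sentence, tags) + ["</s>"]
        counts.update(zip(seq, seq[1:]))
    return counts


# Sentences assumed to be generated from domain rules (hypothetical examples).
rule_sentences = [
    ["fly", "to", "boston", "on", "monday"],
    ["fly", "to", "denver", "on", "friday"],
]
counts = class_bigram_counts(rule_sentences, CLASS_TAGS)
```

Because both city names map to `CITY`, the two sentences contribute to the same class bigram `("to", "CITY")`, which is the kind of sharing a class-based model relies on.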
Abstract:
A method and system for providing a statistical representation from rule-based grammar specifications. A language model is generated by obtaining a statistical representation of a rule-based language model and combining it with a statistical representation of a statistical language model. The resulting language model may be enhanced by smoothing and/or adaptation before it is used as the final language model.
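The abstract does not say how the two statistical representations are combined. One common option is linear interpolation, shown here purely as an assumed example; the weight `lam` and the probability values are hypothetical.

```python
from typing import Dict, Tuple

Bigram = Tuple[str, str]


def interpolate(
    p_rule: Dict[Bigram, float], p_stat: Dict[Bigram, float], lam: float
) -> Dict[Bigram, float]:
    """Linearly interpolate two bigram probability tables.

    lam weights the rule-derived model and (1 - lam) the statistical model.
    Bigrams missing from one table are treated as probability 0 in that table.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    keys = set(p_rule) | set(p_stat)
    return {
        k: lam * p_rule.get(k, 0.0) + (1.0 - lam) * p_stat.get(k, 0.0)
        for k in keys
    }


# Hypothetical conditional probabilities P(next word | "to") from each source.
p_rule = {("to", "boston"): 0.5, ("to", "denver"): 0.5}
p_stat = {("to", "boston"): 0.2, ("to", "denver"): 0.1, ("to", "the"): 0.7}
combined = interpolate(p_rule, p_stat, lam=0.5)
```

With both inputs normalized over the same history, the interpolated values also sum to one, so the result remains a valid conditional distribution.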
Abstract:
A search method based on a single triphone tree for a large-vocabulary continuous speech recognizer is disclosed, in which speech signals are received. Tokens are propagated in a phonetic tree to integrate a language model for recognizing the received speech signals. Because the tokens are preserved in tree nodes and record the path history, a single triphone tree can be used in a one-pass search, thereby reducing speech recognition processing time and system resource use.
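Token propagation through a single phonetic tree can be sketched in a heavily simplified form. The assumptions are significant: the tree uses context-independent phones rather than triphones, each frame consumes exactly one phone, and the lexicon, per-frame phone scores, bigram scores, and back-off penalty are all hypothetical. It is not the disclosed algorithm, only an illustration of tokens that carry a score and a word history and re-enter the root at word ends.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Node:
    children: Dict[str, "Node"] = field(default_factory=dict)
    word: Optional[str] = None  # set on nodes where a word ends


@dataclass
class Token:
    score: float              # accumulated log score
    history: Tuple[str, ...]  # words recognized so far


def build_tree(lexicon: Dict[str, List[str]]) -> Node:
    """Build a phone prefix tree so words share common initial phones."""
    root = Node()
    for word, phones in lexicon.items():
        node = root
        for phone in phones:
            node = node.children.setdefault(phone, Node())
        node.word = word
    return root


def keep_best(pool: dict, node: Node, tok: Token) -> None:
    """Keep only the best-scoring token per (node, history) pair."""
    key = (id(node), tok.history)
    if key not in pool or tok.score > pool[key][1].score:
        pool[key] = (node, tok)


def decode(frames: List[Dict[str, float]], root: Node,
           lm: Dict[Tuple[str, str], float], unseen: float = -10.0) -> Optional[Token]:
    active: dict = {}
    keep_best(active, root, Token(0.0, ()))
    for frame in frames:
        nxt: dict = {}
        for node, tok in active.values():
            for phone, child in node.children.items():
                if phone not in frame:
                    continue
                score = tok.score + frame[phone]
                keep_best(nxt, child, Token(score, tok.history))
                if child.word is not None:
                    # Word end: add the language model score and restart at the root.
                    prev = tok.history[-1] if tok.history else "<s>"
                    lm_score = lm.get((prev, child.word), unseen)
                    keep_best(nxt, root, Token(score + lm_score, tok.history + (child.word,)))
        active = nxt
    finished = [tok for node, tok in active.values() if node is root]
    return max(finished, key=lambda t: t.score) if finished else None


# Hypothetical lexicon, per-frame phone log scores, and bigram log scores.
tree = build_tree({"go": ["g", "ow"], "home": ["hh", "ow", "m"]})
frames = [{"g": -1.0, "hh": -5.0}, {"ow": -1.0}, {"hh": -1.0, "g": -4.0}, {"ow": -1.0}, {"m": -1.0}]
lm = {("<s>", "go"): -1.0, ("go", "home"): -1.0}
best = decode(frames, tree, lm)
```

Because each token records its own word history, one tree suffices: the history selects the language model context at each word end, so no per-context copy of the tree is needed.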
Abstract:
A method of transmitting data to a receiver, wherein the data is transmitted using a plurality of sub-carriers, is provided. The method provided includes determining, for each sub-carrier and for each of a plurality of combinations of the sub-carrier and an antenna of a plurality of antennas to be used for transmitting the data, a transmission characteristic of a transmission of the sub-carrier using the antenna; and selecting, for each sub-carrier, an antenna of the plurality of antennas to be used for the transmission of the sub-carrier based on the transmission characteristic of the transmission of the sub-carrier between the antenna and the receiver.
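The per-sub-carrier selection can be sketched as below. The abstract does not name the transmission characteristic, so the sketch assumes it is a channel gain where larger is better; the gain values and the function name `select_antennas` are hypothetical.

```python
from typing import List


def select_antennas(gains: List[List[float]]) -> List[int]:
    """Pick, for each sub-carrier, the antenna with the largest gain.

    gains[k][a] is the assumed channel gain of sub-carrier k when sent
    through antenna a. Ties resolve to the lowest antenna index.
    """
    selection: List[int] = []
    for per_antenna in gains:
        best = max(range(len(per_antenna)), key=lambda a: per_antenna[a])
        selection.append(best)
    return selection


# Three sub-carriers, two antennas (hypothetical measured gains).
gains = [
    [0.9, 0.4],
    [0.2, 0.7],
    [0.5, 0.6],
]
selected = select_antennas(gains)
```

Each sub-carrier is evaluated against every antenna independently, so different sub-carriers of the same transmission may end up on different antennas.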