摘要:
A random multiple-access communication system operates both a feedback-ignored (e.g. ETHERNET) and a feedback-utilized (e.g. STACK) protocol simultaneously. The system is useful for communicating two types of information such as voice and data wherein one of the information types (e.g. voice) is subject to delay constraints. The feedback-utilized protocol has the effect of taking priority over the feedback-ignored protocol in the communication system to provide a priority transmission system.
摘要:
A system and method for reducing power consumption in a wireless transmitter is useful for conserving battery power in wireless communication devices. The method includes storing a voice data packet in a buffer operatively coupled to the wireless transmitter (step 310). A power supply of the wireless transmitter is then cycled between a high power level and a low power level (step 315). The voice data packet is then transmitted from the buffer to the wireless transmitter when the power supply of the wireless transmitter is at the high power level (step 320).
摘要:
A radio communication system includes a voice recognition system (221) for converting (400) a caller's voice message to a textual speech message. The textual speech message is then transmitted to an intended selective call radio (122). To perform these functions, the radio communication system includes a caller interface circuit (218), a transmitter (116), and a processor (222). To perform voice-to-text conversion, the processor is adapted to cause the caller interface circuit to sample a voice signal generated by the caller during a plurality of frame intervals, and to apply a Fourier transform thereto, thereby generating spectral data. The spectral data is subdivided into a plurality of bands. The spectral envelope of the spectral data is then filtered out to generate filtered spectral data. A Fourier transform is applied thereto to generate an autocorrelation function for each band. From the autocorrelation function of each band, a magnitude is determined, which is representative of the degree of voiceness of each band. The degree of voiceness for each band is then applied to a corresponding plurality of phoneme models, which are used to derive a textual equivalent of speech from the voice signal. The textual equivalent of speech is then transmitted to the selective call radio by way of the transmitter.
摘要:
An interactive method for composing an alphanumeric message by a caller using a telephone keypad includes storing (215) a lexical database (135) from which unigram probabilities, forward conditional probabilities, and backward conditional probabilities for a plurality of words can be recovered; storing a received sequence of key codes (405) representing a sequence in which keys on a telephone style keypad are keyed; generating a word trellis including candidate words (415) derived from the sequence and the lexical database; determining a most likely phrase (420) from the candidate words, the unigram probabilities, forward conditional probabilities, and backward conditional probabilities; generating a most likely message (425) from the most likely phrase and presenting the most likely message to the caller; and confirming that the most likely message is the alphanumeric message (430).
摘要:
A voice packet, containing coded voice information for a communication talkgroup that is currently being monitored, is received and stored in a buffer (200-202) dedicated to the communication talkgroup. When a decoder (203-205) is available, the voice packet is decoded and the resulting decoded voice information is combined with other decoded voice information, pertaining to other monitored communication talkgroups, such that the combined decoded voice information can be rendered audible.
摘要:
A communication resource allocator of a trunking communication system may more efficiently handle service requests during extremely busy times in the following manner. While receiving service requests from communication units, the communication resource allocator determines whether it can process the service request within a predetermined period of time. If the communication resource allocator cannot process the service request within a predetermined period it generates a global system busy which indicates that the communication resource allocator cannot individually process or acknowledge additional service requests. Having generated the global system busy signal, the communication resource allocator transmits it to the plurality of communication units. The communication units process the global system busy signal such that they will not transmit nonpriority service requests during the duration of the global system busy signal. While the global system busy signal is active, the communication resource allocator processes received service requests and any service request having a priority service level until the processing time of the non priority service request is at least a portion of the predetermined period of time.
摘要:
A method for animating an image is useful for animating avatars using real-time speech data. According to one aspect, the method includes identifying an upper facial part and a lower facial part of the image (step 705); animating the lower facial part based on speech data that are classified according to a reduced vowel set (step 710); tilting both the upper facial part and the lower facial part using a coordinate transformation model (step 715); and rotating both the upper facial part and the lower facial part using an image warping model (step 720).
摘要:
An electronic device (200) for speech dialog includes functions that receive (205, 105) an utterance that includes an instantiated variable (215), perform voice recognition (210, 115, 120) of the instantiated variable to determine a most likely set of acoustic states (220) and a corresponding sequence of phonemes with stress information (215), determine prosodic characteristics (272, 274, 276, 130) for a synthesized value of the instantiated variable (236) from the sequence of phonemes with stress information and a set of stored prosody models. The electronic device generates (335, 140) a synthesized value of the instantiated variable using the most likely set of acoustic states and the prosodic characteristics of the instantiated variable.
摘要:
A technique is used in a speech encoder (107) that reduces non-speech activity of a low bit rate digital voice message. Speech model parameters that include quantized speech spectral parameter vectors are generated in a sequence of frames. A determination is made as to which frames of the sequence of frames are voiced frames and which frames are unvoiced frames. A consecutive sequence of frames of unvoiced frames is identified (2330) as an unvoiced burst when a length, NUV, of the consecutive sequence of frames exceeds a predetermined length, Ns. A non-speech activity portion of the unvoiced burst is identified (2335-2365) and removed.
摘要:
An interactive method for composing an alphanumeric message by a caller using a telephone keypad includes storing (215) a lexical database (135) from which unigram probabilities, forward conditional probabilities, and backward conditional probabilities for a plurality of words can be recovered; storing a received sequence of key codes (405) representing a sequence in which keys on a telephone style keypad are keyed; generating a word trellis including candidate words (415) derived from the sequence and the lexical database; determining a most likely phrase (420) from the candidate words, the unigram probabilities, forward conditional probabilities, and backward conditional probabilities; generating a most likely message (425) from the most likely phrase and presenting the most likely message to the caller; and confirming that the most likely message is the alphanumeric message (430).