摘要:
A digital signal processor for processing data including voice messaging data that may have both voiced and unvoiced speech components utilizes computer routines stored in a memory used by the digital signal processor. The computer routines programmed provide for control of at least a portion of a selective call receiver; receiving and decoding data received at the selective call receiver; comparing the addresses received at the selective call receiver with addresses stored in a memory location coupled to the digital signal processor; controlling voicing including both voiced and unvoiced speech components; and generating a pitch wave using an inverse discrete Fourier Transform and resample the pitch wave to provide a time domain voiced speech component.
摘要:
A MBE synthesizer (2200) for generating speech from information received by a receiver (114) includes a voiced signal generator (2280) for generating voiced signal components in the time domain using an IDFT in a pitch wave generator (2210) and and a pitch wave resampler (2232) and an unvoiced signal generator (2290) for generating unvoiced signal components in the time domain. The MBE synthesizer also includes a voicing processor (2218) responsive to band voicing flags within the excitation information for controlling selection of a voiced spectral component or an unvoiced spectral component from a harmonic amplitude spectrum.
摘要:
A MBE (Multi-Band Excitation) synthesizer (116) generates excitation components from information received by a receiver (2004). The information received includes spectral information representing a segment of speech. The MBE synthesizer (116) includes an excitation generator (2241) and a nonlinear voicing processor (2211). The excitation generator (2241) generates voiced excitation components and unvoiced excitation components. The nonlinear voicing processor (2211) is responsive to the spectral information and controls a selection of the excitation components from the voiced excitation components and the unvoiced excitation components.
摘要:
A portable device comprises a data storage for storing avatar data defining a user avatar. The user avatar is formed by a plurality of visual objects. The portable device further comprises a camera for capturing an image. A visual characteristic processor is arranged to determine a first visual characteristic from the image and an avatar processor is arranged to set an object visual characteristic of an object of the plurality of visual objects in response to the first visual characteristic. The invention may allow improved customization of user avatars. For example, a color of an element of a user avatar may be adapted to a color of a real-life object simply by a user taking a picture thereof.
摘要:
A pitch determiner (414) for use with a speech analyzer includes a pitch function generator (414) which generates a plurality of pitch components representing a pitch function for one or more sequential segments of speech. which are represented by a predetermined number of digitized speech samples. A pitch enhancer (1116) enhances the pitch function of a current segment of speech utilizing the pitch function of one or more sequential segments of speech to generate a plurality of enhanced pitch components. A pitch detector (1118) detects the pitch of the current segment of speech by determining the pitch of an enhanced pitch component having a largest amplitude of the plurality of enhanced pitch components.
摘要:
Error detection and correction of a received message, such as a digitized voice message is achieved by generating (318) interpolated vectors for each error vector corresponding to a codebook index in a sequence of codebook indexes representing parameters of portions of the message. A plurality of error corrected candidate vectors for the vector corresponding to the codebook index in error, are generated (322,324,326) by flipping one bit in a sequence of bits representing the codebook index in error. The error corrected candidate vector which has a minimal difference from its corresponding interpolated vector is used (338) to replace the error vector. In the case of digital voice, the vectors are spectral vectors which represent spectral information for a time sample of a voice message. An ordering property of vector components is exploited to detect errors in a received codebook index without parity bits.
摘要:
A system controller (106) is for transferring a low bit rate digital voice message. The system controller generates from an analog voice signal representing the voice message a set of speech model parameters, and generates a first derived set of speech model parameters from a first subset of the set of speech model parameters, the first derived set encoding the voice signal at a second voice quality and second vocoder rate that are less, respectively, than a first voice quality and vocoder rate. The system controller transmits (3610) the low bit rate-digital voice message comprising the first derived set of speech model parameters to a communication receiver (114). The communication receiver requests (3640) an incremental message when the quality of the voice message is unsatisfactory. The system controller generates and transmits (3555, 3650) an incremental message-and the communication receiver uses (3660) the incremental message to generate a higher quality voice message.
摘要:
A system controller (106) includes a speech encoder (107) that dynamically segments frames of a low bit rate digital voice message. Speech model parameters have been generated in a sequence of frames. The speech model parameters include quantized speech spectral parameter vectors. The speech encoder selects (1820) a first quantized speech spectral parameter vector as a current anchor vector, selects (1820, 1830) a second quantized speech spectral parameter vector located a predetermined number of frames (LMAX) from the current anchor vector as a target speech parameter vector, and perturbs (1840) the target speech parameter vector to derive a plurality (K) of perturbed speech parameter vectors.
摘要:
A communication system includes a transmitter for transmitting messages to a plurality of receiving devices of the communication system, and a processing system. The processing system is adapted to convert a caller's voice message to a sequence of phonemes whereby the caller's voice message is intended for a receiving device. To accomplish the conversion, steps of Fourier transform, spectral subdivision, envelope filtering autocorrelation function determination of each subdivision, and voiceness determination for each subdivision are performed. The processing system is further adapted to generate a sequence of phoneme indexes and voice features corresponding to the sequence of phonemes, and to cause the transmitter to transmit the sequence of phoneme indexes to the receiving device for generating a voice signal representative of the caller's voice message. The voice features can include spectral features, average energy, duration, and pitch to improve the quality of the voice signal. The receiving device can be a selective call radio.
摘要:
A method and apparatus is provided for a low bit rate speech transmission. Speech spectral parameter vectors are generated from a voice message and stored in a sequence of speech spectral parameter vectors within a speech spectral parameter matrix. A first index identifying a first speech parameter template corresponding to a first speech spectral parameter vector of the sequence of speech spectral parameter vectors is transmitted. A subsequent speech spectral parameter vector of the sequence is selected and a subsequent speech parameter template is determined having a subsequent index. One or more intervening interpolated speech parameter templates are interpolated between the first speech parameter template and the subsequent speech parameter template. The one or more intervening speech spectral parameter vectors are compared to the corresponding one or more intervening interpolated speech parameter templates to derive a distance. The subsequent index is transmitted when the distance derived is less than or equal to a predetermined distance.