摘要:
A selective call communication system (100), such as a paging system (100), and a method therefor, for using a lexicon (134) for voice data entry in the selective call communication system (100) is shown. The system (100) associates a Voice User ID (206) to a subscriber ID (202) for one or more subscribers of the system (100). The system (100) accumulates user system statistics (208). Then, based on the accumulated user system statistics (208), the system (100) defines a lexicon (134) to include at least one Voice User ID (206) for the one or more subscribers for voice data entry for the system (100).
摘要:
This rotary piston engine is to propel twin or double twin pistons to do a rotational run in the round-section cylinder. The characteristic is when the stop-piston is locked, it cannot rotate until being unlocked. Another piece or pairs rotary pistons perform the four processes of the internal combustion engine. They are air absorption, compression, expansion and exhaustion.
摘要:
A radio communication system includes a voice recognition system (218), a transmitter (202) and a processing system (210). The voice recognition system is utilized for receiving caller initiated messages, and the transmitter is used for transmitting messages to a plurality of SCRs (selective call radios) (122) of the radio communication system. The processing system, which is coupled to the voice recognition system, and the transmitter, is adapted to cause the voice recognition system to convert a voice signal representative of a voice message originated by a caller of the radio communication system to a text message (401, 417), wherein the text message is intended for a SCR, to then generate a likelihood of success that the voice signal has been flawlessly converted to a text message, to have a human listen to an audible representation of the voice signal, and to cause the transmitter to transmit the text message to the SCR (432). The converting step includes autocorrelation by Fourier transform, measure a degree of voiceness for each band, applying the degree of voiceness to a corresponding plurality of phenome models, and deriving a text equivalent by searching through a phenome library.
摘要:
An interactive method for composing an alphanumeric message by a caller using a telephone keypad includes storing (215) a lexical database (135) from which unigram probabilities, forward conditional probabilities, and backward conditional probabilities for a plurality of words can be recovered; storing a received sequence of key codes (405) representing a sequence in which keys on a telephone style keypad are keyed; generating a word trellis including candidate words (415) derived from the sequence and the lexical database; determining a most likely phrase (420) from the candidate words, the unigram probabilities, forward conditional probabilities, and backward conditional probabilities; generating a most likely message (425) from the most likely phrase and presenting the most likely message to the caller; and confirming that the most likely message is the alphanumeric message (430).
摘要:
A communication system includes a transmitter for transmitting messages to a plurality of receiving devices of the communication system, and a processing system. The processing system is adapted to convert a caller's voice message to a sequence of phonemes whereby the caller's voice message is intended for a receiving device. To accomplish the conversion, steps of Fourier transform, spectral subdivision, envelope filtering autocorrelation function determination of each subdivision, and voiceness determination for each subdivision are performed. The processing system is further adapted to generate a sequence of phoneme indexes and voice features corresponding to the sequence of phonemes, and to cause the transmitter to transmit the sequence of phoneme indexes to the receiving device for generating a voice signal representative of the caller's voice message. The voice features can include spectral features, average energy, duration, and pitch to improve the quality of the voice signal. The receiving device can be a selective call radio.
摘要:
A radio communication system includes a voice recognition system (221) for converting (400) a caller's voice message to a textual speech message. The textual speech message is then transmitted to an intended selective call radio (122). To perform these functions, the radio communication system includes a caller interface circuit (218), a transmitter (116), and a processor (222). To perform voice-to-text conversion, the processor is adapted to cause the caller interface circuit to sample a voice signal generated by the caller during a plurality of frame intervals, and to apply a Fourier transform thereto, thereby generating spectral data. The spectral data is subdivided into a plurality of bands. The spectral envelope of the spectral data is then filtered out to generate filtered spectral data. A Fourier transform is applied thereto to generate an autocorrelation function for each band. From the autocorrelation function of each band, a magnitude is determined, which is representative of the degree of voiceness of each band. The degree of voiceness for each band is then applied to a corresponding plurality of phoneme models, which are used to derive a textual equivalent of speech from the voice signal. The textual equivalent of speech is then transmitted to the selective call radio by way of the transmitter.
摘要:
An interactive method for composing an alphanumeric message by a caller using a telephone keypad includes storing (215) a lexical database (135) from which unigram probabilities, forward conditional probabilities, and backward conditional probabilities for a plurality of words can be recovered; storing a received sequence of key codes (405) representing a sequence in which keys on a telephone style keypad are keyed; generating a word trellis including candidate words (415) derived from the sequence and the lexical database; determining a most likely phrase (420) from the candidate words, the unigram probabilities, forward conditional probabilities, and backward conditional probabilities; generating a most likely message (425) from the most likely phrase and presenting the most likely message to the caller; and confirming that the most likely message is the alphanumeric message (430).
摘要:
A multiple camera imaging system, comprising: a first camera image sensor configured to obtain a first image of a scene from a first vantage perspective point; a second camera image sensor configured to obtain a second image of the scene from a second vantage perspective point; and an image signal processor (ISP), configured to process the first image and the second image by performing the following steps: producing a first roughly-aligned image from the first image, and a second roughly-aligned image from the second image; using the first and second roughly-aligned images to produce a disparity image; identifying a feature point within the first roughly-aligned image; utilizing the disparity image to create a search zone within the second roughly-aligned image; and identifying a group of candidate feature points within the search zone; identifying within the search zone a best-matched candidate feature point that best matches the feature point within the first roughly-aligned image to form a best-matched feature point pair; and using information from the best-matched feature point pair to further align the first and second roughly-aligned images.
摘要:
A system is provided and includes a buffer, an interface, a processor, and an output device. The interface is configured to receive a packet from a network. The processor is configured to: determine a delay of the network in transmitting the packet; prior to storing the packet in the buffer, determine statistics of the buffer, and an amount the buffer is filled; determine a predetermined delay of the buffer based on the delay of the network, and the statistics; estimate an actual delay of the buffer for the packet based on the amount the buffer is filled; generate an error signal based on the predetermined delay and the actual delay; and based on the error signal, one of compress and expand the packet to change a first length of the packet to a second length. The output device is configured to output the packet based on the second length.
摘要:
The present invention is a method of correcting packet discontinuities using the steps of: (A) generating a continuous real time data stream from input of media content from a media source comprising packets transmitted by way of a computer packet network to a specific receiving device to establish a transmission portion of an end to end communication, (B) a jitter buffer receiving real time data stream packets from the packet network and temporarily storing at least some of them in the jitter buffer, (C) the jitter buffer operating on multiple fixed length packets to output a first output of a predetermined sequence of said fixed length packets, preferably substantially as they were originally transmitted, (D) a control unit receiving the first output and changing the length of one or more of fixed length packets of the first output to form a second output in response to a detected delay or other discontinuity in the packet sequence, (E) a playout buffer receiving the second output and operating on the stream of original and varied length packets to deliver them to a digital to analog converter (DAC), and (F) transmission of analog output of the DAC to interface devices such as displays, speakers, and mechanical devices for intelligible playout of the media content for appreciation by a human interacting with the interface devices.