摘要:
A method for generating a digital motion video sequence at a plurality of bit rates uses a transitional coding source when switching between bitstreams having different bit rates during transmission of a video sequence. The transitional data may be frames coded using reconstructed frames reconstructed for a first bitstream using the characteristics of the second bitstream. These “low bit rate insert frames,” or LBIFs, contain the image characteristics of a signal coded at the lower bit rate. With a bitstream having a higher bit rate being periodically coded using an LBIF, a point of image continuity between the two bitstreams is provided. Thus, switching from one bitstream to the other at this point in the video sequence minimizes the production of artifacts caused by differences in bit rate. In another embodiment of the invention, a separate set of transitional data is created, taking the form of “switch” frames, or S-frames. The S-frames are typically the difference between a frame of a first bitstream and a frame of a second bitstream. These frames are inserted into the decoded bitstream during the transition from one bitstream to the other, and compensate for any visual artifacts that might otherwise occur due to the difference in bit rate of the two bitstreams.
摘要:
A method is provided for performing a fast Discrete Cosine Transform (DCT) and a fast Inverse Discrete Cosine Transform (IDCT) in a software implementation. The method provided exploits symmetries found in both the DCT and IDCT. As a result of the symmetries found in the DCT and IDCT, both transforms may be performed using a combination of look-up tables and butterfly operations, thus employing only a small number of additions and subtractions and no multiplications. Furthermore, there is provided an aspect of the present invention which exploits the excess precision available in current central processing units (CPUs) relative to the precision required by the DCT and IDCT calculations.
摘要:
A digital watermarking method and apparatus allows for the watermarking of a digital video signal in a compressed form, thereby allowing watermarking of a pre-compressed video sequence without requiring the decoding and re-coding of the signal. The watermark signal is a sequence of information bits which has been modulated by a pseudo-random noise sequence to spread it in the frequency domain. The video signal is transform coded, preferably with a discrete cosine transform, and a watermark signal, which has been transform coded using the same type of transform, is added to the coded video signal. The system also includes bitstream control to prevent an increase in the bit rate of the video signal. This allows the system to be used with transmission channels having strict bit rate constraints. For each transform coefficient of the video signal, the number of bits necessary to encode the watermarked coefficient is compared to the number of bits necessary to encode the unwatermarked coefficient. If more bits are required to transmit a watermarked coefficient than to transmit the corresponding unwatermarked coefficient, the watermarked coefficient is not output, and the unwatermarked coefficient is output in its place. When watermarking interframe coded data, a drift compensation signal may be used to compensate for the accumulating variations in the decoded video signal stored at the receiver. The system may also include an encryption/decryption capability, with the watermarking apparatus located at either the transmitting or receiving end of the transmission channel.
摘要:
A method and apparatus for encoding interframe error data in an image transmission system, and in particular in a motion compensated image transmission system for transmitting a sequence of image frames from a transmitter to a receiver, employ hierarchical entropy coded lattice threshold quantization to increase the data compression of the images being transmitted. The method and apparatus decimate an interframe predicted image data and an uncoded current image data, and apply hierarchical entropy coded lattice threshold quantization encoding to the resulting pyramid data structures. Lossy coding is applied on a level-by-level basis for generating the encoded data representation of the image difference between the predicted image data and the uncoded original image. The method and apparatus are applicable to systems transmitting a sequence of image frames (or other pattern data, such as speech) both with and without motion compensation.
摘要:
A method of tracking an inventory of objects via a mobile communications device includes acquiring an image of one or more of the objects via the mobile communications device, which also collects a location of the mobile communications device while acquiring the image of the one or more of the objects. The location and image are transferred from the mobile communications device to a remote server via a wireless network, such that the one or more of the objects are identified at the server based on the image, and the location and identity of the one or more objects are stored on a database associated with the server.
摘要:
A system and method for performing echo suppression on a server in browser-based online audio conferences without downloading or installing software on a participant's computing device is disclosed. Streams of audio communication data from the participants in an audio conference are received at the server. An echo suppression application determines the first party that speaks by analyzing the streams to locate speech data, and assigns that party as the “owner” of the audio channel. The speech data is sent to the other participants in the conference. The application then determines whether newly received audio from the owner of the channel is new speech; if so, then the party remains the owner of the channel, and the new speech data is also sent to the other parties in the conference. The channel is surrendered if no new speech is received from the owner in a defined period, and the next party that speaks becomes the new owner of the channel. The other audio data from the participants is replaced by silence.
摘要:
Packet-based delivery of video content is facilitated using a peer-to-peer network. According to an example embodiment of the present invention, a packet-based video stream is passed over a peer-to-peer network, with the packets in the stream scheduled and delivered in accordance with a priority assigned to the packets; this priority is based upon a determined or estimated effect that the packet will have upon video quality at a recipient peer. In some applications, the packets are temporally spaced to mitigate congestion and/or distortion of video associated with congestion in the delivery of the packets.
摘要:
A variety of methods, systems, devices and configured storage devices are used in relation to peer-to-peer streaming system with a plurality of processing-circuit-peer nodes sharing streaming data by passing the streaming data from parent nodes to child nodes. According to one such system, computer-based nodes are configured and adapted to detect a departure of a first child peer node from the peer-to-peer streaming system. The first child peer node having been a child peer node of the parent peer node and the first child peer having provided data to one or more additional child peers. Responsive to the detected departure, a second child peer is selected to provide data to the one or more additional child peers. Data is provided to the second child peer to facilitate establishment of a connection between the selected child peer and the one or more additional child peers and the parent peer node.
摘要:
A variety of methods, devices, systems and arrangements are implemented for processing and coding of video images. According to one such implementation, a method is implemented for encoding a sequence of images. A plurality of orthogonal transforms is implemented on a set of N images, where N is greater than one. The images are linked by motion fields that include sets of respective portions of the images. In particular, the construction of a motion-compensated orthogonal transform is accomplished for the important case where at least one portion of any of the N images—or any part of this portion—is used more than once to motion-compensate other portions of the N images—or parts thereof.