Abstract:
An inter mode for encoding a video macroblock is selected. An initial analysis of the macroblock's homogeneity is performed to quickly reduce the number of inter modes for which motion estimation and rate distortion calculation are to be performed. An inter mode is selected based on these calculations, and sub-pixel motion estimation, at various sub-pixel interpolation granularities, is performed only on the selected inter mode. Sub-pixel motion estimation may be skipped for macroblocks that exhibit relatively low distortion at the integer pixel level.
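The last two steps can be pictured with a minimal Python sketch. It is an illustration under stated assumptions, not the claimed method: `cost_at` is a hypothetical callback returning the distortion for a candidate motion vector, the half-pel then quarter-pel step sizes are assumed, and `skip_threshold` is an assumed tuning parameter.

```python
def refine_subpixel(cost_at, mv, steps=(0.5, 0.25)):
    """Coarse-to-fine search around mv: half-pel first, then quarter-pel."""
    best, best_cost = mv, cost_at(mv)
    for step in steps:
        center = best  # each granularity searches around the previous winner
        for dy in (-step, 0.0, step):
            for dx in (-step, 0.0, step):
                cand = (center[0] + dx, center[1] + dy)
                cost = cost_at(cand)
                if cost < best_cost:
                    best, best_cost = cand, cost
    return best, best_cost


def estimate_motion(cost_at, integer_mv, skip_threshold):
    """Run sub-pixel refinement only when integer-pel distortion is not already low."""
    distortion = cost_at(integer_mv)
    if distortion <= skip_threshold:
        return integer_mv, distortion  # low distortion: skip sub-pixel search
    return refine_subpixel(cost_at, integer_mv)


# Hypothetical distortion surface whose true minimum sits at (1.25, 2.0).
def toy_cost(mv):
    return 1000.0 * ((mv[0] - 1.25) ** 2 + (mv[1] - 2.0) ** 2)


refined = estimate_motion(toy_cost, (1, 2), skip_threshold=0.0)
skipped = estimate_motion(toy_cost, (1, 2), skip_threshold=100.0)
```

In this sketch the refinement runs once, for the single selected mode, rather than once per candidate mode.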
Abstract:
Transform based distortion cost estimation, which may be used to calculate a rate distortion cost associated with a mode for encoding a macroblock, is described. In one embodiment of the invention, a distortion value for a particular mode is estimated within the transform domain, which allows for the elimination of both inverse transformation and inverse motion functions in this calculation. A spatial domain residual of an encoding mode is estimated by identifying a difference, within the transform domain, between a motion compensated prediction residual of a macroblock and its corresponding reconstructed signal. The estimated spatial domain residual may then be used in distortion matrix computations to estimate a distortion level, within the transform domain, for an encoding mode.
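The reason an inverse transform can be avoided is easiest to see for an orthonormal transform, where the sum of squared differences is identical in both domains. The Python sketch below checks this on a one-dimensional 8-point DCT. It is an illustration only: the orthonormal DCT, the uniform quantizer, and the sample residual are assumptions. A transform that is not orthonormal would need per-coefficient weights, which is presumably the role of the distortion matrix mentioned above.

```python
import math


def dct_matrix(n):
    """Rows of an orthonormal DCT-II matrix of size n x n."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def forward(m, x):
    """Transform spatial samples x into coefficients."""
    return [sum(m[k][i] * x[i] for i in range(len(x))) for k in range(len(m))]


def inverse(m, c):
    """Inverse of an orthonormal transform is its transpose."""
    n = len(c)
    return [sum(m[k][i] * c[k] for k in range(n)) for i in range(n)]


def quantize_dequantize(coeffs, step):
    """Uniform quantizer followed by reconstruction (assumed for illustration)."""
    return [round(v / step) * step for v in coeffs]


def ssd(a, b):
    """Sum of squared differences."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


residual = [5.0, -3.0, 2.0, 7.0, -1.0, 0.0, 4.0, -6.0]  # made-up prediction residual
m = dct_matrix(8)
coeffs = forward(m, residual)
recon_coeffs = quantize_dequantize(coeffs, 4.0)

d_transform = ssd(coeffs, recon_coeffs)             # no inverse transform needed
d_spatial = ssd(residual, inverse(m, recon_coeffs))  # conventional spatial-domain path
```

Both distortion values agree up to floating-point rounding, so the spatial-domain path can be dropped from the rate distortion loop.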
Abstract:
An inter-mode for encoding a video macroblock is selected based on a comparison of at least two rate-distortion values associated with inter-modes. The number of potential inter-modes for encoding the video macroblock is initially reduced based on the rate-distortion calculation of the SKIP mode between the current macroblock and the collocated macroblock (corresponding to a 0,0 motion vector). Motion estimation is performed on the remaining inter-modes to identify reference blocks and motion vectors. The number of calculations performed in identifying the reference blocks and motion vectors is potentially reduced because associations between inter-modes are recognized and leveraged.
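The initial reduction step might look like the following Python sketch. The cost model (SAD plus a Lagrangian rate term), the mode list, the thresholds `low` and `high`, and the one-bit SKIP rate are all assumptions chosen for illustration; the abstract does not specify them.

```python
ALL_INTER_MODES = ["SKIP", "16x16", "16x8", "8x16", "8x8"]


def sad(current, reference):
    """Sum of absolute differences between two equally sized pixel lists."""
    return sum(abs(a - b) for a, b in zip(current, reference))


def skip_cost(current, collocated, lam, skip_bits=1):
    """Rate-distortion cost J = D + lambda * R for SKIP at motion vector (0,0)."""
    return sad(current, collocated) + lam * skip_bits


def reduce_modes(current, collocated, lam=10.0, low=256.0, high=2048.0):
    """Prune the candidate inter modes using only the SKIP cost."""
    cost = skip_cost(current, collocated, lam)
    if cost < low:
        return ["SKIP"]                            # nearly static macroblock
    if cost < high:
        return ["SKIP", "16x16", "16x8", "8x16"]   # drop the smallest partitions
    return list(ALL_INTER_MODES)                   # keep every mode


current_mb = [100] * 256  # flattened 16x16 macroblock
static = reduce_modes(current_mb, [100] * 256)
moderate = reduce_modes(current_mb, [103] * 256)
busy = reduce_modes(current_mb, [120] * 256)
```

Motion estimation then runs only on the modes that survive the pruning.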
Abstract:
A method for reducing the memory requirements for decoding a bit stream is provided. The method initiates with receiving a video bit stream. Then, a frame of the bit stream is decoded into a discrete cosine transform (DCT) domain representation. Next, non-zero coefficients of the DCT domain representation are identified. Then, a hybrid data structure is assembled. The hybrid data structure includes a fixed size array and a variable size overflow vector. Next, the non-zero coefficients of the DCT domain representation are inserted into the hybrid data structure. A computer readable medium, a printed circuit board and a device configured to decode video data are also provided.
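One possible shape for such a hybrid structure is sketched below in Python. The class name `HybridBlock`, the capacity `FIXED_SLOTS`, and the (index, value) entry layout are hypothetical; the abstract states only that a fixed size array is paired with a variable size overflow vector.

```python
from dataclasses import dataclass, field

FIXED_SLOTS = 8  # assumed capacity of the fixed-size part


@dataclass
class HybridBlock:
    """Non-zero coefficients of one DCT block, stored as (index, value) pairs."""
    fixed: list = field(default_factory=lambda: [None] * FIXED_SLOTS)
    overflow: list = field(default_factory=list)
    count: int = 0

    def insert(self, index, value):
        """Fill the fixed array first; spill into the overflow vector after that."""
        if self.count < FIXED_SLOTS:
            self.fixed[self.count] = (index, value)
        else:
            self.overflow.append((index, value))
        self.count += 1

    def to_dense(self, size=64):
        """Rebuild the full coefficient list, with zeros restored."""
        dense = [0] * size
        used = self.fixed[:min(self.count, FIXED_SLOTS)]
        for index, value in used + self.overflow:
            dense[index] = value
        return dense


def pack_block(coeffs):
    """Identify the non-zero coefficients and insert them into a HybridBlock."""
    block = HybridBlock()
    for index, value in enumerate(coeffs):
        if value != 0:
            block.insert(index, value)
    return block


coeffs = [0] * 64
for position in range(10):          # ten non-zero coefficients: two must overflow
    coeffs[position * 3] = position + 1
packed = pack_block(coeffs)
```

Most blocks in typical video have few non-zero coefficients, so under this assumption the overflow vector stays empty for the common case and memory use is dominated by the small fixed array.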
Abstract:
A 3-D dot code is embedded in a printed medium using an embedder that embeds data directly into the halftone dots of a selected file that is ready for printing. An extractor extracts the embedded bits by scanning the printed medium and then interprets the embedded data. A bleed-through elimination module in the extractor is able to eliminate adverse effects on the scanned input of the embedded data caused by bleed-through from the print on the opposite side of the page. The extractor may also include a module that compensates for a misaligned scanned input by automatically detecting the rotation and translation, and then registering the image so that the upper-left corner is located in a certain position. This operation expedites subsequent data retrieval, in which data is read by block matching.
Abstract:
Smart printing/copying techniques are provided for controlling the printing/copying and editing of a document based on document-descriptive and control information embedded in the printed output. These “smart” printing/copying techniques and controls allow a document user to control printing/copying costs by making tradeoffs with respect to quality and speed based on the embedded data. Such costs may be controlled, for example, by initially printing low cost (low quality, high speed) pages for distribution, while giving subsequent recipients of the document the option of making high quality (higher cost, possibly slower) copies of any or all of the pages in the document, if desired. With these features, a user is able to obtain high quality copies from low quality originals. The techniques of the present invention also allow a recipient to edit page content prior to reprinting, without loss of print output quality.
Abstract:
A method for adaptively filtering a video signal prior to encoding is provided. The method initiates with calculating a local gradient indicative of a region type. Then, a weight factor is determined based upon the local gradient. Next, the weight factor is applied to a difference signal according to the region type. A computer readable medium, an integrated circuit and a system for processing an image data signal are also provided.
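The three steps can be sketched on a one-dimensional signal as follows. Everything specific here is an assumption made for illustration: the central-difference gradient, the linear weight mapping, the `edge_threshold` value, and the choice of a 1-2-1 smoother to form the difference signal.

```python
def adaptive_prefilter(signal, edge_threshold=20.0):
    """Smooth flat regions strongly and leave edges nearly untouched."""
    out = list(signal)
    for i in range(1, len(signal) - 1):
        # Step 1: local gradient, used here as the indicator of region type.
        gradient = abs(signal[i + 1] - signal[i - 1]) / 2.0
        # Step 2: weight factor, 1.0 in flat regions and falling to 0.0 at edges.
        weight = max(0.0, 1.0 - gradient / edge_threshold)
        # Step 3: apply the weight to the difference between smoothed and original.
        smoothed = (signal[i - 1] + 2 * signal[i] + signal[i + 1]) / 4.0
        difference = smoothed - signal[i]
        out[i] = signal[i] + weight * difference
    return out


# Small ripple followed by a sharp edge.
noisy = [10, 12, 10, 12, 10, 100, 100, 100]
filtered = adaptive_prefilter(noisy)
```

In this example the ripple at the start is flattened while the samples on either side of the step edge keep their original values.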
Abstract:
A layered presentation system (LAPE) includes a server that performs compressed-domain image processing on image data received from multiple clients including a master client and other clients to generate a composite image that incorporates imagery from the other clients with a master image from the master client for viewing on a shared display. The system's clients can add imagery in the form of questions, comments, and graphics to a currently displayed image. The added imagery is processed along with the master image to generate the composite image that then appears on the shared display and perhaps also on each client's individual display. The processing includes scaling the master image/added imagery, as required, and blending and/or overlaying the added imagery onto the master image so as to augment but not obscure it. A network protocol is included for sending image data in the compressed domain back and forth between the server and each of the clients.
Abstract:
The temporal and/or spatial characteristics of a macroblock are analyzed in order to reduce the number of modes for which motion estimation and rate distortion efficiency calculations are to be performed. In one embodiment, macroblock mean and variance characteristics are analyzed to merge sub-blocks together within the macroblock. These merged sub-blocks may be used to identify both inter and intra modes for the macroblock.
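A minimal Python sketch of the mean and variance analysis is shown below, assuming a 16x16 macroblock split into four 8x8 sub-blocks. The tolerances `mean_tol` and `var_tol` and the mapping from merge results to partition names are hypothetical; the abstract does not give them.

```python
def block_stats(mb, top, left, size=8):
    """Mean and variance of one size x size sub-block of the macroblock."""
    pixels = [mb[r][c] for r in range(top, top + size)
              for c in range(left, left + size)]
    mean = sum(pixels) / len(pixels)
    variance = sum((p - mean) ** 2 for p in pixels) / len(pixels)
    return mean, variance


def similar(a, b, mean_tol=4.0, var_tol=16.0):
    """Two sub-blocks are merge candidates when both statistics are close."""
    return abs(a[0] - b[0]) <= mean_tol and abs(a[1] - b[1]) <= var_tol


def candidate_partition(mb):
    """Pick the partition whose shape matches the merged sub-blocks."""
    tl, tr = block_stats(mb, 0, 0), block_stats(mb, 0, 8)
    bl, br = block_stats(mb, 8, 0), block_stats(mb, 8, 8)
    rows_merge = similar(tl, tr) and similar(bl, br)  # left/right neighbours merge
    cols_merge = similar(tl, bl) and similar(tr, br)  # top/bottom neighbours merge
    if rows_merge and cols_merge:
        return "16x16"
    if rows_merge:
        return "16x8"   # two 16-wide, 8-tall halves
    if cols_merge:
        return "8x16"   # two 8-wide, 16-tall halves
    return "8x8"


flat = [[128] * 16 for _ in range(16)]
top_bottom = [[50] * 16 for _ in range(8)] + [[200] * 16 for _ in range(8)]
left_right = [[50] * 8 + [200] * 8 for _ in range(16)]
```

Only the partition returned by `candidate_partition` would then go through motion estimation and rate distortion evaluation in this sketch.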