摘要:
A digital image processing method reduces noise and blocking artifacts in a digital image by first converting the RGB values of the digital image pixels to Y, Cb and Cr components, then detecting the block boundaries in the Y, Cb and Cr image components, and estimating the noise in the Y, Cb and Cr image components. One or more noise tables are constructed for the Y, Cb and Cr image components. An adaptive Huber-Markov-random-field-model-based filter (HMRF) is applied to the Y, Cb and Cr image components, wherein the adaptive feature of the HMRF employs the detected block boundaries and the noise tables to produce filtered Y, Cb and Cr image components. Finally, the filtered Y, Cb and Cr image components are converted to RGB components.
摘要:
A digital image processing method determines the extent of blocking artifacts in a digital image by first forming a column difference image and averaging the values in the columns in the column difference image to produce a column difference array. The average of the values in the column difference array that are separated by one block width are computed to produce a block averaged column difference array. Then, the peak value in the block averaged column difference array is located, and the mean value of the block averaged column difference array (excluding the peak value) is calculated to produce a column base value, and the ratio between the peak value and the base value are computed to produce a column ratio. The foregoing steps are repeated in the row direction to produce a row ratio. Finally, the column and row ratios are employed as a measure of the extent of blocking artifacts in the digital image.
摘要:
A method for producing a compressed bit-stream from a digital image includes the steps of a) processing the digital image to produce a main subject belief map containing a continuum of belief values relating to the importance of subject and background regions in the image, b) performing a spatio-frequency transformation on the digital image to produce an array of transform coefficients, c) deriving a distortion-weighting factor for each transform coefficient from the belief map, and d) producing a compressed bit stream using an image compression system that is responsive to the distortion-weighting factors. The specific image compression system may be selected from a variety of image compression systems, including JPEG compression, JPEG2000 compression or vector quantization. The production of the main subject belief map involves the extraction of at least one structural saliency feature and at least one semantic saliency feature from the image, and the integration of the structural saliency feature and the semantic feature using a probabilistic reasoning engine into an estimate of a belief that a region of the digital image is the main subject.
摘要:
During the prediction stage of a video encoding and/or decoding process, a video coder can use relatively longer filters for certain motion vectors pointing to certain sub-pixel positions and relatively shorter filters for motion vectors pointing to other sub-pixel positions, where a longer filter generally refers to an interpolation filter with a greater number of filter coefficients, also called taps, while a shorter filter generally refers to an interpolation filter with fewer taps.
摘要:
A method for simultaneously recording motion and still images, includes the steps of: capturing a motion image sequence and accompanying audio of a scene with a digital video camera adapted to record both motion and higher resolution still images; simultaneously capturing a still image sequence having a higher resolution and lower frame rate than the motion capture sequence; compressing the motion image sequence using interframe compression and the accompanying audio and storing the compressed motion image and audio data; and compressing the still images using intraframe coding and storing the compressed still image data.
摘要:
A method for encoding rate-distortion information associated with the compression of an input digital image includes the steps of: computing rate and distortion-reduction values associated with each coding pass of each compressed codeblock bit-stream, and encoding rate and distortion-reduction values associated with coding passes contained in the final compressed bit-stream. A method for using encoded rate-distortion information associated with a compressed digital image bit-stream during transcoding of said compressed digital image bitstream includes the steps of: parsing the compressed digital image bit-stream to obtain compressed codeblock bit-streams, decoding the encoded RD information to obtain rate and distortion-reduction values associated with codeblock coding passes, and using such rate-distortion information to optimally transcode such compressed digital image bit-stream to form a new compressed digital image bit-stream at a given bit-rate, resolution, and for given visual weights.
摘要:
In general, techniques are described for implementing a 16-point inverse discrete cosine transform (IDCT) that is capable of applying multiple IDCTs of different sizes. For example, an apparatus comprising a 16-point inverse discrete cosine transform of type II (IDCT-II) unit may implement the techniques of this disclosure. The 16-point IDCT-II unit performs these IDCTs-II of different sizes to transform data from a spatial to a frequency domain. The 16-point IDCT-II unit includes an 8-point IDCT-II unit that performs one of the IDCTs-II of size 8 and a first 4-point IDCT-II unit that performs one of the IDCTs-II of size 4. The 8-point IDCT-II unit includes the first 4-point DCT-II unit. The 16-point IDCT-II unit also comprises an inverse 8-point DCT-IV unit that includes a second 4-point IDCT-II unit and a third 4-point IDCT-II unit. Each of the second and third 4-point IDCT-II units performs one of the IDCTs-II of size 4.
摘要:
This disclosure describes video encoding and decoding techniques in which a first order prediction process and a second order prediction process are used in combination to generate predictive video blocks for video coding. First order prediction may be similar to conventional motion estimation and motion compensation that generates residual video blocks. The second order prediction may involve a process similar to conventional intra-prediction, but is performed on the residual video blocks. The techniques of this disclosure may pre-define the second order prediction to a specific mode, such as a mode similar to the intra-DC mode used in intra coding. In addition, the techniques of this disclosure may combine aspects of the first order and second order prediction into a single process so that the effects of second order prediction on the residuals are taken into account during the first order prediction process, which may improve compression.
摘要:
In general, techniques are described for implementing an 8-point inverse discrete cosine transform (IDCT). An apparatus comprising an 8-point inverse discrete cosine transform (IDCT) hardware unit may implement these techniques to transform media data from a frequency domain to a spatial domain. The 8-point IDCT hardware unit includes an even portion comprising factors A, B that are related to a first scaled factor (μ) in accordance with a first relationship. The 8-point IDCT hardware unit also includes an odd portion comprising third, fourth, fifth and sixth internal factors (G, D, E, Z) that are related to a second scaled factor (η) in accordance with a second relationship. The first relationship relates the first scaled factor to the first and second internal factors. The second relationship relates the second scaled factor to the third, fourth, fifth and sixth internal factors.
摘要:
A system renders oblique slices through volumetric data accessed via a network using a client-server architecture. The system includes a server for processing and storing volumetric data comprising axial slices obtained from a diagnostic scanning system, a client for processing user requests related to specific views of the volumetric data, and a network protocol for connecting the client with the server over the network and obtaining data from the server for use by the client. A processing stage at the client specifies an oblique slice and communicates particulars of the oblique slice to the server, thereupon obtaining axial slice data from the server specifically for portions of the axial slices that are needed to render the oblique slice. Memory at the client stores the axial slice data, and a rendering stage at the client renders the oblique slice from the axial slice data in the memory.