Abstract:
Embodiments include systems and methods of improving processing in a decoder in a multimedia transmission system. Multimedia data may include one or more of motion video, audio, still images, or any other suitable type of audio-visual data. One embodiment includes a system and method of parsing fixed and variable length codes from a bit-wise stream of multimedia data by maintaining and reading the data in a processor-aligned (e.g., word-aligned) way. Another embodiment includes a system and method of decoding variable length codes, particularly as received from such a stream, using multiple levels of decoding tables to speed up decoder table lookup.
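The two ideas in this abstract, word-aligned reading and multi-level table lookup, can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the disclosure: the `BitReader` class, the 32-bit refill size, the 3-bit first-level table width, and the tagged-tuple table layout. Invalid codes are not handled.

```python
class BitReader:
    """Reads bits MSB-first, refilling its cache one 32-bit word at a time."""

    def __init__(self, data: bytes):
        # Pad to a multiple of 4 bytes so every refill loads a whole word.
        self.data = data + b"\x00" * (-len(data) % 4)
        self.pos = 0      # byte offset of the next word to load
        self.cache = 0    # unread bits, right-aligned
        self.nbits = 0    # number of valid bits in the cache

    def _refill(self, n):
        while self.nbits < n:
            # Past the end of the data this loads zero bits.
            word = int.from_bytes(self.data[self.pos:self.pos + 4], "big")
            self.pos += 4
            self.cache = (self.cache << 32) | word
            self.nbits += 32

    def peek(self, n):
        self._refill(n)
        return (self.cache >> (self.nbits - n)) & ((1 << n) - 1)

    def skip(self, n):
        self._refill(n)
        self.nbits -= n
        self.cache &= (1 << self.nbits) - 1

    def read(self, n):
        value = self.peek(n)
        self.skip(n)
        return value


def build_tables(codes, first_bits=3):
    """Build a two-level lookup table from {symbol: bit string} (prefix-free)."""
    level1 = [None] * (1 << first_bits)
    long_codes = {}
    for sym, bits in codes.items():
        if len(bits) <= first_bits:
            # Short code: replicate the entry over every possible suffix.
            pad = first_bits - len(bits)
            base = int(bits, 2) << pad
            for i in range(1 << pad):
                level1[base + i] = ("leaf", sym, len(bits))
        else:
            long_codes.setdefault(bits[:first_bits], []).append(
                (sym, bits[first_bits:]))
    for prefix, group in long_codes.items():
        # Long codes sharing a prefix get their own second-level table.
        width = max(len(rest) for _, rest in group)
        sub = [None] * (1 << width)
        for sym, rest in group:
            pad = width - len(rest)
            base = int(rest, 2) << pad
            for i in range(1 << pad):
                sub[base + i] = ("leaf", sym, len(rest))
        level1[int(prefix, 2)] = ("table", sub, width)
    return level1


def decode_symbol(reader, level1, first_bits=3):
    kind, payload, n = level1[reader.peek(first_bits)]
    if kind == "leaf":
        reader.skip(n)            # consume only the code's real length
        return payload
    reader.skip(first_bits)       # consume the shared prefix
    _, sym, length = payload[reader.peek(n)]
    reader.skip(length)
    return sym
```

Short codes resolve with one lookup; only codes longer than `first_bits` pay for a second lookup, which keeps the first-level table small.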
Abstract:
This system adaptively assigns picture types used for temporal compression to frames of streaming video at the input. Based on threshold testing of two metrics that are measures of distance between the frames at the input, a frame may be assigned to be compressed as an I, P, or B frame or be skipped over by the system without being coded at all.
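As a rough illustration of threshold-based picture type assignment, the sketch below uses two distance measures: the difference between a frame and its predecessor, and the difference accumulated since the last reference frame. The metrics, the threshold values, and the function name `assign_frame_types` are all hypothetical; the abstract does not specify them. Real B frames also need a later reference frame, which this simplified in-order pass ignores.

```python
def assign_frame_types(frame_diffs, skip_thr=0.5, p_thr=4.0, scene_thr=10.0):
    """Assign 'I', 'P', 'B', or 'skip' to each frame.

    frame_diffs[i] is an assumed distance between frame i+1 and frame i.
    All thresholds are illustrative values, not taken from the source.
    """
    types = ["I"]          # the first frame has no predecessor
    accumulated = 0.0      # distance accumulated since the last reference
    for d in frame_diffs:
        accumulated += d
        if d >= scene_thr:
            types.append("I")       # large jump: treat as a scene change
            accumulated = 0.0
        elif d < skip_thr:
            types.append("skip")    # nearly identical to the previous frame
        elif accumulated >= p_thr:
            types.append("P")       # enough drift to warrant a new reference
            accumulated = 0.0
        else:
            types.append("B")
    return types


print(assign_frame_types([1.0, 1.0, 2.5, 0.1, 12.0, 1.0]))
```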
Abstract:
This disclosure describes techniques for improving mode selection decisions during the encoding of macroblocks (or other blocks) of multimedia frames of a multimedia sequence. During motion estimation, the encoding modes for macroblocks can be determined so that a desirable encoding rate and acceptable levels of distortion (i.e., acceptable rate-distortion) can be achieved. The techniques may include selecting a set of multimedia coding modes between at least two sets of possible multimedia coding modes for a macroblock of a multimedia frame based on a detail metric associated with the macroblock and mode information associated with neighboring blocks to the macroblock.
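One way to picture the selection between two sets of candidate modes is the sketch below. It assumes pixel variance as the detail metric and a simple "any neighbor used a small partition" test as the neighbor rule; the mode names, the threshold, and the function `candidate_modes` are illustrative assumptions rather than the disclosed method.

```python
from statistics import pvariance

# Hypothetical mode sets: a small set for smooth regions, a larger one otherwise.
COARSE_MODES = ("SKIP", "INTER_16x16", "INTRA_16x16")
FINE_MODES = COARSE_MODES + ("INTER_16x8", "INTER_8x16", "INTER_8x8", "INTRA_4x4")
SMALL_PARTITION_MODES = {"INTER_8x8", "INTRA_4x4"}


def candidate_modes(pixels, neighbor_modes, detail_thr=100.0):
    """Pick which set of modes to search for one macroblock.

    pixels: flat sequence of luma samples for the macroblock.
    neighbor_modes: modes already chosen for neighboring blocks.
    detail_thr: assumed variance threshold separating smooth from detailed.
    """
    detail = pvariance(pixels)  # assumed detail metric: population variance
    neighbors_fine = any(m in SMALL_PARTITION_MODES for m in neighbor_modes)
    if detail >= detail_thr or neighbors_fine:
        return FINE_MODES
    return COARSE_MODES
```

Restricting smooth blocks to the coarse set reduces the number of modes whose rate-distortion cost must be evaluated during motion estimation.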
Abstract:
The invention comprises devices and methods for processing multimedia data to generate progressive frame data from interlaced frame data. In one aspect, a method of processing multimedia data includes generating spatio-temporal information for a selected frame of interlaced multimedia data, generating motion information for the selected frame, and deinterlacing fields of the selected frame based on the spatio-temporal information and the motion information to form a progressive frame associated with the selected frame. In another aspect, an apparatus for processing multimedia data can include a spatial filter module configured to generate spatio-temporal information of a selected frame of interlaced multimedia data, a motion estimator configured to generate motion information for the selected frame, and a deinterlacer configured to deinterlace fields of the selected frame and form a progressive frame corresponding to the selected frame based on the spatio-temporal information and the motion information.
Abstract:
A technique for processing video to determine which segments of video originate in a telecine and which conform to the NTSC standard is described herein. The current pull-down phase of the 3:2 pull-down (see below) in a telecine-generated video segment is estimated and used to invert the telecine process.
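In 3:2 pull-down, a repeated top field recurs once every five video frames, so one simple way to estimate the pull-down phase is to find the position in the five-frame cadence where the top field differs least from the previous frame's top field. The sketch below takes that approach; it is a generic illustration under that assumption, not the estimator described in the disclosure, and the per-frame difference values are assumed to be computed elsewhere.

```python
def estimate_pulldown_phase(top_field_diffs, period=5):
    """Return the cadence position (0..period-1) with the smallest mean difference.

    top_field_diffs[i] is an assumed measure of how much the top field of
    frame i differs from the top field of frame i-1; a repeated field
    gives a value near zero.
    """
    sums = [0.0] * period
    counts = [0] * period
    for i, d in enumerate(top_field_diffs):
        sums[i % period] += d
        counts[i % period] += 1
    # Positions with no samples cannot be the phase.
    means = [s / c if c else float("inf") for s, c in zip(sums, counts)]
    return min(range(period), key=means.__getitem__)


diffs = [9, 8, 0.1, 7, 9, 8, 9, 0.2, 8, 7]
print(estimate_pulldown_phase(diffs))
```

A segment whose smallest mean is not clearly below the others would more plausibly be native video than telecine output; that decision step is left out of the sketch.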
Abstract:
Methods and apparatus efficiently encode multimedia data, such as live video streams. The encoding complexity of a predetermined time interval, such as 1 second, is estimated before the actual encoding is performed. This permits the actual encoding to proceed with an a priori estimate of complexity, so that the bits budgeted for the predetermined time interval (bit rate) can be distributed efficiently within that interval. Moreover, the estimated complexity can be provided to a device, such as a multiplexer, which can then allocate the available bandwidth for a collection of multiplexed video channels according to the encoding complexity anticipated for those video channels, which then permits the quality of a particular channel to remain relatively constant even when the bandwidth for the collection of multiplexed video channels is relatively constant.
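The multiplexer's role can be sketched as a proportional split of a fixed total bandwidth according to each channel's estimated complexity. Proportional allocation is an assumption chosen for simplicity; the abstract only says the allocation is made "according to" the anticipated complexity. The function name and units are hypothetical.

```python
def allocate_bandwidth(complexities, total_kbps):
    """Split total_kbps across channels in proportion to estimated complexity.

    complexities: one non-negative complexity estimate per channel for the
    upcoming interval (how it is estimated is outside this sketch).
    """
    total = sum(complexities)
    if total == 0:
        # No information: fall back to an equal split.
        return [total_kbps / len(complexities)] * len(complexities)
    return [total_kbps * c / total for c in complexities]


# Three channels; the third is expected to be twice as hard to encode.
print(allocate_bandwidth([1, 1, 2], 4000))
```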
Abstract:
Apparatus and methods of using content information for encoding multimedia data are described. A method of processing multimedia data includes obtaining content information of the multimedia data and encoding the multimedia data to form an encoded bitstream, the encoding comprising inserting at least one redundant block of intra-coded data into the encoded bitstream based on the content information. In some aspects, the encoding comprises encoding the multimedia data into a base layer and an enhancement layer, and the intra-coded data is encoded using the quantization parameter for the base layer. In other aspects, the intra-coded data is encoded using the quantization parameter for the enhancement layer.
Abstract:
This disclosure is directed to techniques for encoding and decoding transitional effects, i.e., visual video effects that are used to transition from a current scene of a multimedia sequence. According to the disclosed techniques, an encoding device detects a transitional effect associated with a multimedia sequence during the encoding of the multimedia sequence, and transmits information as part of an encoded multimedia sequence to identify the transitional effect associated with the encoded multimedia sequence to a decoder. The information may comprise metadata that can be used by the decoder to simulate or re-create the transitional effect. The decoder simulates a transitional effect in response to the information.
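To make the decoder side concrete, the sketch below re-creates a transition from metadata alone, assuming the effect is a linear crossfade and that the metadata carries only the number of transition frames. The effect type, the metadata fields, and the function `simulate_crossfade` are illustrative assumptions; the abstract does not define the metadata format.

```python
def simulate_crossfade(last_frame, next_frame, num_frames):
    """Synthesize num_frames intermediate frames between two decoded frames.

    Frames are flat lists of sample values of equal length. The blend
    weight rises linearly from the outgoing scene to the incoming scene.
    """
    frames = []
    for k in range(1, num_frames + 1):
        alpha = k / (num_frames + 1)  # weight of the incoming scene
        frames.append([
            round((1 - alpha) * a + alpha * b)
            for a, b in zip(last_frame, next_frame)
        ])
    return frames


# Hypothetical metadata as it might arrive in the encoded sequence.
metadata = {"effect": "crossfade", "num_frames": 3}
if metadata["effect"] == "crossfade":
    print(simulate_crossfade([0, 0], [100, 200], metadata["num_frames"]))
```

Because the intermediate frames are synthesized at the decoder, the encoder in this sketch would only need to send the two bounding frames and the metadata rather than coding each transition frame.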
Abstract:
Methods and apparatus encode video at a targeted bit rate and yet permit variation of a Quantization Parameter (QP) to encode video of varying complexity with relatively consistent visual quality. Constant bit rate (CBR) encoding is desirable in many applications, such as in transmission or broadcasting environments. However, conventional CBR techniques compromise visual quality. Disclosed techniques permit adaptive variation in a QP value and provide the improved visual encoding available in variable bit rate (VBR) schemes while maintaining enough adherence to a targeted bit rate to be applicable to CBR environments.
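A generic feedback rule shows how a QP can be allowed to vary while still tracking a bit-rate target. This is a minimal sketch of a common rate-control pattern, not the disclosed technique: the tolerance band, the step sizes, and the 0 to 51 QP range (the range used by H.264) are assumptions, as is the function name `next_qp`.

```python
def next_qp(qp, bits_used, bits_target, qp_min=0, qp_max=51):
    """Nudge the QP for the next interval based on the last interval's bit usage.

    A higher QP means coarser quantization and fewer bits. The QP is left
    unchanged while usage stays within 10% of the target, which lets it
    follow content complexity instead of reacting to every fluctuation.
    """
    ratio = bits_used / bits_target
    if ratio > 1.5:
        qp += 2      # far over budget: back off faster
    elif ratio > 1.1:
        qp += 1
    elif ratio < 0.9:
        qp -= 1      # under budget: spend bits on quality
    return max(qp_min, min(qp_max, qp))


print(next_qp(30, 1200, 1000))
```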
Abstract:
Apparatus and methods of using content information for encoding multimedia data are described. A method of processing multimedia data includes receiving multimedia data and encoding the multimedia data into a first data group and a second data group based on content of the multimedia data. The first data group is configured to be decodable independently of the second data group, and the first and second data groups are encoded at different quality levels. The method can also include classifying the content of the multimedia data and encoding the multimedia data based on the content classification.