Invention Grant
US07680190B2 Video coding system and method using 3-D discrete wavelet transform and entropy coding with motion information
失效
使用3-D离散小波变换和运动信息熵编码的视频编码系统和方法
- Patent Title: Video coding system and method using 3-D discrete wavelet transform and entropy coding with motion information
- Patent Title (中): 使用3-D离散小波变换和运动信息熵编码的视频编码系统和方法
-
Application No.: US10984467Application Date: 2004-11-09
-
Publication No.: US07680190B2Publication Date: 2010-03-16
- Inventor: Jizheng Xu , Shipeng Li , Ya-Qin Zhang
- Applicant: Jizheng Xu , Shipeng Li , Ya-Qin Zhang
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Lee & Hayes, PLLC
- Main IPC: H04N7/12
- IPC: H04N7/12 ; H04N11/02 ; H04N11/04 ; H04B1/66

Abstract:
A video encoding system and method utilizes a three-dimensional (3-D) wavelet transform and entropy coding that utilize motion information in a way to reduce the sensitivity to motion. In one implementation, the coding process initially estimates motion trajectories of pixels in a video object from frame to frame in a video sequence to account for motion of the video object throughout the frames. After motion estimation, a 3-D wavelet transform is applied in two parts. First, a temporal 1-D wavelet transform is applied to the corresponding pixels along the motion trajectories in a time direction. The temporal wavelet transform produces decomposed frames of temporal wavelet transforms, where the spatial correlation within each frame is well preserved. Second, a spatial 2-D wavelet transform is applied to all frames containing the temporal wavelet coefficients. The wavelet transforms produce coefficients within different sub-bands. The process then codes wavelet coefficients. In particular, the coefficients are assigned various contexts based on the significance of neighboring samples in previous, current, and next frame, thereby taking advantage of any motion information between frames. The wavelet coefficients are coded independently for each sub-band to permit easy separation at a decoder, making resolution scalability and temporal scalability natural and easy. During the coding, bits are allocated among sub-bands according to a technique that optimizes rate-distortion characteristics.
Public/Granted literature
Information query