Bidirectionally predicted pictures or video object planes for efficient and flexible video coding
    11.
    发明授权
    Bidirectionally predicted pictures or video object planes for efficient and flexible video coding 有权
    双向预测的图像或视频对象平面,用于高效和灵活的视频编码

    公开(公告)号:US07463685B1

    公开(公告)日:2008-12-09

    申请号:US11212262

    申请日:2005-08-27

    IPC分类号: H04N7/12 G06K9/36

    摘要: A method is provided for decoding a bit stream representing an image that has been encoded The method includes the steps of: performing an entropy decoding of the bit stream to form a plurality of transform coefficents and a plurality of motion vectors; performing an inverse transformation on the plurality of transform coefficients to form a plurality of error blocks; determining a plurality of predicted blocks based on bidirectional motion estimation that employs the motion vectors, wherein the bidirectional motion estimation includes a direct prediction mode and a second prediction mode; and, adding the plurality of error blocks to the plurality of predicted blocks to form the image. The second prediction mode may include forward, backward, and interpolated prediction modes.

    摘要翻译: 提供了一种用于对表示已经被编码的图像的比特流进行解码的方法。该方法包括以下步骤:执行比特流的熵解码以形成多个变换系数和多个运动矢量; 对所述多个变换系数执行逆变换以形成多个误差块; 基于使用所述运动矢量的双向运动估计来确定多个预测块,其中所述双向运动估计包括直接预测模式和第二预测模式; 并且将多个误差块添加到多个预测块以形成图像。 第二预测模式可以包括前向,后向和内插的预测模式。

    Systems and methods for playing, browsing and interacting with MPEG-4 coded audio-visual objects
    12.
    发明授权
    Systems and methods for playing, browsing and interacting with MPEG-4 coded audio-visual objects 有权
    用于播放,浏览和与MPEG-4编码的视听对象交互的系统和方法

    公开(公告)号:US07281200B2

    公开(公告)日:2007-10-09

    申请号:US10662831

    申请日:2003-09-15

    IPC分类号: G06F15/00

    摘要: A number of novel configurations for MPEG-4 playback, browsing and user interaction are disclosed. MPEG-4 playback systems are not simple extensions of MPEG-2 playback systems, but, due to object based nature of MPEG-4, present new opportunities and challenges in synchronized management of independent coded objects as well as scene composition and presentation. Therefore, these configurations allow significantly new and enhanced multimedia services and systems. In addition, MPEG-4 aims for an advanced functionality, called Adaptive Audio Visual Session (AAVS) or MPEG-J. Adaptive Audio Visual Session (AAVS) (i.e., MPEG-AAVS, MPEG-Java or MPEG-J) requires, in addition to the definition of configurations, a definition of an application programming interface (API) and its organization into Java packages. Also disclosed are concepts leading to definition of such a framework.

    摘要翻译: 公开了许多用于MPEG-4播放,浏览和用户交互的新颖配置。 MPEG-4播放系统不是MPEG-2播放系统的简单扩展,但由于MPEG-4的基于对象的性质,在独立编码对象的同步管理以及场景组合和呈现方面呈现新的机遇和挑战。 因此,这些配置允许显着的新增和增强的多媒体服务和系统。 此外,MPEG-4旨在提供高级功能,称为自适应视听会议(AAVS)或MPEG-J。 自适应音频视频会话(AAVS)(即MPEG-AAVS,MPEG-Java或MPEG-J)除了对配置的定义外,还要求将应用程序编程接口(API)及其组织定义为Java包。 还公开了导致定义这种框架的概念。

    Systems and Methods for Playing, Browsing and Interacting with MPEG-4 Coded Audio-Visual Objects
    14.
    发明申请
    Systems and Methods for Playing, Browsing and Interacting with MPEG-4 Coded Audio-Visual Objects 有权
    用于播放,浏览和与MPEG-4编码的视听对象进行交互的系统和方法

    公开(公告)号:US20130013989A1

    公开(公告)日:2013-01-10

    申请号:US13616038

    申请日:2012-09-14

    IPC分类号: G06F17/00

    摘要: A number of novel configurations for MPEG-4 playback, browsing and user interaction are disclosed. MPEG-4 playback systems are not simple extensions of MPEG-2 playback systems, but, due to object based nature of MPEG-4, present new opportunities and challenges in synchronized management of independent coded objects as well as scene composition and presentation. Therefore, these configurations allow significantly new and enhanced multimedia services and systems. In addition, MPEG-4 aims for an advanced functionality, called Adaptive Audio Visual Session (AAVS) or MPEG-J. Adaptive Audio Visual Session (AAVS) (i.e., MPEG-AAVS, MPEG-Java or MPEG-J) requires, in addition to the definition of configurations, a definition of an application programming interface (API) and its organization into Java packages. Also disclosed are concepts leading to definition of such a framework.

    摘要翻译: 公开了许多用于MPEG-4播放,浏览和用户交互的新颖配置。 MPEG-4播放系统不是MPEG-2播放系统的简单扩展,但由于MPEG-4的基于对象的性质,在独立编码对象的同步管理以及场景组合和呈现方面呈现新的机遇和挑战。 因此,这些配置允许显着的新增和增强的多媒体服务和系统。 此外,MPEG-4旨在提供高级功能,称为自适应视听会议(AAVS)或MPEG-J。 自适应音频视频会话(AAVS)(即MPEG-AAVS,MPEG-Java或MPEG-J)除了对配置的定义外,还要求将应用程序编程接口(API)及其组织定义为Java包。 还公开了导致定义这种框架的概念。

    System, method and computer-readable medium for encoding a signal into macroblocks
    15.
    发明授权
    System, method and computer-readable medium for encoding a signal into macroblocks 有权
    用于将信号编码成宏块的系统,方法和计算机可读介质

    公开(公告)号:US08279940B2

    公开(公告)日:2012-10-02

    申请号:US12776106

    申请日:2010-05-07

    IPC分类号: H04N7/18

    摘要: A quantizer and dequantizer for use in a video coding system that applies non linear, piece-wise linear scaling functions to video information signals based on a value of a variable quantization parameter. The quantizer and dequantizer apply different non linear, piece-wise linear scaling functions to a DC luminance signal, a DC chrominance signal and an AC chrominance signal. A code for reporting updates of the value of the quantization parameter is interpreted to require larger changes when the quantization parameter initially is large and smaller changes when the quantization parameter initially is small.

    摘要翻译: 一种在视频编码系统中使用的量化器和去量化器,其基于可变量化参数的值将非线性,分段线性缩放函数应用于视频信息信号。 量化器和去量子化器对DC亮度信号,DC色度信号和AC色度信号应用不同的非线性,分段线性缩放函数。 用于报告量化参数的值的更新的代码被解释为当量化参数最初是大的时候需要较大的改变,而当量化参数最初是小时,这个代码变化较小。

    Method and apparatus to prioritize video information during coding and decoding
    16.
    发明授权
    Method and apparatus to prioritize video information during coding and decoding 有权
    在编码和解码期间对视频信息进行优先排序的方法和装置

    公开(公告)号:US07983499B2

    公开(公告)日:2011-07-19

    申请号:US12624003

    申请日:2009-11-23

    IPC分类号: G06K9/36 H04N7/12

    摘要: A method and apparatus prioritizing video information during coding and decoding. Video information is received and an element of the video information, such as a visual object, video object layer, video object plane or keyregion, is identified. A priority is assigned to the identified element and the video information is encoded into a bitstream, such as a visual bitstream encoded using the MPEG-4 standard, including an indication of the priority of the element. The priority information can then be used when decoding the bitstream to reconstruct the video information

    摘要翻译: 一种在编码和解码期间优先化视频信息的方法和装置。 接收视频信息,并且识别诸如视觉对象,视频对象层,视频对象平面或密钥区域之类的视频信息的元素。 优先级被分配给所识别的元素,并且视频信息被编码为比特流,诸如使用MPEG-4标准编码的可视比特流,包括元素的优先级的指示。 然后可以在对比特流进行解码以重构视频信息时使用优先级信息

    System, method and computer-readable medium for encoding a signal into macroblocks
    17.
    发明授权
    System, method and computer-readable medium for encoding a signal into macroblocks 有权
    用于将信号编码成宏块的系统,方法和计算机可读介质

    公开(公告)号:US07738555B2

    公开(公告)日:2010-06-15

    申请号:US11170622

    申请日:2005-06-29

    IPC分类号: H04N7/18

    摘要: A quantizer and dequantizer for use in a video coding system that applies non linear, piece-wise linear scaling functions to video information signals based on a value of a variable quantization parameter. The quantizer and dequantizer apply different non linear, piece-wise linear scaling functions to a DC luminance signal, a DC chrominance signal and an AC chrominance signal. A code for reporting updates of the value of the quantization parameter is interpreted to require larger changes when the quantization parameter initially is large and smaller changes when the quantization parameter initially is small.

    摘要翻译: 一种在视频编码系统中使用的量化器和去量化器,其基于可变量化参数的值将非线性,分段线性缩放函数应用于视频信息信号。 量化器和去量子化器对DC亮度信号,DC色度信号和AC色度信号应用不同的非线性,分段线性缩放函数。 用于报告量化参数的值的更新的代码被解释为当量化参数最初是大的时候需要较大的改变,而当量化参数最初是小时,这个代码变化较小。

    Bidirectionally predicted pictures or video object planes for efficient and flexible video coding
    18.
    发明授权
    Bidirectionally predicted pictures or video object planes for efficient and flexible video coding 有权
    双向预测的图像或视频对象平面,用于高效和灵活的视频编码

    公开(公告)号:US07545863B1

    公开(公告)日:2009-06-09

    申请号:US12030928

    申请日:2008-02-14

    IPC分类号: H04N7/12 G06K9/36

    摘要: A method is provided for decoding a bit stream representing an image that has been encoded The method includes the steps of: performing an entropy decoding of the bit stream to form a plurality of transform coefficients and a plurality of motion vectors; performing an inverse transformation on the plurality of transform coefficients to form a plurality of error blocks; determining a plurality of predicted blocks based on bidirectional motion estimation that employs the motion vectors, wherein the bidirectional motion estimation includes a direct prediction mode and a second prediction mode; and, adding the plurality of error blocks to the plurality of predicted blocks to form the image. The second prediction mode may include forward, backward, and interpolated prediction modes.

    摘要翻译: 提供一种解码表示已经被编码的图像的比特流的方法。该方法包括以下步骤:执行比特流的熵解码以形成多个变换系数和多个运动矢量; 对所述多个变换系数执行逆变换以形成多个误差块; 基于使用所述运动矢量的双向运动估计来确定多个预测块,其中所述双向运动估计包括直接预测模式和第二预测模式; 并且将多个误差块添加到多个预测块以形成图像。 第二预测模式可以包括前向,后向和内插的预测模式。

    Generalized scalability for video coder based on video objects

    公开(公告)号:US06993201B1

    公开(公告)日:2006-01-31

    申请号:US10761518

    申请日:2004-01-20

    IPC分类号: G06K9/36

    摘要: A video coding system that codes video objects as scalable video object layers. Data of each video object may be segregated into one or more layers. A base layer contains sufficient information to decode a basic representation of the video object. Enhancement layers contain supplementary data regarding the video object that, if decoded, enhance the basic representation obtained from the base layer. The present invention thus provides a coding scheme suitable for use with decoders of varying processing power. A simple decoder may decode only the base layer of video objects to obtain the basic representation. However, more powerful decoders may decode the base layer data of video objects and additional enhancement layer data to obtain improved decoded output. The coding scheme supports enhancement of both the spatial resolution and the temporal resolution of video objects.

    Scene description nodes to support improved chroma-key shape representation of coded arbitrary images and video objects
    20.
    发明授权
    Scene description nodes to support improved chroma-key shape representation of coded arbitrary images and video objects 失效
    场景描述节点支持改进的编码任意图像和视频对象的色键形状表示

    公开(公告)号:US06618444B1

    公开(公告)日:2003-09-09

    申请号:US09469170

    申请日:1999-12-21

    IPC分类号: H04B166

    摘要: A technique implicitly encodes shape information by using a chroma-key color. A frame including an object is received and a box bounding the object is created. The pixels in the bounding box outside the object are identified and replaced with a key color. The object is coded. A first bitstream is output that includes the coded data for the pixels in the bounding box, and a second bitstream is output that includes a node containing the key color and chroma-key thresholds for the object. In one embodiment, the node is a MaterialKey node. In a further embodiment, the node comprises a transparency field, an isKeyed field, an isRGB field, a keycolor field, a lowThreshold field, and a highThreshold field. A decoding system decodes the object and rectangular bounding box containing the object, and assigns a value signifying transparent to each pixel for which the difference between the color of each of said decoded pixels and the key color is below or equal to a low threshold, and assigns a value signifying opaque to each pixel for which said difference is greater than a high threshold.

    摘要翻译: 一种技术通过使用色键颜色隐含地编码形状信息。 接收到包含对象的帧,并创建一个边界对象的框。 识别对象外部边界框中的像素,并用键颜色替换。 对象被编码。 输出包括边界框中的像素的编码数据的第一比特流,并且输出第二比特流,其包括包含对象的键色和色键阈值的节点。 在一个实施例中,节点是MaterialKey节点。 在另一实施例中,节点包括透明度字段,isKeyed字段,isRGB字段,keycolor字段,lowThreshold字段和highThreshold字段。 解码系统对包含对象的对象和矩形边界框进行解码,并且分配对每个像素指示透明度的值,对于每个像素,每个像素的颜色与键颜色之间的差低于或等于低阈值,以及 为所述差大于高阈值的每个像素分配表示不透明度的值。