-
公开(公告)号:US20070160128A1
公开(公告)日:2007-07-12
申请号:US11538421
申请日:2006-10-03
IPC分类号: H04B1/66
CPC分类号: H04N5/147 , H04N19/107 , H04N19/114 , H04N19/14 , H04N19/142 , H04N19/159 , H04N19/177 , H04N19/179 , H04N19/51 , H04N19/577 , H04N19/61 , H04N19/87
摘要: The invention comprises devices and methods for processing multimedia data. Such methods can include obtaining at least one metric indicative of a difference for a selected frame and adjacent frames in a plurality of video frames, the metric comprising bi-directional motion information and luminance difference information, determining a shot event in the selected frame based on the at least one metric, and adaptively encoding the selected frame based on the shot event. An apparatus for processing a multimedia data can include a motion compensator configured to obtain metrics indicative of a difference between adjacent frames of a plurality of video frames, said metrics comprising bi-directional motion information and luminance information, a shot classifier configured to determine a shot event in the plurality of video frames based on said metrics, and an encoder configured to adaptively encode the plurality of frames based on the shot event.
摘要翻译: 本发明包括用于处理多媒体数据的设备和方法。 这样的方法可以包括获得指示多个视频帧中的所选帧和相邻帧的差异的至少一个度量,该度量包括双向运动信息和亮度差异信息,基于所选择的帧中的拍摄事件确定 所述至少一个度量,并且基于所述拍摄事件自适应地编码所选择的帧。 用于处理多媒体数据的装置可以包括:运动补偿器,被配置为获得指示多个视频帧的相邻帧之间的差异的度量,所述度量包括双向运动信息和亮度信息;镜头分类器,被配置为确定镜头 事件,以及编码器,被配置为基于所述拍摄事件自适应地编码所述多个帧。
-
公开(公告)号:US20070074117A1
公开(公告)日:2007-03-29
申请号:US11501969
申请日:2006-08-09
申请人: Tao Tian , Fang Shi , Vijayalakshmi Raveendran
发明人: Tao Tian , Fang Shi , Vijayalakshmi Raveendran
IPC分类号: G11B27/00
CPC分类号: H04N5/262 , H04N19/103 , H04N19/142 , H04N19/179 , H04N19/46 , H04N19/61
摘要: This disclosure is directed to techniques for encoding and decoding transitional effects, i.e., visual video effects that are used to transition from a current scene of a multimedia sequence. According to the disclosed techniques, an encoding device detects a transitional effect associated with a multimedia sequence during the encoding of the multimedia sequence, and transmits information as part of an encoded multimedia sequence to identify the transitional effect associated with the encoded multimedia sequence to a decoder. The information may comprise metadata that can be used by the decoder to simulate or re-create the transitional effect. The decoder simulates a transitional effect in response to the information.
摘要翻译: 本公开涉及用于编码和解码过渡效果的技术,即用于从多媒体序列的当前场景转换的视觉效果。 根据所公开的技术,编码装置在多媒体序列的编码期间检测与多媒体序列相关联的过渡效应,并且作为编码的多媒体序列的一部分发送信息,以将与编码的多媒体序列相关联的过渡效果识别到解码器 。 信息可以包括可由解码器使用以模拟或重新创建过渡效果的元数据。 解码器模拟响应信息的过渡效应。
-
公开(公告)号:US20060227870A1
公开(公告)日:2006-10-12
申请号:US11373778
申请日:2006-03-09
申请人: Tao Tian , Vijayalakshmi Raveendran
发明人: Tao Tian , Vijayalakshmi Raveendran
CPC分类号: G06T9/00 , G06T9/005 , H04N19/00 , H04N19/115 , H04N19/124 , H04N19/126 , H04N19/137 , H04N19/146 , H04N19/152 , H04N19/172 , H04N19/176 , H04N19/192 , H04N19/20 , H04N19/60 , H04N19/61 , H04N21/6373 , H04N21/6377 , H04N21/658
摘要: Methods and apparatus encode video at a targeted bit rate and yet permit variation of a Quantization Parameter (QP) to encode video of varying complexity with relatively consistent visual quality. Constant bit rate (CBR) encoding is desirable in many applications, such as in transmission or broadcasting environments. However, conventional CBR techniques compromise visual quality. Disclosed techniques permit adaptive variation in a QP value and provide the improved visual encoding available in variable bit rate (VBR) schemes while maintaining enough adherence to a targeted bit rate to be applicable to CBR environments.
摘要翻译: 方法和装置以目标比特率编码视频,并且允许量化参数(QP)的变化以相对一致的视觉质量来编码具有不同复杂度的视频。 在许多应用中,例如在传输或广播环境中,恒定比特率(CBR)编码是期望的。 然而,传统的CBR技术会影响视觉质量。 公开的技术允许QP值中的自适应变化,并提供可变比特率(VBR)方案中可用的改进的可视编码,同时保持足够的对目标比特率的依从性以适用于CBR环境。
-
公开(公告)号:US20070071398A1
公开(公告)日:2007-03-29
申请号:US11527305
申请日:2006-09-25
IPC分类号: H04N5/91
CPC分类号: H04N7/17336 , H04N19/107 , H04N19/114 , H04N19/132 , H04N19/147 , H04N19/17 , H04N19/174 , H04N19/176 , H04N19/194 , H04N19/577 , H04N19/61 , H04N21/23424 , H04N21/2343 , H04N21/4383 , H04N21/44016
摘要: A method of processing a sequence of frames of multimedia data is presented. The method provides for progressively refreshing the image data. The method includes dynamically selecting portions of frames of the sequence with progressively increasing area to refresh, and excluding non-refreshed areas as potential reference data for other frames.
摘要翻译: 提出了一种处理多媒体数据帧序列的方法。 该方法提供逐渐刷新图像数据。 该方法包括动态地选择具有逐渐增加的区域来刷新的序列的帧的部分,并且将非刷新区域排除为其他帧的潜在参考数据。
-
公开(公告)号:US20070171972A1
公开(公告)日:2007-07-26
申请号:US11538023
申请日:2006-10-02
申请人: Tao Tian , Vijayalakshmi Raveendran
发明人: Tao Tian , Vijayalakshmi Raveendran
CPC分类号: H04N19/00139 , H04N7/0112 , H04N7/012 , H04N19/103 , H04N19/107 , H04N19/109 , H04N19/114 , H04N19/132 , H04N19/136 , H04N19/137 , H04N19/142 , H04N19/172 , H04N19/176 , H04N19/61 , H04N19/87
摘要: This system adaptively assigns picture types used for temporal compression to frames of streaming video at the input. Based on threshold testing of two metrics that are measures of distance between the frames at the input, a frame may be assigned to be compressed as an I, P, or B frame or be skipped over by the system without being coded at all.
摘要翻译: 该系统在输入端自适应地将用于时间压缩的图像类型分配给流视频帧。 基于作为输入之间的帧之间的距离的度量的两个度量的阈值测试,可以将帧分配为被压缩为I,P或B帧,或者被系统跳过而不被完全编码。
-
公开(公告)号:US20070071105A1
公开(公告)日:2007-03-29
申请号:US11509214
申请日:2006-08-23
申请人: Tao Tian , Vijayalakshmi Raveendran
发明人: Tao Tian , Vijayalakshmi Raveendran
CPC分类号: H04N19/192 , H04N19/109 , H04N19/147 , H04N19/176 , H04N19/196 , H04N19/61
摘要: This disclosure describes techniques for improving mode selection decisions during the encoding of macroblocks (or other blocks) of multimedia frames of a multimedia sequence. During motion estimation, the encoding modes for macroblocks can be determined so that a desirable encoding rate and acceptable levels of distortion (i.e., acceptable rate-distortion) can be achieved. The techniques may include selecting a set of multimedia coding modes between at least two sets of possible multimedia coding modes for a macroblock of a multimedia frame based on a detail metric associated with the macroblock and mode information associated with neighboring blocks to the macroblock.
摘要翻译: 本公开描述了在多媒体序列的多媒体帧的宏块(或其他块)的编码期间改进模式选择决定的技术。 在运动估计期间,可以确定宏块的编码模式,使得可以实现期望的编码速率和可接受的失真水平(即,可接受的速率失真)。 这些技术可以包括基于与宏块相关联的细节度量和与宏块的相邻块相关联的模式信息,在多媒体帧的宏块的至少两组可能的多媒体编码模式之间选择一组多媒体编码模式。
-
17.
公开(公告)号:US08879635B2
公开(公告)日:2014-11-04
申请号:US11528139
申请日:2006-09-26
申请人: Vijayalakshmi Rajasundaram Raveendran , Gordon Kent Walker , Tao Tian , Phanikumar Kanakadurga Bhamidipati , Fang Shi , Peisong Chen , Sitaraman Ganapathy Subramania , Seyfullah Halit Oguz
发明人: Vijayalakshmi Rajasundaram Raveendran , Gordon Kent Walker , Tao Tian , Phanikumar Kanakadurga Bhamidipati , Fang Shi , Peisong Chen , Sitaraman Ganapathy Subramania , Seyfullah Halit Oguz
IPC分类号: H04N7/12 , H04N11/02 , H04N11/04 , H04N19/61 , H04N19/142 , H04N19/14 , H04N19/137 , H04N21/2389 , H04N19/194 , H04N19/115 , H04N5/21 , H04N19/18 , H04N19/65 , H04N21/235 , H04N19/577 , H04N19/87 , H04N19/89 , H04N19/114 , H04N19/86 , H04N19/40 , H04N19/149 , H04N19/187 , H04N19/154 , H04N19/107 , H04N5/14 , H04N21/2343 , H04N19/132 , H04N19/159 , H04N19/30 , H04N19/36 , H04N19/172 , H04N19/147 , H04N7/01
CPC分类号: H04N19/40 , H04N5/144 , H04N5/147 , H04N5/21 , H04N7/0115 , H04N7/012 , H04N19/107 , H04N19/114 , H04N19/115 , H04N19/132 , H04N19/137 , H04N19/14 , H04N19/142 , H04N19/147 , H04N19/149 , H04N19/154 , H04N19/159 , H04N19/172 , H04N19/18 , H04N19/187 , H04N19/194 , H04N19/30 , H04N19/36 , H04N19/577 , H04N19/61 , H04N19/65 , H04N19/86 , H04N19/87 , H04N19/89 , H04N21/234309 , H04N21/2353 , H04N21/2389
摘要: Apparatus and methods of using content information for encoding multimedia data are described. A method of processing multimedia data includes obtaining content information of multimedia data, and encoding the multimedia data so as to align a data boundary with a frame boundary in a time domain, wherein said encoding is based on the content information. In another aspect, a method of processing multimedia data includes obtaining a content classification of the multimedia data, and encoding blocks in the multimedia data as intra-coded blocks or inter-coded blocks based on the content classification to increase the error resilience of the encoded multimedia data. Apparatus that can process multimedia data described in these methods are also disclosed.
摘要翻译: 描述了使用内容信息来编码多媒体数据的装置和方法。 一种处理多媒体数据的方法包括:获取多媒体数据的内容信息,并对多媒体数据进行编码,以使数据边界与时域中的边界对齐,其中所述编码基于内容信息。 在另一方面,一种处理多媒体数据的方法包括获取多媒体数据的内容分类,以及基于内容分类将多媒体数据中的块编码为帧内编码块或帧间编码块,以增加已编码的多媒体数据的错误弹性 多媒体资料 还公开了可以处理这些方法中描述的多媒体数据的装置。
-
公开(公告)号:US20070230564A1
公开(公告)日:2007-10-04
申请号:US11562360
申请日:2006-11-21
申请人: Peisong Chen , Tao Tian , Fang Shi , Vijayalakshmi R. Raveendran
发明人: Peisong Chen , Tao Tian , Fang Shi , Vijayalakshmi R. Raveendran
IPC分类号: H04N11/02
CPC分类号: H04N21/434 , H04N19/29 , H04N19/31 , H04N19/70 , H04N21/234327 , H04N21/2662
摘要: In general, this disclosure describes video processing techniques that make use of syntax elements and semantics to support low complexity extensions for multimedia processing with video scalability. The syntax elements and semantics may be added to network abstraction layer (NAL) units and may be especially applicable to multimedia broadcasting, and define a bitstream format and encoding process that support low complexity video scalability. In some aspects, the techniques may be applied to implement low complexity video scalability extensions for devices that otherwise conform to the H.264 standard. For example, the syntax element and semantics may be applicable to NAL units conforming to the H.264 standard.
摘要翻译: 一般来说,本公开描述了利用语法元素和语义来支持具有视频可扩展性的多媒体处理的低复杂度扩展的视频处理技术。 语法元素和语义可以被添加到网络抽象层(NAL)单元,并且可以特别适用于多媒体广播,并且定义支持低复杂度视频可扩展性的比特流格式和编码过程。 在一些方面,可以将这些技术应用于为符合H.264标准的设备实现低复杂度视频可扩展性扩展。 例如,语法元素和语义可以适用于符合H.264标准的NAL单元。
-
公开(公告)号:US09088776B2
公开(公告)日:2015-07-21
申请号:US12541780
申请日:2009-08-14
申请人: Vijayalakshmi R. Raveendran , Gordon Kent Walker , Tao Tian , Phanikumar Bhamidipati , Fang Shi , Peisong Chen , Sitaraman Ganapathy Subramanian , Seyfullah Halit Oguz
发明人: Vijayalakshmi R. Raveendran , Gordon Kent Walker , Tao Tian , Phanikumar Bhamidipati , Fang Shi , Peisong Chen , Sitaraman Ganapathy Subramanian , Seyfullah Halit Oguz
IPC分类号: H04J3/00 , H04N5/14 , H04N5/21 , H04N19/40 , H04N19/107 , H04N19/114 , H04N19/115 , H04N19/132 , H04N19/137 , H04N19/14 , H04N19/142 , H04N19/147 , H04N19/149 , H04N19/154 , H04N19/159 , H04N19/172 , H04N19/18 , H04N19/187 , H04N19/194 , H04N19/30 , H04N19/36 , H04N19/577 , H04N19/61 , H04N19/65 , H04N19/86 , H04N19/87 , H04N19/89 , H04N21/2343 , H04N21/235 , H04N21/2389 , H04N7/01
CPC分类号: H04N19/40 , H04N5/144 , H04N5/147 , H04N5/21 , H04N7/0115 , H04N7/012 , H04N19/107 , H04N19/114 , H04N19/115 , H04N19/132 , H04N19/137 , H04N19/14 , H04N19/142 , H04N19/147 , H04N19/149 , H04N19/154 , H04N19/159 , H04N19/172 , H04N19/18 , H04N19/187 , H04N19/194 , H04N19/30 , H04N19/36 , H04N19/577 , H04N19/61 , H04N19/65 , H04N19/86 , H04N19/87 , H04N19/89 , H04N21/234309 , H04N21/2353 , H04N21/2389
摘要: Apparatus and methods of using content information for encoding multimedia data are described. A method of processing multimedia data includes classifying content of multimedia data, and encoding the multimedia data in a first data group and in a second data group based on the content classification. The first and second groups are associated with quality levels. A user can request a target quality level.
摘要翻译: 描述了使用内容信息来编码多媒体数据的装置和方法。 一种处理多媒体数据的方法包括:根据内容分类,对多媒体数据的内容进行分类,并将第一数据组和第二数据组中的多媒体数据进行编码。 第一组和第二组与质量水平相关。 用户可以请求目标质量等级。
-
公开(公告)号:US07974341B2
公开(公告)日:2011-07-05
申请号:US11416858
申请日:2006-05-02
IPC分类号: H04N7/18
CPC分类号: H04N19/31 , H04N19/115 , H04N19/124 , H04N19/15 , H04N19/177 , H04N19/194
摘要: Methods and apparatus for efficient encoding multimedia data, such as live video streams are disclosed. The multimedia data is pre-encoded into multiple layers and characteristics of the pre-encoded data are determined. Based at least in part on the determined characteristics, the multimedia data is encoded into multiple layers.
摘要翻译: 公开了用于高效编码多媒体数据(例如直播视频流)的方法和装置。 多媒体数据被预编码成多层,并且确定预编码数据的特性。 至少部分地基于所确定的特征,将多媒体数据编码成多层。
-
-
-
-
-
-
-
-
-