Method and apparatus for motion compensated temporal interpolation of video sequences
    1.
    发明申请
    Method and apparatus for motion compensated temporal interpolation of video sequences 审中-公开
    视频序列的运动补偿时间插值的方法和装置

    公开(公告)号:US20050226330A1

    公开(公告)日:2005-10-13

    申请号:US10498953

    申请日:2002-12-16

    摘要: Method for encoding a digital video stream, comprising the steps of encoding a video sequence into a full frame sequence, forming a decimated frame sequence by removing a predetermined number of frames from the full frame sequence by means of temporal decimation, locally decoding the full frame sequence, locally decoding the decimated frame sequence, temporally interpolating the decoded decimated frame sequence by means of an interpolator, comparing the locally decoded frames of the full frame sequence with the corresponding frames of the locally interpolated frame sequence, determining residual information for a frame based on at least the comparison for that frame, and providing an output stream comprising the decimated frame sequence and the determined residual information.

    摘要翻译: 用于对数字视频流进行编码的方法,包括以下步骤:将视频序列编码为全帧序列,通过借助于时间抽取从全帧序列中去除预定数量的帧来形成抽取帧序列,本地解码全帧 序列,本地解码抽取的帧序列,通过内插器对经解码的抽取帧序列进行时间内插,将全帧序列的局部解码帧与局部内插帧序列的对应帧进行比较,确定基于帧的残差信息 至少对该帧进行比较,以及提供包括抽取帧序列和所确定的剩余信息的输出流。

    System and method for encoding and decoding enhancement layer data using descriptive model parameters
    3.
    发明申请
    System and method for encoding and decoding enhancement layer data using descriptive model parameters 失效
    使用描述性模型参数对增强层数据进行编码和解码的系统和方法

    公开(公告)号:US20060262846A1

    公开(公告)日:2006-11-23

    申请号:US10569126

    申请日:2004-08-25

    IPC分类号: H04N11/04

    摘要: There is provided an image encoding system (300, 400) including an encoder (300) for receiving input image data and generating corresponding encoded image output data. The encoder includes image processing features (310, 320, 330, 360) for processing said input image data to generate for each input image therein a plurality of corresponding image layers including at least one basic layer BLOP and at least one enhancement layer ELOP. Moreover, the encoder (300) further includes encoding features (350) for receiving said image layers and generating therefrom the encoded image output data. The encoding features further comprising block selecting features (340) for selecting one or more sub-regions of said at least one enhancement layer and modelling said one or more sub-regions for representation thereof in the image output data by way of descriptive model parameters.

    摘要翻译: 提供了一种包括用于接收输入图像数据并产生对应的编码图像输出数据的编码器(300)的图像编码系统(300,400)。 编码器包括用于处理所述输入图像数据以对其中的每个输入图像生成包括至少一个基本层BLOP和至少一个增强层ELOP的多个对应图像层的图像处理特征(310,320,330,360)。 此外,编码器(300)还包括用于接收所述图像层并由其生成编码图像输出数据的编码特征(350)。 所述编码特征进一步包括用于选择所述至少一个增强层的一个或多个子区域的块选择特征(340),并且通过描述性模型参数对所述一个或多个子区域进行建模以将其表示在所述图像输出数据中。

    Video encoding
    4.
    发明申请
    Video encoding 审中-公开
    视频编码

    公开(公告)号:US20060165163A1

    公开(公告)日:2006-07-27

    申请号:US10547324

    申请日:2004-02-25

    摘要: The invention relates to a video encoder (201) for encoding a video signal. The video encoder comprises a segmentation processor (207) which divides the picture into picture regions. Preferably, picture regions having a high degree of flatness or uniformity are determined in this way. A characteristics processor (209) determine a spatial frequency characteristic for each picture region, and a coding controller (211) selects an encoding block size, such as a prediction block size for motion estimation, in response to the spatial frequency characteristic. An encode processor (213) encodes the picture using the selected encoding block size. Specifically, increasing block sizes are selected for increasing degrees of uniformity or flatness indicated by the spatial frequency characteristic. Thereby, an increasing proportion of high frequency components and a consistent choice of encoding block sizes are maintained, and thus the coding artefacts from many encoders having variable prediction block sizes is reduced. The invention is particularly suitable for H.264 and similar encoders.

    摘要翻译: 本发明涉及用于对视频信号进行编码的视频编码器(201)。 视频编码器包括将图像划分成图像区域的分割处理器(207)。 优选地,以这种方式确定具有高度平坦度或均匀性的图像区域。 特征处理器(209)确定每个图像区域的空间频率特性,并且编码控制器(211)响应于空间频率特性来选择诸如用于运动估计的预测块大小的编码块大小。 编码处理器(213)使用所选择的编码块大小来对图像进行编码。 具体地,增加块尺寸被选择以增加由空间频率特性指示的均匀度或平坦度。 因此,保持了高频分量的不断增加的比例和编码块大小的一致选择,因此减少了具有可变预测块大小的许多编码器的编码伪影。 本发明特别适用于H.264和类似的编码器。

    Video coding
    5.
    发明申请
    Video coding 审中-公开
    视频编码

    公开(公告)号:US20060104357A1

    公开(公告)日:2006-05-18

    申请号:US10542836

    申请日:2004-01-19

    摘要: Coding of a video signal is provided according to a predefined standard, wherein in a given operation mode some of the tools provided by the predefined standard are disabled, and wherein an identification of the disabled tools is included in the bit-stream, the disabled tools being one or more out of the group of: bidirectional predictive coding of pictures or picture parts, use of a de-blocking filter, use of more than one reference picture.

    摘要翻译: 根据预定义的标准提供视频信号的编码,其中在给定的操作模式中,由预定标准提供的一些工具被禁用,并且其中禁用的工具的标识被包括在位流中,禁用的工具 是一组或多个:图像或图像部分的双向预测编码,使用去块滤波器,使用多个参考图像。

    CONTENT AUGMENTATION FOR PERSONAL RECORDINGS
    6.
    发明申请
    CONTENT AUGMENTATION FOR PERSONAL RECORDINGS 审中-公开
    个人记录的内容保证

    公开(公告)号:US20100185617A1

    公开(公告)日:2010-07-22

    申请号:US12376586

    申请日:2007-08-09

    IPC分类号: G06F17/30

    摘要: A content augmentation process for personal recordings involves a service center (SC). The service center (SC) collects personal recordings from various different users via a network so as to constitute a database (DB) of personal recordings. The service center (SC) identifies personal recordings within the database (DB) that concern a particular scene and that are mutually complementary so as to form a selection of personal recordings (FSRR) for content augmentation purposes. The service center (SC) applies a content augmentation process (AUGP) to the selection of personal recordings (FSRR) so as to obtain an enhanced representation (CA).

    摘要翻译: 用于个人录音的内容增加过程涉及服务中心(SC)。 服务中心(SC)通过网络从各种不同的用户收集个人记录,以构成个人记录的数据库(DB)。 服务中心(SC)识别数据库(DB)内的个人记录,其涉及特定场景并且是相互补充的,以便形成用于内容增加目的的个人记录(FSRR)的选择。 服务中心(SC)将内容增加处理(AUGP)应用于个人记录(FSRR)的选择,以获得增强表示(CA)。

    Time-scale modification of signals
    7.
    发明授权
    Time-scale modification of signals 失效
    时间尺度修改信号

    公开(公告)号:US07412379B2

    公开(公告)日:2008-08-12

    申请号:US10114505

    申请日:2002-04-02

    IPC分类号: G10L11/06

    CPC分类号: G10L21/04 G10L25/93

    摘要: Techniques utilising Time Scale Modification (TSM) of signals are described. The signal is analysed and divided into frames of similar signal types. Techniques specific to the signal type are then applied to the frames thereby optimising the modification process. The method of the present invention enables TSM of different audio signal parts to be realized using different methods, and a system for effecting said method is also described.

    摘要翻译: 描述了使用信号的时间尺度修正(TSM)的技术。 信号被分析并分成类似信号类型的帧。 然后将特定于信号类型的技术应用于帧,从而优化修改过程。 本发明的方法能够使用不同的方法实现不同的音频信号部分的TSM,并且还描述了用于实现所述方法的系统。

    System and method for encoding and decoding enhancement layer data using descriptive model parameters
    9.
    发明授权
    System and method for encoding and decoding enhancement layer data using descriptive model parameters 失效
    使用描述性模型参数对增强层数据进行编码和解码的系统和方法

    公开(公告)号:US07953156B2

    公开(公告)日:2011-05-31

    申请号:US10569126

    申请日:2004-08-25

    摘要: There is provided an image encoding system (300, 400) including an encoder (300) for receiving input image data and generating corresponding encoded image output data. The encoder includes image processing features (310, 320, 330, 360) for processing said input image data to generate for each input image therein a plurality of corresponding image layers including at least one basic layer BLOP and at least one enhancement layer ELOP. Moreover, the encoder (300) further includes encoding features (350) for receiving said image layers and generating therefrom the encoded image output data. The encoding features further comprising block selecting features (340) for selecting one or more sub-regions of said at least one enhancement layer and modelling said one or more sub-regions for representation thereof in the image output data by way of descriptive model parameters.

    摘要翻译: 提供了一种包括用于接收输入图像数据并产生对应的编码图像输出数据的编码器(300)的图像编码系统(300,400)。 编码器包括用于处理所述输入图像数据以对其中的每个输入图像生成包括至少一个基本层BLOP和至少一个增强层ELOP的多个对应图像层的图像处理特征(310,320,330,360)。 此外,编码器(300)还包括用于接收所述图像层并由其生成编码图像输出数据的编码特征(350)。 所述编码特征进一步包括用于选择所述至少一个增强层的一个或多个子区域的块选择特征(340),并且通过描述性模型参数对所述一个或多个子区域进行建模以将其表示在所述图像输出数据中。

    METHOD AND SYSTEM TO CONVERT 2D VIDEO INTO 3D VIDEO
    10.
    发明申请
    METHOD AND SYSTEM TO CONVERT 2D VIDEO INTO 3D VIDEO 失效
    将2D视频转换为3D视频的方法和系统

    公开(公告)号:US20100026784A1

    公开(公告)日:2010-02-04

    申请号:US12519378

    申请日:2007-12-14

    IPC分类号: H04N13/02

    CPC分类号: H04N13/261 H04N13/10

    摘要: 2D/3D video conversion using a method for providing an estimation of visual depth for a video sequence, the method comprises an audio scene classification (34) in which a visual depth categorization index of visual depth (37) of a scene is made on basis of an analysis of audio information (32) for the scene, wherein the visual depth categorization index (37) is used in a 5 following visual depth estimation (38) based on video information (33) for the same scene, thereby reducing the calculation load and speeding up the processing.

    摘要翻译: 使用用于提供视频序列的视觉深度估计的方法的2D / 3D视频转换,该方法包括音频场景分类(34),其中基于场景的视觉深度(37)的视觉深度分类指数 对所述场景的音频信息(32)进行分析,其中,基于相同场景的视频信息(33),在后续的视觉深度估计(38)中使用所述视觉深度分类指数(37),从而减少所述计算 加载和加速处理。