Bit rate control mechanism for digital image and video data compression
    1.
    Granted invention patent
    Bit rate control mechanism for digital image and video data compression (status: expired)

    Publication number: US5995151A

    Publication date: 1999-11-30

    Application number: US932681

    Filing date: 1997-09-18

    Abstract: A bit rate control mechanism for a digital image or video compression system estimates a complexity parameter for a current picture, or block of samples, of a video signal as a function of parameters for a prior picture of the video signal, which parameters include a bit rate. From the complexity parameter a quality factor for the current picture is determined and applied to a quantizer to compress the current picture. A complexity pre-processor may also be used to detect scene changes in the video signal prior to estimating the complexity parameter. If a scene change is detected, the rate control mechanism is reset prior to estimating the complexity parameter for the first picture in the new scene. In addition, a video buffer verifier is controlled so that the buffer occupancy at the end of a specified image sequence is at a target value, which facilitates looping and editing applications.
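
The buffer-occupancy goal in the last sentence of the abstract can be illustrated with a small sketch. The abstract does not give the control law, so the buffer model below (a fixed number of channel bits arrives per picture period, and each coded picture drains its own size) and the names `picture_budget` and `simulate` are assumptions made for illustration only.

```python
def picture_budget(occupancy, target_occupancy, channel_bits, pictures_left):
    """Bits to spend on each remaining picture so the buffer ends at the target.

    Assumed model: the buffer gains `channel_bits` per picture period and loses
    the coded size of each picture, so after the remaining pictures
    end = occupancy + pictures_left * channel_bits - total_coded_bits.
    """
    if pictures_left <= 0:
        raise ValueError("pictures_left must be positive")
    total = occupancy + pictures_left * channel_bits - target_occupancy
    return total / pictures_left


def simulate(start, target, channel_bits, overshoot):
    """Run the model; overshoot[i] scales the budget to mimic encoder error."""
    occupancy = start
    for i, factor in enumerate(overshoot):
        budget = picture_budget(occupancy, target, channel_bits, len(overshoot) - i)
        coded_bits = factor * budget  # what the (hypothetical) encoder produced
        occupancy += channel_bits - coded_bits
    return occupancy
```

Because the budget is recomputed before every picture, earlier overshoots and undershoots are absorbed by the remaining pictures; the sketch does not check for buffer overflow or underflow along the way.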

    Bit rate control mechanism for digital image and video data compression
    2.
    Granted invention patent
    Bit rate control mechanism for digital image and video data compression (status: expired)

    Publication number: US5686964A

    Publication date: 1997-11-11

    Application number: US566100

    Filing date: 1995-12-04

    Abstract: A bit rate control mechanism for a digital image or video compression system estimates a complexity parameter for a current picture, or block of samples, of a video signal as a function of parameters for a prior picture of the video signal, which parameters include a bit rate. From the complexity parameter a quality factor for the current picture is determined and applied to a quantizer to compress the current picture. A complexity pre-processor may also be used to detect scene changes in the video signal prior to estimating the complexity parameter. If a scene change is detected, the rate control mechanism is reset prior to estimating the complexity parameter for the first picture in the new scene.
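
A minimal sketch of the loop described in this abstract follows. The abstract does not state the estimation formula, so the sketch borrows the familiar convention of taking complexity as coded bits times quantizer scale from the prior picture; that choice, the clamping range, and every name in the code are assumptions rather than the patented method.

```python
from dataclasses import dataclass


@dataclass
class RateController:
    target_bits: float         # assumed constant bit budget per picture
    initial_complexity: float  # assumed starting guess, reused after a reset
    q_min: float = 1.0         # assumed quantizer scale limits
    q_max: float = 31.0

    def __post_init__(self):
        self.complexity = self.initial_complexity

    def reset(self):
        """Forget the prior picture, e.g. at a detected scene change."""
        self.complexity = self.initial_complexity

    def next_quality(self, scene_change=False):
        """Quality factor (quantizer scale) for the picture about to be coded."""
        if scene_change:
            self.reset()
        q = self.complexity / self.target_bits
        return min(self.q_max, max(self.q_min, q))

    def update(self, bits_used, q_used):
        """Re-estimate complexity from the picture that was just coded."""
        self.complexity = bits_used * q_used
```

A caller would alternate `next_quality()` and `update()` once per picture, passing `scene_change=True` whenever its own detector fires.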

    Coarse representation of visual object's shape for search/query/filtering applications
    3.
    Granted invention patent
    Coarse representation of visual object's shape for search/query/filtering applications (status: in force)

    Publication number: US07155033B1

    Publication date: 2006-12-26

    Application number: US09494514

    Filing date: 2000-02-01

    IPC classification: G06K9/00

    CPC classification: G06T9/20 Y10S707/99937

    Abstract: A method of coarse representation of a visual object's shape for search/query/filtering applications uses a bounding box that fully encompasses the object of interest within the image to extract a feature vector. Once the feature vector is available, matching based on specific queries may be performed using a search engine that compares the query number to an appropriate element of the feature vector and sorts the results to pick the best matches.
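
The extract-then-match flow can be sketched as follows. The abstract does not list the elements of the feature vector, so the two used here (aspect ratio and fill density of the box that encloses the object) are hypothetical stand-ins, as are the function names.

```python
def bounding_box(mask):
    """Smallest axis-aligned box (top, left, bottom, right) enclosing the object."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for row in mask for c, v in enumerate(row) if v]
    if not rows:
        raise ValueError("mask contains no object pixels")
    return min(rows), min(cols), max(rows), max(cols)


def feature_vector(mask):
    """Coarse shape features derived from the enclosing box (assumed set)."""
    top, left, bottom, right = bounding_box(mask)
    height = bottom - top + 1
    width = right - left + 1
    area = sum(1 for row in mask for v in row if v)
    return {"aspect_ratio": width / height, "density": area / (width * height)}


def best_matches(database, element, query, k=3):
    """Names of the k entries whose chosen feature element is closest to query."""
    ranked = sorted(database, key=lambda name: abs(database[name][element] - query))
    return ranked[:k]
```

Here `mask` is a binary image given as a list of rows, and `database` maps an object name to its stored feature vector.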

    Method a group of picture structure in MPEG video
    4.
    Granted invention patent
    Method a group of picture structure in MPEG video (status: expired)

    Publication number: US06243495B1

    Publication date: 2001-06-05

    Application number: US09023805

    Filing date: 1998-02-13

    IPC classification: G06K9/36

    Abstract: A method of modifying a group of pictures (GOP) structure in an MPEG video signal from a low-delay mode bitstream having I and P pictures to a non-low-delay bitstream having I, P and B pictures uses the motion vectors from the low-delay mode bitstream to derive the motion vectors for the non-low-delay mode bitstream. Motion vectors for anchor pictures for the non-low-delay mode bitstream are converted from the motion vectors for the corresponding pictures in the low-delay mode bitstream. Motion vectors for the B pictures in the non-low-delay mode bitstream are converted from the motion vectors for the corresponding P pictures in the low-delay mode bitstream. The converted motion vectors for the non-low-delay mode bitstream are used in recoding an uncompressed video signal derived from the low-delay mode bitstream to produce the non-low-delay mode bitstream.
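
The abstract does not give the conversion rule, so the sketch below shows only one plausible reading: chain the picture-to-previous-picture vectors of the low-delay stream to reach the new reference picture. It uses a single vector per picture instead of per-macroblock vectors, ignores the fact that chained blocks shift position, and all names are hypothetical.

```python
def chain(ld_mvs, first, last):
    """Sum the low-delay vectors of pictures first..last (inclusive)."""
    dx = sum(ld_mvs[i][0] for i in range(first, last + 1))
    dy = sum(ld_mvs[i][1] for i in range(first, last + 1))
    return (dx, dy)


def convert_gop(ld_mvs, anchor_spacing=3):
    """Derive I/B/P vectors from an I,P,P,... low-delay sequence.

    ld_mvs[i] is the displacement from picture i into picture i-1;
    ld_mvs[0] is None because picture 0 is the I picture.
    """
    if (len(ld_mvs) - 1) % anchor_spacing != 0:
        raise ValueError("sequence must end on an anchor picture")
    out = {0: {"type": "I"}}
    for n in range(1, len(ld_mvs)):
        prev_anchor = ((n - 1) // anchor_spacing) * anchor_spacing
        forward = chain(ld_mvs, prev_anchor + 1, n)
        if n % anchor_spacing == 0:
            out[n] = {"type": "P", "forward": forward}
        else:
            next_anchor = prev_anchor + anchor_spacing
            to_next = chain(ld_mvs, n + 1, next_anchor)
            # A backward vector points into a future picture, hence the sign flip.
            backward = (-to_next[0], -to_next[1])
            out[n] = {"type": "B", "forward": forward, "backward": backward}
    return out
```

In a real transcoder such derived vectors would more likely serve as starting points for a small refinement search than as final values.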

    Process to extract regions of homogeneous color in a digital picture
    5.
    Granted invention patent
    Process to extract regions of homogeneous color in a digital picture (status: expired)

    Publication number: US07016531B1

    Publication date: 2006-03-21

    Application number: US09496068

    Filing date: 2000-02-01

    IPC classification: G06K9/34

    Abstract: A method of extracting regions of homogeneous color from a digital picture divides the digital picture into blocks and generates a feature vector for each block as a set of moments of the data for the block. The distances between the feature vector of each block and the feature vectors of the nearest neighboring blocks are determined using either a weighted Euclidean distance metric or a probability mass function-based distance metric. The maximum distance is the gradient value for the block, and the set of gradient values over all the blocks forms a color gradient field. The gradient field is digitized and smoothed, and then segmented into regions of similar color characteristics using a watershed algorithm.
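
The gradient-field step can be sketched as below, using the weighted Euclidean option. The sketch assumes the per-block feature vectors are already computed, takes "nearest neighboring blocks" to mean the four edge-adjacent blocks, and stops before the digitizing, smoothing and watershed stages; these choices and the function names are assumptions.

```python
import math


def weighted_euclidean(a, b, weights):
    """Weighted Euclidean distance between two feature vectors."""
    return math.sqrt(sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b)))


def gradient_field(features, weights):
    """Per-block gradient: the largest distance to any edge-adjacent block.

    `features` is a 2-D grid (list of rows) of per-block feature vectors.
    """
    rows, cols = len(features), len(features[0])
    field = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            distances = [
                weighted_euclidean(features[r][c], features[r + dr][c + dc], weights)
                for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1))
                if 0 <= r + dr < rows and 0 <= c + dc < cols
            ]
            field[r][c] = max(distances, default=0.0)
    return field
```

Blocks on a color boundary receive high values on both sides of the edge, which is what lets a later watershed pass treat the boundary as a ridge.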

    Histogram-based segmentation of objects from a video signal via color moments
    6.
    Granted invention patent
    Histogram-based segmentation of objects from a video signal via color moments (status: in force)

    Publication number: US06526169B1

    Publication date: 2003-02-25

    Application number: US09270233

    Filing date: 1999-03-15

    IPC classification: G06K9/00

    Abstract: A histogram-based segmentation of an image, frame or picture of a video signal into objects via color moments is initiated by defining a relatively large area within the object. The defined area is characterized by its color information in the form of a limited set of color moments representing a color histogram for the area. Based upon the set of color moments, color moments generated for small candidate blocks within the image, an automatically generated weighting vector, distance measures for the blocks from a central block in the object, and a tolerance, the area is grown to encompass the object to the extent of its boundaries. The initial set of color moments is then updated for the entire object. Those candidate blocks within the object serve to segment the object from the image.
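
A simplified sketch of growing an area from color moments is given below. It uses the mean and the second and third central moments of one channel as the "limited set", a weighted absolute difference as the distance, and a breadth-first search over edge-adjacent blocks. The abstract's spatial distance from a central block and its moment update step are omitted, and all of these choices and names are assumptions.

```python
from collections import deque


def color_moments(pixels):
    """Mean, second and third central moments of one color channel."""
    n = len(pixels)
    mean = sum(pixels) / n
    second = sum((p - mean) ** 2 for p in pixels) / n
    third = sum((p - mean) ** 3 for p in pixels) / n
    return (mean, second, third)


def grow_region(block_moments, seed, reference, weights, tolerance):
    """Collect blocks connected to `seed` whose moments stay near `reference`."""
    rows, cols = len(block_moments), len(block_moments[0])

    def close(r, c):
        m = block_moments[r][c]
        dist = sum(w * abs(x - y) for w, x, y in zip(weights, m, reference))
        return dist <= tolerance

    region, queue = set(), deque([seed])
    while queue:
        r, c = queue.popleft()
        if (r, c) in region or not close(r, c):
            continue
        region.add((r, c))
        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            if 0 <= r + dr < rows and 0 <= c + dc < cols:
                queue.append((r + dr, c + dc))
    return region
```

The returned set of block coordinates is the segmented object in this simplified model.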

    2-D extended image generation from 3-D data extracted from a video sequence
    7.
    Granted invention patent
    2-D extended image generation from 3-D data extracted from a video sequence (status: expired)

    Publication number: US06504569B1

    Publication date: 2003-01-07

    Application number: US09064889

    Filing date: 1998-04-22

    IPC classification: H04N13/00

    Abstract: A method of generating a 2-D extended image from a video sequence representing a natural 3-D scene first determines, from the video sequence, motion parameters for the camera that recorded the scene with respect to a background object, using a structure-from-motion algorithm. The motion parameters include a rotation matrix, a translation vector and a depth map representing the depth of each point in the background object from the camera. Next, from the motion parameters and depth map, the 2-D extended image is generated for the background object as a composition of the images from the video sequence using a plane perspective projection technique. The background object may be layered as a function of depth and flatness criteria to form a set of layered 2-D extended images for the background object from the video sequence.
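
A plane perspective projection maps image points through a 3x3 matrix followed by a perspective divide. The sketch below applies such a matrix to a point and computes how large the composed image must be to hold every warped frame. How each matrix is derived from the rotation, translation and depth data is not covered, and the function names are assumptions.

```python
def project(h, x, y):
    """Map (x, y) through the 3x3 plane perspective matrix h."""
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    if w == 0:
        raise ZeroDivisionError("point maps to infinity")
    u = (h[0][0] * x + h[0][1] * y + h[0][2]) / w
    v = (h[1][0] * x + h[1][1] * y + h[1][2]) / w
    return (u, v)


def extended_bounds(matrices, width, height):
    """Bounding rectangle (min_x, min_y, max_x, max_y) of all warped frames."""
    corners = [(0, 0), (width, 0), (0, height), (width, height)]
    points = [project(h, x, y) for h in matrices for x, y in corners]
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (min(xs), min(ys), max(xs), max(ys))
```

With one matrix per frame, `extended_bounds` gives the canvas onto which the frames would be composited.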

    Histogram-based segmentation of images and video via color moments
    8.
    Granted invention patent
    Histogram-based segmentation of images and video via color moments (status: in force)

    Publication number: US06381363B1

    Publication date: 2002-04-30

    Application number: US09326164

    Filing date: 1999-06-04

    IPC classification: G06K9/00

    Abstract: A histogram-based segmentation of images in a video signal via color moments is initialized by a user defining regions in objects of interest from one or more images, key frames or pictures of the video signal. For each rectangle a normalized average color moment and associated co-variance matrix are determined, which define a color class for that rectangle. From the normalized average color moment and associated co-variance matrix, garbage parameters are generated. Segmentation is then performed on a block basis on each image of the video sequence, a normalized color moment being generated for each block. Using a log likelihood test, the closest color class for the block is determined. Based upon the closest color class and the garbage parameters for that color class, a final determination is made in a two stage test as to whether the block belongs to the closest class or to a “garbage” class. All the contiguous blocks that belong to a specific color class form the segmented object, and all of the objects are segmented in this manner.
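
The classify-then-reject idea can be sketched with Gaussian color classes. To stay short, the sketch assumes diagonal covariance (one variance per moment) and collapses the two stage garbage test into a single per-class log-likelihood floor; both simplifications, and the names used, are assumptions rather than the procedure in the patent.

```python
import math


def log_likelihood(x, mean, variances):
    """Log density of x under a Gaussian with diagonal covariance."""
    return -0.5 * sum(
        math.log(2 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, variances)
    )


def classify(x, classes, garbage_floor):
    """Closest color class for block moments x, or "garbage" if too unlikely.

    `classes` maps a class name to (mean, variances); `garbage_floor` maps a
    class name to the lowest log likelihood still accepted for that class.
    """
    best = max(classes, key=lambda name: log_likelihood(x, *classes[name]))
    if log_likelihood(x, *classes[best]) < garbage_floor[best]:
        return "garbage"
    return best
```

Running `classify` on every block and then grouping adjacent blocks with the same label would give the segmented objects.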

    Kalman tracking of color objects
    9.
    Granted invention patent
    Kalman tracking of color objects (status: expired)

    Publication number: US06917692B1

    Publication date: 2005-07-12

    Application number: US09318682

    Filing date: 1999-05-25

    IPC classification: G06K9/00 G06T7/20

    CPC classification: G06K9/32 G06T7/277

    Abstract: A semi-automatic method of tracking color objects in a video image sequence starts by separating the objects on the basis of color and identifying an object of interest to track. A Kalman predictive algorithm is used to predict the position of the centroid of the object of interest through successive frames. From the predicted position the actual centroid is measured, and the position and velocity are smoothed using a Kalman filter. Error recovery is provided in the event the centroid falls outside the field of view or falls into an area of a different color, or in the event the tracking algorithm breaks down.
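
The predict, measure, smooth cycle can be sketched with a constant-velocity Kalman filter on one centroid coordinate (a tracker would run one filter per axis). The motion model, the noise values and the class name are assumptions, and the error-recovery cases in the abstract are not modeled.

```python
class Kalman1D:
    """Constant-velocity Kalman filter for one centroid coordinate."""

    def __init__(self, pos, vel=0.0, p=1.0, q=0.01, r=1.0):
        self.x = [pos, vel]              # state: position and velocity
        self.P = [[p, 0.0], [0.0, p]]    # state covariance
        self.q = q                       # assumed process noise (added to diagonal)
        self.r = r                       # assumed measurement noise variance

    def predict(self, dt=1.0):
        """Advance one frame and return the predicted position."""
        pos, vel = self.x
        self.x = [pos + dt * vel, vel]
        (a, b), (c, d) = self.P
        # P = F P F^T + Q with F = [[1, dt], [0, 1]]
        self.P = [
            [a + dt * (b + c) + dt * dt * d + self.q, b + dt * d],
            [c + dt * d, d + self.q],
        ]
        return self.x[0]

    def update(self, measured_pos):
        """Blend the measured centroid into the state (the smoothing step)."""
        (a, b), (c, d) = self.P
        s = a + self.r                   # innovation variance
        k0, k1 = a / s, c / s            # Kalman gain for position and velocity
        y = measured_pos - self.x[0]     # innovation
        self.x = [self.x[0] + k0 * y, self.x[1] + k1 * y]
        # P = (I - K H) P with H = [1, 0]
        self.P = [[(1 - k0) * a, (1 - k0) * b], [c - k1 * a, d - k1 * b]]
```

Per frame, the caller would use the value returned by `predict()` to decide where to look for the object, measure the actual centroid there, and pass it to `update()`.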
