Face recognition in video content
    11.
    发明授权
    Face recognition in video content 有权
    视频内容中的人脸识别

    公开(公告)号:US08494231B2

    公开(公告)日:2013-07-23

    申请号:US12916895

    申请日:2010-11-01

    IPC分类号: G06K9/00

    摘要: The subject disclosure relates to face recognition in video. Face detection data in frames of input data are used to generate face galleries, which are labeled and used in recognizing faces throughout the video. Metadata that associates the video frame and the face are generated and maintained for subsequent identification. Faces other than those found by face detection may be found by face tracking, in which facial landmarks found by the face detection are used to track a face over previous and/or subsequent video frames. Once generated, the maintained metadata may be accessed to efficiently determine the identity of a person corresponding to a viewer-selected face.

    摘要翻译: 本公开涉及视频中的面部识别。 输入数据帧中的脸部检测数据用于生成面部画廊,这些画廊被标记并用于识别整个视频中的脸部。 生成并维护与视频帧和脸部相关联的元数据,以便后续识别。 面部检测以外的脸部可以通过脸部跟踪来发现,其中通过面部检测发现的面部地标用于跟踪先前和/或后续视频帧的面部。 一旦生成,可以访问维护的元数据以有效地确定与观众选择的面对应的人的身份。

    Filtering and dithering as pre-processing before encoding
    12.
    发明申请
    Filtering and dithering as pre-processing before encoding 有权
    过滤和抖动作为编码前的预处理

    公开(公告)号:US20090180555A1

    公开(公告)日:2009-07-16

    申请号:US12008660

    申请日:2008-01-10

    IPC分类号: H04N7/12

    摘要: Techniques and tools are presented for controlling artifacts such as banding artifacts, ringing artifacts and film scan artifacts in video. For example, before encoding, a pre-processor performs combined filtering and dithering on video such that the weight of dithering at a location depends on the results of filtering at the location. For the combined filtering and dithering, the pre-processor can determine a lowpass signal and highpass residual, weight dithering based on local characteristics of the highpass residual, and then combine the lowpass signal with the weighted dithering. Or, to determine the relative weight, the pre-processor can use a filter whose normalization factor varies depending on how many sample values around a location are within a threshold of similarity to a current sample value at the location. The filtering and dithering can use different strengths for luma and chroma channels.

    摘要翻译: 提出了技术和工具,用于控制视频中的伪影,例如条纹伪像,振铃伪影和胶片扫描伪像。 例如,在编码之前,预处理器对视频执行组合滤波和抖动,使得在某个位置的抖动的权重取决于在该位置处的滤波结果。 对于组合的滤波和抖动,预处理器可以基于高通余数的局部特性来确定低通信号和高通余数,重量抖动,然后将低通信号与加权抖动组合。 或者,为了确定相对权重,预处理器可以使用其归一化因子根据位置周围多少个采样值在与该位置处的当前采样值相似的阈值内变化的滤波器。 滤光和抖动可以对亮度和色度通道使用不同的强度。

    Detecting progressive video
    13.
    发明申请
    Detecting progressive video 有权
    检测逐行视频

    公开(公告)号:US20070002169A1

    公开(公告)日:2007-01-04

    申请号:US11171594

    申请日:2005-06-29

    IPC分类号: H04N7/01

    摘要: Whether interlaced video fields form a progressive video frame can be automatically determined. The presence or absence of a first characteristic of one or more video fields can be determined by analysis of the fields and/or related information such as flags, cadence, previous determinations, and others. Similarly, the presence or absence of a second characteristic can be detected. In accordance with the detecting, how likely or whether the two or more video fields form a progressive video frame can be determined based on a possibly predetermined likelihood that fields of progressive video frames in general have or do not have the first characteristic and based on a possibly predetermined likelihood that fields of interlaced video frames in general have or do not have the second characteristic.

    摘要翻译: 可以自动确定隔行视频字段形成逐行视频帧。 一个或多个视频场的第一特征的存在或不存在可以通过字段和/或诸如标志,节奏,先前确定等的相关信息的分析来确定。 类似地,可以检测到第二特性的存在或不存在。 根据检测,可能性或两个或更多个视频场是否形成逐行视频帧的可能性可以基于可能预定的可能性来确定,所述可能性是逐行视频帧的字段通常具有或不具有第一特征并且基于 隔行扫描视频帧的场通常具有或不具有第二特征的可能的预定可能性。

    PARALLEL MULTIPLE BITRATE VIDEO ENCODING
    14.
    发明申请
    PARALLEL MULTIPLE BITRATE VIDEO ENCODING 有权
    并行多重双极化视频编码

    公开(公告)号:US20110305273A1

    公开(公告)日:2011-12-15

    申请号:US12814060

    申请日:2010-06-11

    IPC分类号: H04N7/26

    CPC分类号: H04N19/436 H04N19/30

    摘要: A multiple bitrate (MBR) video encoding management tool utilizes available processing units for parallel MBR video encoding. For example, instead of focusing only on multi-threading of encoding tasks for a single picture or group of pictures (GOP), the management tool parallelizes the encoding of multiple GOPs between different processing units and/or different computing systems. With this parallel MBR video encoding architecture, different GOPs can be encoded in parallel. To facilitate such parallel encoding, data dependencies between GOPs are removed. The management tool can adjust the number of GOPs to encode in parallel on a computing system so as to favor parallelism of encoding for different GOPs at the expense of parallelism of encoding inside a GOP, or vice versa, and thereby set a suitable balance between encoding latency and throughput.

    摘要翻译: 多位比特率(MBR)视频编码管理工具利用可用的处理单元进行并行MBR视频编码。 例如,管理工具不是专注于单个图像或图像组(GOP)的编码任务的多线程,而是在不同的处理单元和/或不同的计算系统之间并行化多个GOP的编码。 利用这种并行MBR视频编码架构,可以并行编码不同的GOP。 为了促进这种并行编码,GOP之间的数据依赖性被去除。 管理工具可以在计算系统上调节GOP并行编码的数量,从而以牺牲GOP内编码并行性为代价的优点,对不同的GOP进行编码的平行化,反之亦然,从而在编码之间设置适当的平衡 延迟和吞吐量。

    Strategies for Producing Quantized Image Information
    17.
    发明申请
    Strategies for Producing Quantized Image Information 有权
    生成量化图像信息的策略

    公开(公告)号:US20060147109A1

    公开(公告)日:2006-07-06

    申请号:US11276788

    申请日:2006-03-14

    IPC分类号: G06K9/00

    CPC分类号: H04N1/4053

    摘要: Strategies are set forth herein for quantizing and dithering original image information to produce quantized image information. According to one exemplary implementation, the strategies involve: quantizing a sum that combines an original value taken from the original image information, a noise value, and an error term, to produce a quantized value; and calculating an error term for a subsequent quantizing operation by computing a difference between the quantized value and the original value. By virtue of his process, the strategies essentially add noise information to the quantization process, not the original image information, which results in quantized image information having reduced artifacts. The strategies can be used in conjunction with the Floyd-Steinberg error dispersion algorithm. According to another feature, the noise value is computed using a random number generator having a long repeat period, which further reduces artifacts.

    摘要翻译: 这里阐述了用于量化和抖动原始图像信息以产生量化的图像信息的策略。 根据一个示例性实现,策略涉及:量化组合从原始图像信息获取的原始值,噪声值和误差项的和以产生量化值; 以及通过计算量化值和原始值之间的差来计算后续量化操作的误差项。 凭借其过程,策略基本上将量化过程的噪声信息添加到原始图像信息中,这导致具有减少的伪影的量化的图像信息。 该策略可以与Floyd-Steinberg误差色散算法结合使用。 根据另一个特征,使用具有长重复周期的随机数发生器来计算噪声值,这进一步减少伪像。

    Strategies for configuring media processing functionality using a hierarchical ordering of control parameters
    18.
    发明申请
    Strategies for configuring media processing functionality using a hierarchical ordering of control parameters 有权
    使用控制参数的分层排序来配置媒体处理功能的策略

    公开(公告)号:US20050273791A1

    公开(公告)日:2005-12-08

    申请号:US11172251

    申请日:2005-06-30

    IPC分类号: G06F9/44 G06F9/46

    CPC分类号: G06F3/0481 G06F8/30

    摘要: Strategies for effectively discovering, selecting, configuring, and controlling components used in media processing applications are described. According to one exemplary implementation, the strategies described configure the components based on profile information, configuration information, and a hierarchical ordering of configuration parameters. The hierarchical ordering may combine different coding paradigms, where one or more high level nodes in the ordering may define configuration parameters which are common to multiple coding paradigms. In this ordering, selection of a configuration parameter may cascade down to affect lower-ranking dependent parameters in the hierarchical ordering. According to one advantage, the hierarchical ordering provides a more uniform, extensible, and problem-free approach to configuring components than unstructured approaches to configuration. Moreover, applications can utilize the hierarchical ordering at different levels of granularity.

    摘要翻译: 描述了有效发现,选择,配置和控制媒体处理应用中使用的组件的策略。 根据一个示例性实现,所描述的策略基于简档信息,配置信息以及配置参数的分级排序配置组件。 分层排序可以组合不同的编码范例,其中排序中的一个或多个高级节点可以定义多个编码范例共有的配置参数。 在该排序中,配置参数的选择可以级联以影响分级排序中的较低级别的依赖参数。 根据一个优点,分层排序提供了比非结构化配置方法更为统一,可扩展和无问题的配置组件的方法。 此外,应用程序可以以不同的粒度级别利用分层排序。

    Image processing using linear light values and other image processing improvements
    19.
    发明申请
    Image processing using linear light values and other image processing improvements 有权
    使用线性光值和其他图像处理改进的图像处理

    公开(公告)号:US20050063586A1

    公开(公告)日:2005-03-24

    申请号:US10987378

    申请日:2004-11-12

    IPC分类号: G06K9/00

    CPC分类号: H04N1/4053

    摘要: Strategies are described for processing image information in a linear form to reduce the amount of artifacts (compared to processing the data in nonlinear form). Exemplary types of processing operations can include, scaling, compositing, alpha-blending, edge detection, and so forth. In a more specific implementation, strategies are described for processing image information that is: a) linear; b) in the RGB color space; c) high precision (e.g., provided by floating point representation); d) progressive; and e) full channel. Other improvements provide strategies for: a) processing image information in a pseudo-linear space to improve processing speed; b) implementing an improved error dispersion technique; c) dynamically calculating and applying filter kernels; d) producing pipeline code in an optimal manner; and e) implementing various processing tasks using novel pixel shader techniques.

    摘要翻译: 描述了以线性形式处理图像信息以减少伪像量的策略(与处理非线性形式的数据相比)。 处理操作的示例性类型可以包括缩放,合成,阿尔法混合,边缘检测等等。 在更具体的实现中,描述了用于处理图像信息的策略,其是:a)线性; b)在RGB颜色空间中; c)高精度(例如,由浮点表示提供); d)进步; 和e)全频道。 其他改进提供了以下策略:a)在伪线性空间中处理图像信息以提高处理速度; b)实施改进的误差色散技术; c)动态计算和应用滤波器内核; d)以最佳方式生产管道代码; 以及e)使用新的像素着色器技术实现各种处理任务。

    Filtering and dithering as pre-processing before encoding
    20.
    发明授权
    Filtering and dithering as pre-processing before encoding 有权
    过滤和抖动作为编码前的预处理

    公开(公告)号:US08750390B2

    公开(公告)日:2014-06-10

    申请号:US12008660

    申请日:2008-01-10

    IPC分类号: H04N7/12 H04N7/26

    摘要: Techniques and tools are presented for controlling artifacts such as banding artifacts, ringing artifacts and film scan artifacts in video. For example, before encoding, a pre-processor performs combined filtering and dithering on video such that the weight of dithering at a location depends on the results of filtering at the location. For the combined filtering and dithering, the pre-processor can determine a lowpass signal and highpass residual, weight dithering based on local characteristics of the highpass residual, and then combine the lowpass signal with the weighted dithering. Or, to determine the relative weight, the pre-processor can use a filter whose normalization factor varies depending on how many sample values around a location are within a threshold of similarity to a current sample value at the location. The filtering and dithering can use different strengths for luma and chroma channels.

    摘要翻译: 提出了技术和工具,用于控制视频中的伪影,例如条纹伪像,振铃伪影和胶片扫描伪像。 例如,在编码之前,预处理器对视频执行组合滤波和抖动,使得在某个位置的抖动的权重取决于在该位置处的滤波结果。 对于组合的滤波和抖动,预处理器可以基于高通余数的局部特性来确定低通信号和高通余数,重量抖动,然后将低通信号与加权抖动组合。 或者,为了确定相对权重,预处理器可以使用其归一化因子根据位置周围多少个采样值在与该位置处的当前采样值相似的阈值内变化的滤波器。 滤光和抖动可以对亮度和色度通道使用不同的强度。