Face recognition in video content
    11.
    Granted invention patent
    Face recognition in video content (in force)

    Publication No.: US08494231B2

    Publication date: 2013-07-23

    Application No.: US12916895

    Application date: 2010-11-01

    IPC classification: G06K9/00

    Abstract: The subject disclosure relates to face recognition in video. Face detection data in frames of input data are used to generate face galleries, which are labeled and used in recognizing faces throughout the video. Metadata that associates the video frame with the face is generated and maintained for subsequent identification. Faces other than those found by face detection may be found by face tracking, in which facial landmarks found by the face detection are used to track a face over previous and/or subsequent video frames. Once generated, the maintained metadata may be accessed to efficiently determine the identity of a person corresponding to a viewer-selected face.
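
The lookup step described in the abstract might be sketched as follows. This is only an illustration: the record layout, the names `FaceTrack` and `identify_selected_face`, and the assumption of one fixed bounding box per track are hypothetical, since the abstract does not specify a metadata format.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FaceTrack:
    """Hypothetical metadata record: one labeled face over a span of frames."""
    label: str                       # identity taken from the labeled face gallery
    first_frame: int                 # first frame where the face was detected or tracked
    last_frame: int                  # last frame (inclusive)
    box: tuple[int, int, int, int]   # (left, top, right, bottom), assumed fixed for simplicity


def identify_selected_face(tracks: list[FaceTrack], frame: int, x: int, y: int) -> Optional[str]:
    """Return the label of the face under the selected point, or None if no face is there."""
    for track in tracks:
        left, top, right, bottom = track.box
        in_frames = track.first_frame <= frame <= track.last_frame
        in_box = left <= x <= right and top <= y <= bottom
        if in_frames and in_box:
            return track.label
    return None


# Illustrative metadata, not taken from the patent.
tracks = [
    FaceTrack("person_a", 0, 120, (40, 30, 100, 110)),
    FaceTrack("person_b", 60, 200, (220, 50, 290, 140)),
]
print(identify_selected_face(tracks, frame=75, x=250, y=90))  # prints "person_b"
```

Because the metadata is generated ahead of time, answering a viewer's selection reduces to a search over stored records rather than running recognition again.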

    Generation and provision of media metadata
    12.
    Granted invention patent
    Generation and provision of media metadata (in force)

    Publication No.: US08763068B2

    Publication date: 2014-06-24

    Application No.: US12964597

    Application date: 2010-12-09

    IPC classification: H04N7/16

    Abstract: Various embodiments related to the generation and provision of media metadata are disclosed. For example, one disclosed embodiment provides a computing device having a logic subsystem configured to execute instructions, and a data holding subsystem comprising instructions stored thereon that are executable by the processor to receive an input of a video and/or audio content item, and to compare the content item to one or more object descriptors each representing an object for locating within the content item to locate instances of one or more of the objects in the content item. The instructions are further executable to generate metadata for each object located in the video content item, and to receive a validating user input related to whether the metadata generated for a selected object is correct.
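
The descriptor-matching and validation flow in the abstract could look roughly like the sketch below. Everything concrete here is an assumption made for illustration: frames and descriptors are modeled as small feature vectors, a match is a Euclidean distance under a threshold, and `ObjectMetadata` and `locate_objects` are hypothetical names.

```python
from __future__ import annotations

import math
from dataclasses import dataclass
from typing import Optional


@dataclass
class ObjectMetadata:
    """Hypothetical metadata entry for one located object instance."""
    object_name: str
    frame_index: int
    confirmed: Optional[bool] = None   # stays None until a user validates the entry


def locate_objects(frames, descriptors, max_distance=0.5):
    """Compare every frame's features to every descriptor and record the matches."""
    metadata = []
    for index, features in enumerate(frames):
        for name, descriptor in descriptors.items():
            # Stand-in similarity test: Euclidean distance between feature vectors.
            if math.dist(features, descriptor) <= max_distance:
                metadata.append(ObjectMetadata(name, index))
    return metadata


# Illustrative data only.
descriptors = {"logo": (1.0, 0.0), "car": (0.0, 1.0)}
frames = [(0.9, 0.1), (0.5, 0.5), (0.1, 1.2)]

metadata = locate_objects(frames, descriptors)
metadata[0].confirmed = True   # validating user input: the first entry is correct
```

Keeping the validation flag on each entry lets unconfirmed metadata be distinguished from entries a user has accepted or rejected.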

    Rotation and scaling optimization for mobile devices
    13.
    Granted invention patent
    Rotation and scaling optimization for mobile devices (lapsed)

    Publication No.: US07710434B2

    Publication date: 2010-05-04

    Application No.: US11755082

    Application date: 2007-05-30

    Applicant: Chuang Gu

    Inventor: Chuang Gu

    Abstract: Image processing in mobile devices is optimized by combining at least two of the color conversion, rotation, and scaling operations. Received images, such as still images or frames of a video stream, are subjected to a combined transformation after decoding, where each pixel is color converted (e.g., from YUV to RGB), rotated, and scaled as needed. By combining two or three of the processes into one, read/write operations that consume significant processing and memory resources are reduced, enabling processing of higher-resolution images and/or savings in power and processing resources.
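
A minimal sketch of the combined transformation, under several assumptions that are not in the abstract: the image is a list of rows of full-resolution (Y, U, V) tuples, the rotation is fixed at 90° clockwise, scaling is nearest-neighbor, and the color conversion uses the full-range BT.601 (JFIF) coefficients. The function names are hypothetical.

```python
def clamp(value):
    """Round to the nearest integer and limit to the 8-bit range."""
    return max(0, min(255, int(round(value))))


def yuv_to_rgb(y, u, v):
    """Full-range BT.601 (JFIF) conversion of a single pixel."""
    d, e = u - 128, v - 128
    return (clamp(y + 1.402 * e),
            clamp(y - 0.344136 * d - 0.714136 * e),
            clamp(y + 1.772 * d))


def convert_rotate_scale(src, out_w, out_h):
    """Color convert, rotate 90° clockwise, and scale in one pass.

    Each output pixel triggers exactly one source read and one write;
    no intermediate RGB or rotated image is ever stored.
    """
    src_h, src_w = len(src), len(src[0])
    out = []
    for dy in range(out_h):
        ry = dy * src_w // out_h          # row in the (virtual) rotated image
        row = []
        for dx in range(out_w):
            rx = dx * src_h // out_w      # column in the (virtual) rotated image
            # Inverse of a 90° clockwise rotation: rotated (rx, ry) <- source (ry, src_h-1-rx)
            y, u, v = src[src_h - 1 - rx][ry]
            row.append(yuv_to_rgb(y, u, v))
        out.append(row)
    return out


# 3x2 gray test image (U = V = 128 means no chroma), so RGB equals (Y, Y, Y).
src = [[(10, 128, 128), (20, 128, 128), (30, 128, 128)],
       [(40, 128, 128), (50, 128, 128), (60, 128, 128)]]
doubled = convert_rotate_scale(src, out_w=4, out_h=6)
```

Performing the three steps separately would write and re-read two intermediate images; the single loop avoids both round trips, which is the saving the abstract points to.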

    Region extraction in vector images
    14.
    Granted invention patent
    Region extraction in vector images (in force)

    Publication No.: US07088845B2

    Publication date: 2006-08-08

    Application No.: US10767135

    Application date: 2004-01-28

    IPC classification: G06K9/00

    Abstract: A semantic object tracking method tracks general semantic objects with multiple non-rigid motions, disconnected components and multiple colors throughout a vector image sequence. The method accurately tracks these general semantic objects by spatially segmenting image regions from a current frame and then classifying these regions as to which semantic object they originated from in the previous frame. To classify each region, the method performs a region-based motion estimation between each spatially segmented region and the previous frame to compute the position of a predicted region in the previous frame. The method then classifies each region in the current frame as being part of a semantic object based on which semantic object in the previous frame contains the most overlapping points of the predicted region. Using this method, each region in the current image is tracked to one semantic object from the previous frame, with no gaps or overlaps. The method propagates few or no errors because it projects regions into a frame where the semantic object boundaries are previously computed rather than trying to project and adjust a boundary in a frame where the object's boundary is unknown.
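
The classification rule in the abstract (assign each region to the previous-frame object that contains most of its projected points) can be illustrated with a small sketch. It assumes the motion estimate is already available and is a pure translation per region, and that the previous frame's segmentation is a 2D grid of object labels; `classify_region` and the sample data are hypothetical.

```python
from collections import Counter


def classify_region(region_points, motion, prev_labels):
    """Project a current-frame region into the previous frame and pick the
    semantic object that covers the largest number of projected points."""
    dx, dy = motion
    votes = Counter()
    for x, y in region_points:
        px, py = x + dx, y + dy                      # predicted position in the previous frame
        if 0 <= py < len(prev_labels) and 0 <= px < len(prev_labels[0]):
            votes[prev_labels[py][px]] += 1
    return votes.most_common(1)[0][0] if votes else None


# Previous frame: object 0 on the left, object 1 on the right (illustrative).
prev_labels = [[0, 0, 1, 1],
               [0, 0, 1, 1],
               [0, 0, 1, 1]]

region_a = [(1, 0), (1, 1), (2, 0)]   # moved one pixel left since the previous frame
region_b = [(0, 0), (1, 0), (2, 0)]   # did not move

print(classify_region(region_a, (1, 0), prev_labels))  # prints 1
print(classify_region(region_b, (0, 0), prev_labels))  # prints 0
```

Since every region receives exactly one label, the regions assigned to an object tile it without gaps or overlaps, which matches the property the abstract claims.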

    STAGED ELEMENT CLASSIFICATION
    15.
    Invention patent application
    STAGED ELEMENT CLASSIFICATION (in force)

    Publication No.: US20120281886A1

    Publication date: 2012-11-08

    Application No.: US13102740

    Application date: 2011-05-06

    Applicant: Yaming He, Chuang Gu

    Inventor: Yaming He, Chuang Gu

    IPC classification: G06K9/00

    Abstract: Various examples are disclosed herein that relate to staged element classification. For example, one disclosed example provides a method of classifying elements by forming elements for classification into a plurality of first-level sets in a first stage, generating primary groups within the first-level sets based on element similarity, forming a plurality of second-level sets from the first-level sets in a second stage, generating secondary groups within the second-level sets based on element similarity, and merging a plurality of the primary and/or secondary groups based on element similarity.
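
One way to read the staged scheme is sketched below. The abstract leaves the similarity measure and the way sets are formed open, so this sketch makes its own choices: elements are plain numbers, two groups are "similar" when their means differ by at most a threshold, first-level sets are fixed-size chunks, and second-level sets combine consecutive first-level sets. All function names are hypothetical.

```python
def mean(group):
    return sum(group) / len(group)


def merge_similar(groups, threshold):
    """Greedily merge groups whose mean values differ by at most `threshold`."""
    merged = []
    for group in groups:
        for target in merged:
            if abs(mean(target) - mean(group)) <= threshold:
                target.extend(group)
                break
        else:
            merged.append(list(group))   # no similar group found: start a new one
    return merged


def chunk(items, size):
    return [items[i:i + size] for i in range(0, len(items), size)]


def staged_classify(elements, set_size=4, sets_per_second_level=2, threshold=1.0):
    # Stage 1: split into first-level sets and build primary groups inside each set.
    first_level = chunk(elements, set_size)
    primary = [merge_similar([[e] for e in s], threshold) for s in first_level]
    # Stage 2: form second-level sets from the first-level sets and build
    # secondary groups inside each of them.
    second_level = chunk(primary, sets_per_second_level)
    secondary = [merge_similar([g for groups in s for g in groups], threshold)
                 for s in second_level]
    # Final step: merge the remaining groups across all second-level sets.
    return merge_similar([g for groups in secondary for g in groups], threshold)


elements = [1.0, 1.2, 5.0, 5.1, 0.9, 5.3, 9.0, 9.2, 1.1, 9.1]
groups = staged_classify(elements)
print(groups)
```

In this reading, each stage compares only the groups within one set, so no stage has to compare every element against every other element.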

    GENERATION AND PROVISION OF MEDIA METADATA
    16.
    Invention patent application
    GENERATION AND PROVISION OF MEDIA METADATA (in force)

    Publication No.: US20120147265A1

    Publication date: 2012-06-14

    Application No.: US12964597

    Application date: 2010-12-09

    IPC classification: H04N7/00

    Abstract: Various embodiments related to the generation and provision of media metadata are disclosed. For example, one disclosed embodiment provides a computing device having a logic subsystem configured to execute instructions, and a data holding subsystem comprising instructions stored thereon that are executable by the processor to receive an input of a video and/or audio content item, and to compare the content item to one or more object descriptors each representing an object for locating within the content item to locate instances of one or more of the objects in the content item. The instructions are further executable to generate metadata for each object located in the video content item, and to receive a validating user input related to whether the metadata generated for a selected object is correct.

    Tracking semantic objects in vector image sequences
    17.
    Granted invention patent
    Tracking semantic objects in vector image sequences (in force)

    Publication No.: US07162055B2

    Publication date: 2007-01-09

    Application No.: US11171448

    Application date: 2005-06-29

    IPC classification: G06K9/00 G06K9/34 H04N5/225

    Abstract: A semantic object tracking method tracks general semantic objects with multiple non-rigid motions, disconnected components and multiple colors throughout a vector image sequence. The method accurately tracks these general semantic objects by spatially segmenting image regions from a current frame and then classifying these regions as to which semantic object they originated from in the previous frame. To classify each region, the method performs a region-based motion estimation between each spatially segmented region and the previous frame to compute the position of a predicted region in the previous frame. The method then classifies each region in the current frame as being part of a semantic object based on which semantic object in the previous frame contains the most overlapping points of the predicted region. Using this method, each region in the current image is tracked to one semantic object from the previous frame, with no gaps or overlaps. The method propagates few or no errors because it projects regions into a frame where the semantic object boundaries are previously computed rather than trying to project and adjust a boundary in a frame where the object's boundary is unknown.

    Semantic video object segmentation and tracking

    Publication No.: US06400831B1

    Publication date: 2002-06-04

    Application No.: US09054280

    Application date: 1998-04-02

    IPC classification: G06K9/00

    Abstract: A semantic video object extraction system using mathematical morphology and perspective motion modeling. A user indicates a rough outline around an image feature of interest for a first frame in a video sequence. Without further user assistance, the rough outline is processed by a morphological segmentation tool to snap the rough outline into a precise boundary surrounding the image feature. Motion modeling is performed on the image feature to track its movement into a subsequent video frame. The motion model is applied to the precise boundary to warp the precise outline into a new rough outline for the image feature in the subsequent video frame. This new rough outline is then snapped to locate a new precise boundary. Automatic processing is repeated for subsequent video frames.
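
The warping step (applying the motion model to the precise boundary to obtain the next frame's rough outline) can be illustrated with the common eight-parameter perspective model. The parameterization below is an assumption; the abstract only says that a perspective motion model is used, and `warp_boundary` is a hypothetical name. Estimating the parameters and the morphological snapping step are outside this sketch.

```python
def warp_boundary(points, params):
    """Apply an 8-parameter perspective motion model to boundary points.

    x' = (a*x + b*y + c) / (g*x + h*y + 1)
    y' = (d*x + e*y + f) / (g*x + h*y + 1)
    """
    a, b, c, d, e, f, g, h = params
    warped = []
    for x, y in points:
        w = g * x + h * y + 1.0          # perspective divisor
        warped.append(((a * x + b * y + c) / w, (d * x + e * y + f) / w))
    return warped


# Precise boundary from the current frame (illustrative square outline).
boundary = [(0, 0), (10, 0), (10, 10), (0, 10)]

# With g = h = 0 the model reduces to an affine map; here a pure translation.
translation = (1, 0, 3, 0, 1, -2, 0, 0)
print(warp_boundary(boundary, translation))
# prints [(3.0, -2.0), (13.0, -2.0), (13.0, 8.0), (3.0, 8.0)]
```

The warped points serve only as the new rough outline; per the abstract, they are then snapped to the actual object boundary in the subsequent frame.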

    Staged element classification
    19.
    Granted invention patent
    Staged element classification (in force)

    Publication No.: US08588534B2

    Publication date: 2013-11-19

    Application No.: US13102740

    Application date: 2011-05-06

    Applicant: Yaming He, Chuang Gu

    Inventor: Yaming He, Chuang Gu

    IPC classification: G06K9/68

    Abstract: Various examples are disclosed herein that relate to staged element classification. For example, one disclosed example provides a method of classifying elements by forming elements for classification into a plurality of first-level sets in a first stage, generating primary groups within the first-level sets based on element similarity, forming a plurality of second-level sets from the first-level sets in a second stage, generating secondary groups within the second-level sets based on element similarity, and merging a plurality of the primary and/or secondary groups based on element similarity.

    Tracking semantic objects in vector image sequences
    20.
    Invention patent application
    Tracking semantic objects in vector image sequences (in force)

    Publication No.: US20050240629A1

    Publication date: 2005-10-27

    Application No.: US11171448

    Application date: 2005-06-29

    Abstract: A semantic object tracking method tracks general semantic objects with multiple non-rigid motions, disconnected components and multiple colors throughout a vector image sequence. The method accurately tracks these general semantic objects by spatially segmenting image regions from a current frame and then classifying these regions as to which semantic object they originated from in the previous frame. To classify each region, the method performs a region-based motion estimation between each spatially segmented region and the previous frame to compute the position of a predicted region in the previous frame. The method then classifies each region in the current frame as being part of a semantic object based on which semantic object in the previous frame contains the most overlapping points of the predicted region. Using this method, each region in the current image is tracked to one semantic object from the previous frame, with no gaps or overlaps. The method propagates few or no errors because it projects regions into a frame where the semantic object boundaries are previously computed rather than trying to project and adjust a boundary in a frame where the object's boundary is unknown.
