Method and system for segmenting videos using face detection
    2.
    Patent application
    Status: lapsed

    Publication No.: US20070091203A1

    Publication date: 2007-04-26

    Application No.: US11258590

    Filing date: 2005-10-25

    IPC classification: H04N7/12

    Abstract: A method generates a summary of a video. Faces are detected in a plurality of frames of the video. The frames are classified according to the number of faces detected in each frame, and the video is partitioned into segments according to the classifications to produce a summary of the video. For each frame classified as having a single detected face, one or more characteristics of the face are determined. The frames are labeled according to the characteristics to produce labeled clusters, and the segments are partitioned into sub-segments according to the labeled clusters.
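
The classification and partitioning steps of this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the three class names (`none`, `single`, `multiple`) and the hard-coded list of per-frame face counts are assumptions, and a real system would obtain the counts from a face detector.

```python
from itertools import groupby


def classify(face_count):
    """Map a per-frame face count to a coarse class (class names are assumed)."""
    if face_count == 0:
        return "none"
    if face_count == 1:
        return "single"
    return "multiple"


def segment_by_face_count(face_counts):
    """Return (label, first_frame, last_frame) for each run of equal classes."""
    segments = []
    start = 0
    for label, run in groupby(classify(c) for c in face_counts):
        length = sum(1 for _ in run)
        segments.append((label, start, start + length - 1))
        start += length
    return segments


# Hypothetical detector output: number of faces found in each frame.
counts = [0, 0, 1, 1, 1, 2, 3, 1, 0]
segments = segment_by_face_count(counts)
```

Segments labeled `single` are the ones the abstract would split further by face characteristics; that clustering step is left out here.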


    Visual complexity measure for playing videos adaptively
    3.
    Patent application
    Status: lapsed

    Publication No.: US20050018881A1

    Publication date: 2005-01-27

    Application No.: US10616546

    Filing date: 2003-07-10

    Abstract: A method plays frames of a video adaptively according to a visual complexity of the video. First, a spatial frequency of pixels within frames of the video is measured, as well as a temporal velocity of corresponding pixels between frames of the video. The spatial frequency is multiplied by the temporal velocity to obtain a measure of the visual complexity of the frames of the video. The frames of the video are then played at a speed that corresponds to the visual complexity.
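
The product described in this abstract is simple to state in code. The sketch below assumes units of cycles per pixel for spatial frequency and pixels per frame for temporal velocity, so the product is in cycles per frame. The mapping from complexity to playback speed (`playback_speed`, `tolerated_complexity`, `max_speed`) is hypothetical; the abstract only says that the speed corresponds to the complexity.

```python
def visual_complexity(spatial_frequency, temporal_velocity):
    """Product of spatial frequency and temporal velocity, as in the abstract."""
    return spatial_frequency * temporal_velocity


def playback_speed(complexity, tolerated_complexity, max_speed=4.0):
    """Hypothetical mapping: speed up until complexity reaches the tolerated level."""
    if complexity <= 0:
        return max_speed  # static or featureless content: play as fast as allowed
    return min(max_speed, tolerated_complexity / complexity)


# Assumed units: cycles/pixel times pixels/frame gives cycles/frame.
complexity = visual_complexity(0.5, 4.0)
speed = playback_speed(complexity, tolerated_complexity=6.0)
```

Under these assumptions, visually simple passages play faster and busy passages play closer to normal speed.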


    Method for computing food volume in a method for analyzing food
    5.
    Granted patent
    Status: in force

    Publication No.: US08345930B2

    Publication date: 2013-01-01

    Application No.: US12758208

    Filing date: 2010-04-12

    IPC classification: G06K9/00

    Abstract: A computer-implemented method for estimating a volume of at least one food item on a food plate is disclosed. A first and second plurality of images are received from different positions above a food plate, wherein angular spacing between the positions of the first plurality of images is greater than angular spacing between the positions of the second plurality of images. A first set of poses of each of the first plurality of images is estimated. A second set of poses of each of the second plurality of images is estimated based on at least the first set of poses. A pair of images taken from each of the first and second plurality of images is rectified based on at least the first and second set of poses. A 3D point cloud is reconstructed based on at least the rectified pair of images. At least one surface of the at least one food item above the food plate is estimated based on at least the reconstructed 3D point cloud. The volume of the at least one food item is estimated based on the at least one surface.
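
The last step, going from an estimated surface to a volume, can be illustrated with a simple numerical integration. This sketch assumes the food surface has already been resampled as a grid of heights above the plate plane, which the abstract does not specify; `volume_from_height_map` and the sample values are hypothetical.

```python
def volume_from_height_map(heights, cell_area):
    """Sum height times cell area over a regular grid above the plate plane."""
    # Negative heights (points below the plate plane) are treated as noise.
    return sum(max(h, 0.0) for row in heights for h in row) * cell_area


# Hypothetical heights in cm on a grid whose cells cover 0.25 cm^2 each.
heights = [
    [0.0, 1.0, 1.0],
    [0.0, 2.0, 1.0],
]
volume = volume_from_height_map(heights, cell_area=0.25)  # cubic centimeters
```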


    Method for pose invariant vessel fingerprinting
    6.
    Granted patent
    Status: in force

    Publication No.: US08330819B2

    Publication date: 2012-12-11

    Application No.: US12758507

    Filing date: 2010-04-12

    IPC classification: H04N7/18

    Abstract: A computer-implemented method for matching objects is disclosed. At least two images are received, where one of the at least two images has a first target object and a second of the at least two images has a second target object. At least one first patch from the first target object and at least one second patch from the second target object are extracted. A distance-based part encoding between each of the at least one first patch and the at least one second patch is constructed based upon a corresponding codebook of image parts including at least one of part type and pose. A viewpoint of one of the at least one first patch is warped to a viewpoint of the at least one second patch. A parts level similarity measure based on the view-invariant distance measure for each of the at least one first patch and the at least one second patch is applied to determine whether the first target object and the second target object are the same or different objects.


    WEAPON IDENTIFICATION USING ACOUSTIC SIGNATURES ACROSS VARYING CAPTURE CONDITIONS
    7.
    Patent application
    Status: in force

    Publication No.: US20100271905A1

    Publication date: 2010-10-28

    Application No.: US12766219

    Filing date: 2010-04-23

    IPC classification: G01S3/80

    CPC classification: G10L25/48

    Abstract: A computer-implemented method for automatically detecting and classifying acoustic signatures across a set of recording conditions is disclosed. A first acoustic signature is received. The first acoustic signature is projected into a space of a minimal set of exemplars of acoustic signature types derived from a larger set of exemplars using a wrapper method. At least one vector distance is calculated between the projected acoustic signature and each exemplar of the minimal set of exemplars. An exemplar is selected from the minimal set of exemplars having the smallest vector distance to the projected acoustic signature as a class corresponding to and classifying the first acoustic signature. The first acoustic signature and the plurality of acoustic signatures may correspond to one of gunshots, musical instruments, songs, and speech. The minimal set of exemplars may correspond to a hierarchy of acoustic signature types.
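
The selection step, picking the exemplar with the smallest vector distance, amounts to a nearest-exemplar lookup. The sketch below assumes the signature has already been projected into the exemplar space and uses Euclidean distance. The class names, the two-dimensional vectors, and `classify_signature` are hypothetical, and the wrapper method that selects the minimal exemplar set is not shown.

```python
import math


def classify_signature(projected, exemplars):
    """Return the class whose exemplar is closest to the projected signature."""
    return min(exemplars, key=lambda name: math.dist(projected, exemplars[name]))


# Hypothetical minimal exemplar set: one vector per acoustic signature type.
exemplars = {
    "rifle": (0.9, 0.1),
    "pistol": (0.2, 0.8),
}
label = classify_signature((0.7, 0.3), exemplars)
```

`math.dist` requires Python 3.8 or later; any other vector distance could be substituted.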


    Multimedia event detection and summarization
    8.
    Granted patent
    Status: lapsed

    Publication No.: US07409407B2

    Publication date: 2008-08-05

    Application No.: US10840824

    Filing date: 2004-05-07

    IPC classification: G06F17/30, G06F17/00

    Abstract: A method detects events in multimedia. Features are extracted from the multimedia. The features are sampled using a sliding window to obtain samples. A context model is constructed for each sample. An affinity matrix is determined from the models and a commutative distance metric between each pair of context models. A second generation eigenvector is determined for the affinity matrix, and the samples are then clustered into events according to the second generation eigenvector.
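
The affinity-matrix and eigenvector steps resemble two-way spectral clustering, and the sketch below is written under that reading. Several things are assumptions: that "second generation eigenvector" means the second generalized eigenvector of the affinity matrix, that affinities come from a Gaussian kernel, and that scalar samples with an absolute-difference distance can stand in for context models. The eigenvector is found by power iteration on the degree-normalized matrix with the trivial constant eigenvector removed.

```python
import math


def affinity_matrix(samples, distance, sigma=1.0):
    """Gaussian affinity from a symmetric (commutative) distance; kernel is assumed."""
    return [[math.exp(-distance(a, b) ** 2 / (2 * sigma ** 2)) for b in samples]
            for a in samples]


def second_generalized_eigenvector(affinity, iterations=200):
    """Power iteration on D^-1 A, deflating the constant eigenvector each step."""
    n = len(affinity)
    degree = [sum(row) for row in affinity]
    total = sum(degree)
    x = [float(i) for i in range(n)]  # deterministic, non-constant start vector
    for _ in range(iterations):
        x = [sum(affinity[i][j] * x[j] for j in range(n)) / degree[i]
             for i in range(n)]
        # Subtract the degree-weighted mean to stay orthogonal to the constant vector.
        mean = sum(d * v for d, v in zip(degree, x)) / total
        x = [v - mean for v in x]
        norm = math.sqrt(sum(v * v for v in x))
        x = [v / norm for v in x]
    return x


# Hypothetical scalar samples standing in for context models.
samples = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
affinity = affinity_matrix(samples, distance=lambda a, b: abs(a - b))
vector = second_generalized_eigenvector(affinity)
labels = [v > 0 for v in vector]  # split samples by the sign of the eigenvector
```

Splitting on the sign gives two clusters; which cluster counts as the "event" would depend on further criteria not covered here.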


    Method and system for video segmentation
    9.
    Patent application
    Status: in force

    Publication No.: US20080124042A1

    Publication date: 2008-05-29

    Application No.: US11593897

    Filing date: 2006-11-07

    IPC classification: H04N5/93

    Abstract: A method segments a video. Audio frames of the video are classified with labels. Dominant labels are assigned to successive time intervals of consecutive labels. A semantic description is constructed for sliding time windows of the successive time intervals, in which the sliding time windows overlap in time, and the semantic description for each time window is a transition matrix determined from the dominant labels of the time intervals. A marker is determined from the transition matrices, in which a frequency of occurrence of the marker is between a low frequency threshold and a high frequency threshold. Then, the video is segmented at the locations of the markers.
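
The dominant-label and transition-matrix steps can be sketched as below. The label names, interval length, window size, and the use of raw transition counts rather than normalized probabilities are all assumptions; marker selection by frequency thresholds is not shown.

```python
from collections import Counter


def dominant_labels(frame_labels, interval):
    """Most common label in each consecutive interval of `interval` frames."""
    return [Counter(frame_labels[i:i + interval]).most_common(1)[0][0]
            for i in range(0, len(frame_labels), interval)]


def sliding_windows(labels, size, step):
    """Overlapping windows of `size` labels, advanced by `step` (step < size)."""
    return [labels[i:i + size] for i in range(0, len(labels) - size + 1, step)]


def transition_matrix(labels, classes):
    """Count transitions between successive labels as a nested dict."""
    matrix = {a: {b: 0 for b in classes} for a in classes}
    for previous, current in zip(labels, labels[1:]):
        matrix[previous][current] += 1
    return matrix


# Hypothetical per-frame audio labels.
frames = ["speech", "speech", "music",
          "music", "music", "music",
          "speech", "speech", "speech",
          "music", "speech", "music"]
dominant = dominant_labels(frames, interval=3)
windows = sliding_windows(dominant, size=3, step=1)
matrices = [transition_matrix(w, classes=("speech", "music")) for w in windows]
```

Each entry of `matrices` is the semantic description of one time window under these assumptions.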
