专利检索 ap:("Kadir Peker" OR "Ajay Divakaran") AND inv:"Ajay Divakaran" 第 1 页

1.

发明申请
Video presentation using compositional structures 审中-公开
标题翻译：使用组合结构的视频演示

公开(公告)号：US20060075346A1

公开(公告)日：2006-04-06

申请号：US10951192

申请日：2004-09-27

申请人： Tom Lanning , Ajay Divakaran , Kadir Peker , Regunathan Radhakrishnan , Ziyou Xiong , Clifton Forlines

发明人： Tom Lanning , Ajay Divakaran , Kadir Peker , Regunathan Radhakrishnan , Ziyou Xiong , Clifton Forlines

IPC分类号： G11B27/00

CPC分类号： G11B19/025 , G11B27/105 , G11B27/107 , G11B27/11 , G11B27/28 , G11B27/34 , G11B2220/20 , G11B2220/65 , G11B2220/90 , H04N5/85 , H04N9/8042 , H04N9/8233 , H04N21/42646 , H04N21/4312 , H04N21/4314 , H04N21/4325 , H04N21/812 , H04N21/84 , H04N21/8456

摘要： A method presents a video according to compositional structures associated with the video. Each compositional structure has a label, and multiple segments that can be organized temporally or hierarchically. A particular compositional structure is selected with a remote controller, and the video is presented by a playback controller on a display device according to the compositional structure.

摘要翻译： 一种方法根据与视频相关联的组合结构呈现视频。每个组成结构都有一个标签，并且可以在时间上或分层上组织的多个片段。利用遥控器选择特定的组成结构，根据组成结构，视频由显示设备上的播放控制器呈现。

2.

发明申请
Method and system for segmenting videos using face detection 失效
标题翻译：使用人脸检测分割视频的方法和系统

公开(公告)号：US20070091203A1

公开(公告)日：2007-04-26

申请号：US11258590

申请日：2005-10-25

申请人： Kadir Peker , Ajay Divakaran

发明人： Kadir Peker , Ajay Divakaran

IPC分类号： H04N7/12

CPC分类号： G06F17/30793 , G06F17/30843 , G06F17/30852 , G06K9/00711 , G06T7/215 , G11B27/034 , G11B27/28

摘要： A method generates a summary of a video. Faces are detected in a plurality of frames of the video. The frames are classified according to a number of faces detected in each frame and the video is partitioned into segments according to the classifications to produce a summary of the video. For each frame classified as having a single detected face, one or more characteristics of the face is determined. The frames are labeled according to the characteristics to produce labeled clusters and the segments are partitioned into sub-segments according to the labeled clusters.

摘要翻译： 一种方法生成视频的摘要。在视频的多个帧中检测到脸部。帧根据在每个帧中检测到的多个面部进行分类，并且根据分类将视频划分成段，以产生视频的摘要。对于被分类为具有单个检测面的每个帧，确定面部的一个或多个特征。根据特征标记帧以产生标记的簇，并且根据标记的簇将片段划分成子片段。

3.

发明申请
Visual complexity measure for playing videos adaptively 失效
标题翻译：视觉复杂度自适应播放视频

公开(公告)号：US20050018881A1

公开(公告)日：2005-01-27

申请号：US10616546

申请日：2003-07-10

申请人： Kadir Peker , Ajay Divakaran

发明人： Kadir Peker , Ajay Divakaran

IPC分类号： G11B27/00 , G11B27/10 , G11B27/28 , H04N5/783 , H04N9/804 , G06K9/00

CPC分类号： G11B27/105 , G11B27/005 , G11B27/28 , H04N5/783 , H04N9/8042

摘要： A method plays frames of a video adaptively according to a visual complexity of the video. First a spatial frequency of pixel within frames of the video is measured, as well as a temporal velocity of corresponding pixels between frames of the video. The spatial frequency is multiplied by the temporal velocity to obtain a measure of the visual complexity of the frames of the video. The frames of the video are then played at a speed that corresponds to the visual complexity.

摘要翻译： 一种方法根据视频的视觉复杂性自适应地播放视频的帧。首先，测量视频帧内像素的空间频率，以及视频帧之间对应像素的时间速度。空间频率乘以时间速度，以获得视频帧的视觉复杂度的度量。然后以与视觉复杂度相对应的速度播放视频的帧。

4.

发明授权
3-D model based method for detecting and classifying vehicles in aerial imagery 有权
标题翻译：基于3-D模型的航空图像检测和分类车辆的方法

公开(公告)号：US08913783B2

公开(公告)日：2014-12-16

申请号：US12913861

申请日：2010-10-28

申请人： Saad Masood Khan , Hui Cheng , Dennis Lee Matthies , Harpreet Singh Sawhney , Sang-Hack Jung , Chris Broaddus , Bogdan Calin Mihai Matei , Ajay Divakaran

发明人： Saad Masood Khan , Hui Cheng , Dennis Lee Matthies , Harpreet Singh Sawhney , Sang-Hack Jung , Chris Broaddus , Bogdan Calin Mihai Matei , Ajay Divakaran

IPC分类号： G06K9/00 , G06K9/46

CPC分类号： G06K9/00785 , B64C39/024 , B64C2201/123 , B64D47/08 , G06K9/00651 , G06K9/4671 , G06K9/6269 , G06K9/6282 , G06T7/11 , G06T7/20 , G06T2207/10012

摘要： A computer implemented method for determining a vehicle type of a vehicle detected in an image is disclosed. An image having a detected vehicle is received. A number of vehicle models having salient feature points is projected on the detected vehicle. A first set of features derived from each of the salient feature locations of the vehicle models is compared to a second set of features derived from corresponding salient feature locations of the detected vehicle to form a set of positive match scores (p-scores) and a set of negative match scores (n-scores). The detected vehicle is classified as one of the vehicle models models based at least in part on the set of p-scores and the set of n-scores.

摘要翻译： 公开了一种用于确定在图像中检测到的车辆的车辆类型的计算机实现的方法。接收具有检测到的车辆的图像。在检测到的车辆上投影出具有显着特征点的多个车辆模型。将来自车辆模型的每个突出特征位置的第一组特征与从所检测到的车辆的相应突出特征位置导出的第二组特征进行比较，以形成一组正的匹配得分（p分数）和一组负面比赛得分（n分）。检测到的车辆至少部分地基于p分数集合和n分数集合被分类为车辆模型模型之一。

5.

发明授权
Method for computing food volume in a method for analyzing food 有权
标题翻译：食物分析方法计算食物量的方法

公开(公告)号：US08345930B2

公开(公告)日：2013-01-01

申请号：US12758208

申请日：2010-04-12

申请人： Amir Tamrakar , Harpreet Singh Sawhney , Qian Yu , Ajay Divakaran

发明人： Amir Tamrakar , Harpreet Singh Sawhney , Qian Yu , Ajay Divakaran

IPC分类号： G06K9/00

CPC分类号： G06T7/0002 , G06T7/44 , G06T7/593 , G06T7/62 , G06T7/77 , G06T2207/20016 , G06T2207/30128

摘要： A computer-implemented method for estimating a volume of at least one food item on a food plate is disclosed. A first and second plurality of images are received from different positions above a food plate, wherein angular spacing between the positions of the first plurality of images is greater than angular spacing between the positions of the second plurality of images. A first set of poses of each of the first plurality of images is estimated. A second set of poses of each of the second plurality of images is estimated based on at least the first set of poses. A pair of images taken from each of the first and second plurality of images is rectified based on at least the first and second set of poses. A 3D point cloud is reconstructed based on at least the rectified pair of images. At least one surface of the at least one food item above the food plate is estimated based on at least the reconstructed 3D point cloud. The volume of the at least one food item is estimated based on the at least one surface.

摘要翻译： 公开了一种用于估计食品板上的至少一种食品的体积的计算机实现的方法。从食品牌上方的不同位置接收第一和第二多个图像，其中第一多个图像的位置之间的角度间隔大于第二多个图像的位置之间的角度间隔。估计第一多个图像中的每一个的第一组姿势。基于至少第一组姿势来估计第二组多个图像中的每一个的第二组姿势。从第一和第二多个图像中的每一个拍摄的一对图像至少基于第一和第二组姿势进行整改。至少基于整流图像对来重构3D点云。基于至少重构的3D点云来估计食物板上方的至少一个食物的至少一个表面。基于至少一个表面来估计至少一个食物的体积。

6.

发明授权
Method for pose invariant vessel fingerprinting 有权
标题翻译：姿态不变血管指纹方法

公开(公告)号：US08330819B2

公开(公告)日：2012-12-11

申请号：US12758507

申请日：2010-04-12

申请人： Sang-Hack Jung , Ajay Divakaran , Harpreet Singh Sawhney

发明人： Sang-Hack Jung , Ajay Divakaran , Harpreet Singh Sawhney

IPC分类号： H04N7/18

CPC分类号： G06K9/00771 , G06K9/6206 , G06K9/6211

摘要： A computer-implemented method for for matching objects is disclosed. At least two images where one of the at least two images has a first target object and a second of the at least two images has a second target object are received. At least one first patch from the first target object and at least one second patch from the second target object are extracted. A distance-based part encoding between each of the at least one first patch and the at least one second patch based upon a corresponding codebook of image parts including at least one of part type and pose is constructed. A viewpoint of one of the at least one first patch is warped to a viewpoint of the at least one second patch. A parts level similarity measure based on the view-invariant distance measure for each of the at least one first patch and the at least one second patch is applied to determine whether the first target object and the second target object are the same or different objects.

摘要翻译： 公开了一种用于匹配对象的计算机实现的方法。接收至少两个图像，其中至少两个图像中的一个具有第一目标对象，并且至少两个图像中的第二图像具有第二目标对象。提取来自第一目标对象的至少一个第一补丁和来自第二目标对象的至少一个第二补丁。构建基于包括部件类型和姿态中的至少一个的图像部件的对应码本的至少一个第一贴片和至少一个第二贴片中的每一个之间的基于距离的部件编码。所述至少一个第一贴片中的一个的视点弯曲到所述至少一个第二贴片的观点。应用基于对于至少一个第一贴片和至少一个第二贴片中的每一个的视图不变距离度量的零件级相似性度量来确定第一目标对象和第二目标对象是相同还是不同的对象。

7.

发明申请
WEAPON IDENTIFICATION USING ACOUSTIC SIGNATURES ACROSS VARYING CAPTURE CONDITIONS 有权
标题翻译：使用声音识别的武器识别符合各种不同的捕获条件

公开(公告)号：US20100271905A1

公开(公告)日：2010-10-28

申请号：US12766219

申请日：2010-04-23

申请人： Saad Khan , Ajay Divakaran , Harpreet Singh Sawhney

发明人： Saad Khan , Ajay Divakaran , Harpreet Singh Sawhney

IPC分类号： G01S3/80

CPC分类号： G10L25/48

摘要： A computer implemented method for automatically detecting and classifying acoustic signatures across a set of recording conditions is disclosed. A first acoustic signature is received. The first acoustic signature is projected into a space of a minimal set of exemplars of acoustic signature types derived from a larger set of exemplars using a wrapper method. At least one vector distance is calculated between the projected acoustic signature and each exemplar of the minimal set of exemplars. An exemplar is selected from the minimal set of exemplars having the smallest vector distance to the projected acoustic signature as a class corresponding to and classifying the first acoustic signature. The first acoustic signature and the plurality of acoustic signatures may correspond to one of gunshots, musical instruments, songs, and speech. The minimal set of exemplars may correspond to a hierarchy of acoustic signature types.

摘要翻译： 公开了一种用于在一组记录条件下自动检测和分类声学签名的计算机实现的方法。接收到第一个声学签名。第一声学签名被投影到使用包装方法从更大的样本集合导出的声学签名类型的最小样本集合的空间中。在投影的声学特征与最小样本集的每个样本之间计算至少一个矢量距离。从具有与投影的声学签名的最小向量距离的最小样本集合中选择一个示例作为对应于和分类第一声学签名的类别。第一声学签名和多个声学签名可以对应于枪声，乐器，歌曲和语音之一。最小的一组样本可以对应于声学签名类型的层级。

8.

发明授权
Multimedia event detection and summarization 失效
标题翻译：多媒体事件检测与总结

公开(公告)号：US07409407B2

公开(公告)日：2008-08-05

申请号：US10840824

申请日：2004-05-07

申请人： Regunathan Radhakrishnan , Ajay Divakaran

发明人： Regunathan Radhakrishnan , Ajay Divakaran

IPC分类号： G06F17/30 , G06F17/00

CPC分类号： G06F17/30787 , G06F17/30802 , G06F17/30808 , G06F17/30811 , G06F17/30843 , G06K9/00711 , Y10S707/99943 , Y10S707/99945

摘要： A method detects events in multimedia. Features are extracted from the multimedia. The features are sampled using a sliding window to obtain samples. A context model is constructed for each sample. An affinity matrix is determined from the models and a commutative distance metric between each pair of context models. A second generation eigenvector is determined for the affinity matrix, and the samples are then clustered into events according to the second generation eigenvector.

摘要翻译： 一种方法来检测多媒体中的事件。功能从多媒体提取。使用滑动窗口对特征进行采样以获得样品。为每个样本构建上下文模型。从模型和每对上下文模型之间的交换距离度量确定亲和度矩阵。针对亲和度矩阵确定第二代特征向量，然后根据第二代特征向量将样本聚类成事件。

9.

发明申请
Method and system for video segmentation 有权
标题翻译：视频分割方法和系统

公开(公告)号：US20080124042A1

公开(公告)日：2008-05-29

申请号：US11593897

申请日：2006-11-07

申请人： Ajay Divakaran , Feng Niu , Naveen Goela

发明人： Ajay Divakaran , Feng Niu , Naveen Goela

IPC分类号： H04N5/93

CPC分类号： G06K9/00711 , G06F17/30787 , G10L25/48

摘要： A method segments a video. Audio frames of the video are classified with labels. Dominant labels are assigned to successive time intervals of consecutive labels. A semantic description is constructed for sliding time windows of the successive time intervals, in which the sliding time windows overlap in time, and the semantic description for each time window is a transition matrix determined from the dominant labels of the time intervals. A marker is determined from the transition matrices, in which a frequency of occurrence of the marker is between a low frequency threshold and a high frequency threshold. Then, the video is segmented at the locations of the markers.

摘要翻译： 一种方法分割视频。视频的音频帧被分类为标签。主导标签分配给连续标签的连续时间间隔。对于连续时间间隔的滑动时间窗口构成语义描述，其中滑动时间窗口在时间上重叠，并且每个时间窗口的语义描述是从时间间隔的主要标签确定的转换矩阵。从标记的出现频率在低频阈值和高频阈值之间的转移矩阵确定标记。然后，视频在标记的位置被分割。

10.

发明申请
System and method for recording and reproducing multimedia 审中-公开
标题翻译：用于录制和再现多媒体的系统和方法

公开(公告)号：US20050154987A1

公开(公告)日：2005-07-14

申请号：US10757138

申请日：2004-01-14

申请人： Isao Otsuka , Ajay Divakaran , Masaharu Ogawa , Kazuhiko Nakane

发明人： Isao Otsuka , Ajay Divakaran , Masaharu Ogawa , Kazuhiko Nakane

IPC分类号： G06F15/00 , G06F17/30 , G10L19/00 , G11B27/00 , G11B27/28 , H04N5/91 , H04N9/804 , H04N9/82

CPC分类号： H04N9/8205 , G06F16/71 , G06F16/739 , G06F16/7834 , G11B27/28 , H04N9/8042 , H04N21/42646 , H04N21/4325 , H04N21/4394 , H04N21/44008 , H04N21/4508 , H04N21/4542 , H04N21/84

摘要： A system and method summarizes multimedia stored in a compressed multimedia file partitioned into a sequence of segments, where the content of the multimedia is, for example, video signals, audio signals, text, and binary data. An associated metadata file includes index information and an importance level for each segment. The importance information is continuous over as closed interval. An importance level threshold is selected in the closed interval, and only segments of the multimedia having a particular importance level greater than the importance level threshold are reproduced.

摘要翻译： 系统和方法总结了存储在被分割成段序列的压缩多媒体文件中的多媒体，其中多媒体的内容是例如视频信号，音频信号，文本和二进制数据。关联的元数据文件包括索引信息和每个段的重要性级别。重要性信息作为闭合间隔连续。在闭合间隔中选择重要性级别阈值，并且仅再现具有大于重要性级别阈值的特定重要性级别的多媒体段。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类