专利检索 ap:("Ajay Divakaran" OR "Huifang Sun") AND inv:"Huifang Sun" 第 1 页

1.

发明授权
Methods of feature extraction of video sequences 失效
标题翻译：视频序列特征提取方法

公开(公告)号：US06618507B1

公开(公告)日：2003-09-09

申请号：US09236838

申请日：1999-01-25

申请人： Ajay Divakaran , Huifang Sun , Hiroshi Ito

发明人： Ajay Divakaran , Huifang Sun , Hiroshi Ito

IPC分类号： G06K946

CPC分类号： G06K9/00744 , G06F17/30811

摘要： This invention relates to methods of feature extraction from MPEG-2 and MPEG-4 compressed video sequences. The spatio-temporal compression complexity of video sequences is evaluated for feature extraction by inspecting the compressed bitstream and the complexity is used as a descriptor of the spatio-temporal characteristics of the video sequence. The spatio-temporal compression complexity measure is used as a matching criterion and can also be used for absolute indexing. Feature extraction can be accomplished in conjunction with scene change detection techniques and the combination has reasonable accuracy and the advantage of high simplicity since it is based on entropy decoding of signals in compressed form and does not require computationally expensive inverse Discrete Cosine Transformation (DCT).

摘要翻译： 本发明涉及从MPEG-2和MPEG-4压缩视频序列中提取特征的方法。通过检查压缩比特流来评估视频序列的时空压缩复杂度，并将复杂度用作视频序列的时空特征的描述符。时空压缩复杂度测量用作匹配标准，也可用于绝对索引。特征提取可以结合场景变化检测技术来实现，并且组合具有合理的精度和高简单性的优点，因为它是基于压缩形式的信号的熵解码，并且不需要计算上昂贵的反离散余弦变换（DCT）。

2.

发明授权
Descriptor for spatial distribution of motion activity in compressed video 有权
标题翻译：压缩视频中运动活动空间分布的描述符

公开(公告)号：US06600784B1

公开(公告)日：2003-07-29

申请号：US09496707

申请日：2000-02-02

申请人： Ajay Divakaran , Kadir A. Peker , Huifang Sun

发明人： Ajay Divakaran , Kadir A. Peker , Huifang Sun

IPC分类号： H04N712

CPC分类号： G06F17/30781 , G06K9/00335 , G06K9/00711 , G06T7/215 , G06T2207/10016 , G06T2207/20021

摘要： A method describes motion activity in a video sequence. A motion activity matrix is determined for the video sequence. A threshold for the motion activity matrix is determined. Connected regions of motion vectors at least equal to the threshold are identified and measured for size. A histogram of the distribution of the sizes of the connected areas is constructed for the entire video sequence. The histogram is normalized to characterize the spatial distribution of the video sequence in a motion activity descriptor.

摘要翻译： 一种方法描述视频序列中的运动活动。为视频序列确定运动活动矩阵。确定运动活动矩阵的阈值。至少等于阈值的运动矢量的连接区域被识别并测量尺寸。为整个视频序列构建连接区域的大小分布的直方图。将直方图归一化以表征运动活动描述符中的视频序列的空间分布。

3.

发明授权
Structural analysis of videos with hidden markov models and dynamic programming 失效
标题翻译：具有隐马尔可夫模型和动态规划的视频的结构分析

公开(公告)号：US06865226B2

公开(公告)日：2005-03-08

申请号：US10005623

申请日：2001-12-05

申请人： Lexing Xie , Shih-Fu Chang , Ajay Divakaran , Huifang Sun

发明人： Lexing Xie , Shih-Fu Chang , Ajay Divakaran , Huifang Sun

IPC分类号： H04N5/92 , G06F17/30 , H04N5/91 , H04N7/26 , H04N7/12

CPC分类号： G06K9/00771 , G06F17/30799 , G06F17/30811 , G06F17/30852 , G06K9/00711

摘要： A method analyzes a high-level syntax and structure of a continuous compressed video according to a plurality of states. First, a set of hidden Markov models for each of the states is trained with a training video segmented into known states. Then, a set of domain specific features are extracted from a fixed-length sliding window of the continuous compressed video, and a set of maximum likelihoods is determined for each set of domain specific features using the sets of trained hidden Markov models. Finally, dynamic programming is applied to each set of maximum likelihoods to determine a specific state for each fixed-length sliding window of frames of the compressed video.

摘要翻译： 一种方法根据多种状态分析连续压缩视频的高级语法和结构。首先，针对每个州的一组隐马尔可夫模型训练有一个分为已知状态的训练视频。然后，从连续压缩视频的固定长度的滑动窗口中提取一组特定于域的特征，并且使用训练的隐马尔科夫模型集合针对每组特定特征确定一组最大似然。最后，将动态规划应用于每组最大似然度，以确定压缩视频帧的每个固定长度滑动窗口的特定状态。

4.

发明授权
Extraction of high-level features from low-level features of multimedia content 有权
标题翻译：从多媒体内容的低级功能中提取高级功能

公开(公告)号：US06763069B1

公开(公告)日：2004-07-13

申请号：US09610763

申请日：2000-07-06

申请人： Ajay Divakaran , Anthony Vetro , Huifang Sun , Peng Xu , Shih-Fu Chang

发明人： Ajay Divakaran , Anthony Vetro , Huifang Sun , Peng Xu , Shih-Fu Chang

IPC分类号： H04N712

CPC分类号： G06K9/00711

摘要： A method extracts high-level features from a video including a sequence of frames. Low-level features are extracted from each frame of the video. Each frame of the video is labeled according to the extracted low-level features to generate sequences of labels. Each sequence of labels is associated with one of the extracted low-level feature. The sequences of labels are analyzed using learning machine learning techniques to extract high-level features of the video.

摘要翻译： 一种方法从包括帧序列的视频中提取高级特征。从视频的每个帧中提取低级功能。视频的每个帧根据提取的低级特征进行标记，以生成标签序列。标签的每个序列与提取的低级特征之一相关联。使用学习机器学习技术来分析标签序列以提取视频的高级特征。

5.

发明授权
Methods of scene change detection and fade detection for indexing of video sequences 有权
标题翻译：视频序列索引的场景变化检测和淡入淡出检测方法

公开(公告)号：US06449392B1

公开(公告)日：2002-09-10

申请号：US09231698

申请日：1999-01-14

申请人： Ajay Divakaran , Huifang Sun , Hiroshi Ito , Tommy C. Poon

发明人： Ajay Divakaran , Huifang Sun , Hiroshi Ito , Tommy C. Poon

IPC分类号： G06K946

CPC分类号： G06K9/00711 , H04N5/147 , H04N19/142 , H04N19/179 , H04N19/48 , H04N19/87

摘要： This invention relates to methods of abrupt scene change detection and fade detection for indexing of MPEG-2 and MPEG-4 compressed video sequences. Abrupt scene change and fade-detection techniques applied to signals in compressed form have reasonable accuracy and the advantage of high simplicity since they are based on entropy decoding and do not require computationally expensive inverse Discrete Cosine Transformation (DCT).

摘要翻译： 本发明涉及用于索引MPEG-2和MPEG-4压缩视频序列的突发场景变化检测和淡入淡出检测方法。应用于压缩形式的信号的突发场景变化和衰落检测技术具有合理的精度和高简单性的优点，因为它们基于熵解码，并且不需要计算上昂贵的反离散余弦变换（DCT）。

6.

发明授权
Activity descriptor for video sequences 失效

公开(公告)号：US07003038B2

公开(公告)日：2006-02-21

申请号：US10217918

申请日：2002-08-13

申请人： Ajay Divakaran , Huifang Sun , Hae-Kwang Kim , Chul-Soo Park , Xinding Sun , Bangalore S. Manjunath , Vinod V. Vasudevan , Manoranjan D. Jesudoss , Ganesh Rattinassababady , Hyundoo Shin

发明人： Ajay Divakaran , Huifang Sun , Hae-Kwang Kim , Chul-Soo Park , Xinding Sun , Bangalore S. Manjunath , Vinod V. Vasudevan , Manoranjan D. Jesudoss , Ganesh Rattinassababady , Hyundoo Shin

IPC分类号： H04B1/66

CPC分类号： G06K9/00711 , G06F17/30811 , G06F17/30843 , G06F17/30852 , G06T7/20 , G11B27/28 , H04N19/196 , H04N19/463 , H04N19/61

摘要： A method describes activity in a video sequence. The method measures intensity, direction, spatial, and temporal attributes in the video sequence, and the measured attributes are combined in a digital descriptor of the activity of the video sequence.

7.

发明授权
Method for summarizing a video using motion and color descriptors 失效
标题翻译：使用运动和颜色描述符总结视频的方法

公开(公告)号：US06697523B1

公开(公告)日：2004-02-24

申请号：US09634364

申请日：2000-08-09

申请人： Ajay Divakaran , Kadir A. Peker , Huifang Sun

发明人： Ajay Divakaran , Kadir A. Peker , Huifang Sun

IPC分类号： G06K934

CPC分类号： G06K9/00711 , G06F17/30802 , G06F17/30811 , G06F17/30843

摘要： A method extracts an intensity of motion activity from shots in a compressed video. The method then uses the intensity of motion activity to segment the video into easy and difficult segments to summarize. Easy to summarize segments are represented by any frames selected from the easy to summarize segments, while a color based summarization process extracts generates sequences of frames from each difficult to summarize segment. The selected and generated frames of each segment in each shot are combined to form the summary of the compressed video.

摘要翻译： 一种方法从压缩视频中的拍摄中提取运动活动的强度。然后，该方法使用运动活动的强度将视频分成容易和困难的部分来总结。易于汇总的段由易于总结段选择的任何帧表示，而基于颜色的汇总过程提取从每个难以总结的段生成帧序列。每个镜头中每个片段的选定和生成的帧被组合以形成压缩视频的摘要。

8.

发明授权
Video transcoding using syntactic and semantic clues 有权
标题翻译：视频转码使用句法和语义线索

公开(公告)号：US06574279B1

公开(公告)日：2003-06-03

申请号：US09547159

申请日：2000-04-11

申请人： Anthony Vetro , Ajay Divakaran , Huifang Sun

发明人： Anthony Vetro , Ajay Divakaran , Huifang Sun

IPC分类号： H04N718

CPC分类号： H04L29/06027 , G06T9/001 , H04L47/10 , H04L65/605 , H04L65/80 , H04L69/04 , H04N19/124 , H04N19/147 , H04N19/152 , H04N19/172 , H04N19/19 , H04N19/20 , H04N19/25 , H04N19/29 , H04N19/33 , H04N19/40 , H04N19/436 , H04N19/61 , H04N21/23418 , H04N21/2343 , H04N21/234318 , H04N21/23439 , H04N21/2662 , H04N21/84 , H04N21/8549

摘要： A method for transcoding a compressed video partitions the compressed video into hierarchical levels, and extracts features from each of the hierarchical levels. One of a number of conversion modes of a transcoder is selected dependent on the features extracted from the hierarchical levels. The compressed video is then transcoded according to the selected conversion mode.

摘要翻译： 用于对压缩视频进行代码转换的方法将压缩视频分割成分级，并且从每个层级提取特征。代码转换器的多种转换模式之一是根据从分层级提取的特征来选择的。然后根据所选择的转换模式对压缩视频进行转码。

9.

发明授权
Adaptively processing a video based on content characteristics of frames in a video 失效
标题翻译：基于视频中的帧的内容特征自适应地处理视频

公开(公告)号：US07003154B1

公开(公告)日：2006-02-21

申请号：US09715639

申请日：2000-11-17

申请人： Kadir A. Peker , Ajay Divakaran , Huifang Sun

发明人： Kadir A. Peker , Ajay Divakaran , Huifang Sun

IPC分类号： G06K9/34 , H04N5/91 , G11B27/00

CPC分类号： H04N19/587 , H04N19/132 , H04N19/136 , H04N19/17 , H04N19/172 , H04N19/186 , H04N19/61

摘要： A system and method for temporally processing an input video including input frames. Each frame has an associated frame play time, and the input video has a total input video play time that is a sum of the input frame play times of all of the input frames. Each of the input frames is classified according to a content characteristic of each frames. An output frame play time is allocated to each of the input frames that is based on the classified content characteristic of each of the input frames to generate a plurality of output frames that form an output video.

摘要翻译： 一种用于在时间上处理包括输入帧的输入视频的系统和方法。每帧具有关联的帧播放时间，并且输入视频具有作为所有输入帧的输入帧播放时间之和的总输入视频播放时间。每个输入帧根据每帧的内容特征进行分类。输出帧播放时间被分配给基于每个输入帧的分类内容特征的每个输入帧，以产生形成输出视频的多个输出帧。

10.

发明授权
Compressed bit-stream segment identification and descriptor 失效
标题翻译：压缩位流段识别和描述符

公开(公告)号：US06778708B1

公开(公告)日：2004-08-17

申请号：US09345452

申请日：1999-07-01

申请人： Ajay Divakaran , Huifang Sun

发明人： Ajay Divakaran , Huifang Sun

IPC分类号： G06K946

CPC分类号： G06K9/00744 , G06F17/30811

摘要： A compressed bit-stream represents a corresponding sequence having intra-coded frames and inter-coded frames. The compressed bit-stream includes bits associated with each of the inter-coded frames representing a displacement from the associated inter-coded frame to a closest matching of the intra-coded frames. A magnitude of the displacement of a first of the inter-coded frames is determined based on the bits in the compressed bit-stream associated with that inter-coded frame. The inter-coded frame is then identified based on the determined displacement magnitude. The inter-coded frame includes macro-blocks. Each macro-block is associated with a respective portion of the inter-coded frame bits which represent the displacement from that macro-block to the closest matching intra-coded frame. The displacement magnitude is an average of the displacement magnitudes of all the macro-blocks associated with the inter-coded frame. The displacement magnitudes of those macro-blocks which are less than the average displacement magnitude are set to zero. The number of run lengths of the zero magnitude macro-blocks is determined and also used to identify the first inter-coded frame.

摘要翻译： 压缩比特流表示具有帧内编码帧和帧间编码帧的对应序列。压缩比特流包括与帧间编码帧中的每一个相关联的比特，该比特表示从相关联的帧间编码帧到帧内编码帧的最接近的匹配的位移。基于与该帧间编码帧相关联的压缩比特流中的比特来确定帧间编码帧中的第一帧的位移的大小。然后基于确定的位移幅度来识别帧间编码帧。帧间编码帧包括宏块。每个宏块与表示从该宏块到最接近的匹配帧内编码帧的位移的帧间编码帧比特的相应部分相关联。位移幅度是与帧间编码帧相关联的所有宏块的位移幅度的平均值。小于平均位移幅度的这些宏块的位移量设为零。确定零幅度宏块的行程长度的数量，并且还用于识别第一帧间编码帧。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类