Patent search: ap:("Lie Lu" OR "Yan-Feng Sun" OR "Mingjing Li" OR "Xian-Sheng Hua" OR "Hong-Jiang Zhang") AND inv:"Xian-Sheng Hua", page 1

1.

Granted patent
Automatic detection and segmentation of music videos in an audio/video stream (Status: Active)

Publication No.: US07336890B2

Publication date: 2008-02-26

Application No.: US10370314

Filing date: 2003-02-19

Applicants: Lie Lu, Yan-Feng Sun, Mingjing Li, Xian-Sheng Hua, Hong-Jiang Zhang

Inventors: Lie Lu, Yan-Feng Sun, Mingjing Li, Xian-Sheng Hua, Hong-Jiang Zhang

IPC classification: H04N7/00, H04N5/91, H04N5/93, G06K9/46, G10L11/00, G06F15/173

CPC classification: H04H60/37, G06F17/30787, G06F17/30796, G06K9/00711, G11B27/034, G11B27/28, G11B2220/218, G11B2220/2545, G11B2220/2562, H04H60/58, H04H60/59, H04H60/74, H04N5/147, H04N5/4401, H04N5/602, H04N21/4332, H04N21/4394, H04N21/44008, H04N21/44204, H04N21/8456

Abstract: A “music video parser” automatically detects and segments music videos in a combined audio-video media stream. Automatic detection and segmentation is achieved by integrating shot boundary detection, video text detection and audio analysis to automatically detect temporal boundaries of each music video in the media stream. In one embodiment, song identification information, such as, for example, a song name, artist name, album name, etc., is automatically extracted from the media stream using video optical character recognition (OCR). This information is then used in alternate embodiments for cataloging, indexing and selecting particular music videos, and in maintaining statistics such as the times particular music videos were played, and the number of times each music video was played.

2.

Granted patent
Systems and methods for automatically editing a video (Status: Active)

Publication No.: US07127120B2

Publication date: 2006-10-24

Application No.: US10286348

Filing date: 2002-11-01

Applicants: Xian-Sheng Hua, Lie Lu, Yu-Fei Ma, Mingjing Li, Hong-Jiang Zhang

Inventors: Xian-Sheng Hua, Lie Lu, Yu-Fei Ma, Mingjing Li, Hong-Jiang Zhang

IPC classification: G06K9/40

CPC classification: G11B27/034, G11B27/28

Abstract: Systems and methods to automatically edit a video to generate a video summary are described. In one aspect, sub-shots are extracted from the video. Importance measures are calculated for at least a portion of the extracted sub-shots. Respective relative distributions for sub-shots having relatively higher importance measures as compared to importance measures of other sub-shots are determined. Based on the determined relative distributions, sub-shots that do not exhibit a uniform distribution with respect to other sub-shots in the particular ones are dropped. The remaining sub-shots are connected with respective transitions to generate the video summary.

3.

Granted patent
Learning-based automatic commercial content detection (Status: Lapsed)

Publication No.: US07164798B2

Publication date: 2007-01-16

Application No.: US10368235

Filing date: 2003-02-18

Applicants: Xian-Sheng Hua, Lie Lu, Mingjing Li, Hong-Jiang Zhang

Inventors: Xian-Sheng Hua, Lie Lu, Mingjing Li, Hong-Jiang Zhang

IPC classification: G06K9/72, H04H9/00

CPC classification: G06K9/00711

Abstract: Systems and methods for learning-based automatic commercial content detection are described. In one aspect, program data is divided into multiple segments. The segments are analyzed to determine visual, audio, and context-based feature sets that differentiate commercial content from non-commercial content. The context-based features are a function of single-side left and/or right neighborhoods of segments of the multiple segments.
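
Illustration (not part of the patent record): the abstract for US07164798B2 says only that context-based features are "a function of" single-side left and/or right neighborhoods of a segment. The Python sketch below shows one possible reading, assuming that function is a plain mean of a per-segment score over up to `n` neighbors on each side. The name `context_features`, the parameter `n`, and the choice of a mean are hypothetical and are not taken from the patent.

```python
def context_features(values, n):
    """Return (left_mean, right_mean) for every segment.

    values: one numeric score per segment (hypothetical per-segment feature).
    n: neighborhood size, i.e. how many segments to look at on each side.
    The mean is an assumed aggregate; the abstract does not name the function.
    """
    feats = []
    for i in range(len(values)):
        left = values[max(0, i - n):i]        # single-side left neighborhood
        right = values[i + 1:i + 1 + n]       # single-side right neighborhood
        left_mean = sum(left) / len(left) if left else 0.0
        right_mean = sum(right) / len(right) if right else 0.0
        feats.append((left_mean, right_mean))
    return feats


if __name__ == "__main__":
    print(context_features([1.0, 2.0, 3.0, 4.0], 2))
```

Segments at either end of the stream have an empty neighborhood on one side; this sketch reports 0.0 there, which is again an assumption rather than anything the record specifies.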

4.

Granted patent
Learning-based automatic commercial content detection (Status: Lapsed)

Publication No.: US07565016B2

Publication date: 2009-07-21

Application No.: US11623304

Filing date: 2007-01-15

Applicants: Xian-Sheng Hua, Lie Lu, Mingjing Li, Hong-Jiang Zhang

Inventors: Xian-Sheng Hua, Lie Lu, Mingjing Li, Hong-Jiang Zhang

IPC classification: G06K9/72, H04H9/00

CPC classification: G06K9/00711

Abstract: Systems and methods for learning-based automatic commercial content detection are described. In one aspect, the systems and methods include a training component and an analyzing component. The training component trains a commercial content classification model using a kernel support vector machine. The analyzing component analyzes program data such as video and audio data using the commercial content classification model and one or more of single-side left neighborhood(s) and right neighborhood(s) of program data segments. Based on this analysis, each of the program data segments are classified as being commercial or non-commercial segments.

5.

Patent application
Learning-Based Automatic Commercial Content Detection (Status: Lapsed)

Publication No.: US20070112583A1

Publication date: 2007-05-17

Application No.: US11623304

Filing date: 2007-01-15

Applicants: Xian-Sheng Hua, Lie Lu, Mingjing Li, Hong-Jiang Zhang

Inventors: Xian-Sheng Hua, Lie Lu, Mingjing Li, Hong-Jiang Zhang

IPC classification: G06Q10/00

CPC classification: G06K9/00711

Abstract: Systems and methods for learning-based automatic commercial content detection are described. In one aspect, the systems and methods include a training component and an analyzing component. The training component trains a commercial content classification model using a kernel support vector machine. The analyzing component analyzes program data such as video and audio data using the commercial content classification model and one or more of single-side left neighborhood(s) and right neighborhood(s) of program data segments. Based on this analysis, each of the program data segments are classified as being commercial or non-commercial segments.

6.

Patent application
Systems and methods for personalized karaoke (Status: Pending, published)

Publication No.: US20050123886A1

Publication date: 2005-06-09

Application No.: US10723049

Filing date: 2003-11-26

Applicants: Xian-Sheng Hua, Lie Lu, Hong-Jiang Zhang

Inventors: Xian-Sheng Hua, Lie Lu, Hong-Jiang Zhang

IPC classification: G10H1/36, G10H7/00

CPC classification: G10H1/368, G10H1/361, G10H2220/011

Abstract: Systems and methods are described that implement personalized karaoke, wherein a user's personal home video and photographs are used to form a background for the lyrics during a karaoke performance. An exemplary karaoke apparatus is configured to segment visual content to produce a plurality of sub-shots and to segment music to produce a plurality of music sub-clips. Having produced the visual content sub-shots and music sub-clips, the exemplary karaoke apparatus shortens some of the plurality of sub-shots to a length of a corresponding music sub-clip from within the plurality of music sub-clips. The plurality of sub-shots is then displayed as a background to lyrics associated with the music, thereby adding interest to a karaoke performance.

7.

Granted patent
Content-based dynamic photo-to-video methods and apparatuses (Status: Active)

Publication No.: US07904815B2

Publication date: 2011-03-08

Application No.: US10610105

Filing date: 2003-06-30

Applicants: Xian-Sheng Hua, Lie Lu, Hong-Jiang Zhang

Inventors: Xian-Sheng Hua, Lie Lu, Hong-Jiang Zhang

IPC classification: G06K9/00, H04N5/783

CPC classification: H04N1/32101, H04N2201/3247, H04N2201/3264

Abstract: Methods and apparatuses are provided for automatically generating video data based on still image data. Certain aspects of the video may also be configured to correspond to audio features identified within associated audio data.

8.

Granted patent
Tag association with image regions (Status: Active)

Publication No.: US09047319B2

Publication date: 2015-06-02

Application No.: US12972203

Filing date: 2010-12-17

Applicants: Xian-Sheng Hua, Kuiyuan Yang, Meng Wang, Hong-Jiang Zhang

Inventors: Xian-Sheng Hua, Kuiyuan Yang, Meng Wang, Hong-Jiang Zhang

IPC classification: G06F17/30, G06K9/00

CPC classification: G06F17/30256, G06F17/30268

Abstract: A computing device configured to determine that one or more regions of an image are associated with a tag of the image is described herein. The computing device is further configured to determine one or more attribute tags describing at least one of the content or context of the one or more regions. Upon determining the attribute tags, the computing device associates the attribute tags with the tag to enable image searching based on the tag and attribute tags.

9.

Patent application
Robust Large-Scale Visual Codebook Construction (Status: Active)

Publication No.: US20120251007A1

Publication date: 2012-10-04

Application No.: US13077735

Filing date: 2011-03-31

Applicants: Linjun Yang, Darui Li, Xian-Sheng Hua, Hong-Jiang Zhang

Inventors: Linjun Yang, Darui Li, Xian-Sheng Hua, Hong-Jiang Zhang

IPC classification: G06K9/46

CPC classification: G06K9/6223

Abstract: Techniques for construction of a visual codebook are described herein. Feature points may be extracted from large numbers of images. In one example, images providing N feature points may be used to construct a codebook of K words. The centers of each of K clusters of feature points may be initialized. In a looping or iterative manner, an assignment step assigns each feature point to a cluster and an update step locates a center of each cluster. The feature points may be assigned to a cluster based on a lesser of a distance to a center of a previously assigned cluster and a distance to a center derived by operation of an approximate nearest neighbor algorithm having aspects of randomization. The loop terminates when the feature points have sufficiently converged to their respective clusters. Centers of the clusters represent visual words, which may be used to construct the visual codebook.
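
Illustration (not part of the patent record): the assign/update loop described in the abstract for US20120251007A1 can be sketched roughly as follows. This is a minimal Python sketch under stated assumptions, not the patented implementation. The randomized approximate nearest neighbor search is replaced by a simple stand-in that probes a random subset of centers, and the names `build_codebook`, `n_probe`, `tol`, and `max_iter` are all hypothetical.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def build_codebook(points, k, n_probe=3, tol=0.0, max_iter=50, seed=0):
    """Cluster N feature points into K visual words (k-means style loop)."""
    rng = random.Random(seed)
    # Initialize the K centers from randomly chosen feature points.
    centers = [list(p) for p in rng.sample(points, k)]
    assign = [None] * len(points)
    for _ in range(max_iter):
        changed = 0
        # Assignment step.
        for i, p in enumerate(points):
            # Stand-in for a randomized approximate nearest neighbor search:
            # take the best center among a random subset (an assumption).
            probe = rng.sample(range(k), min(n_probe, k))
            cand = min(probe, key=lambda c: sq_dist(p, centers[c]))
            prev = assign[i]
            # Keep the lesser of the distance to the previously assigned
            # center and the distance to the approximate-search candidate.
            if prev is None or sq_dist(p, centers[cand]) < sq_dist(p, centers[prev]):
                assign[i] = cand
                changed += 1
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [points[i] for i in range(len(points)) if assign[i] == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
        # Stop once few enough points changed cluster in this pass.
        if changed <= tol * len(points):
            break
    return centers, assign


if __name__ == "__main__":
    pts = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
    centers, assign = build_codebook(pts, k=2)
    print(centers, assign)
```

Comparing the candidate against the previously assigned center means a poor approximate search result never moves a point to a farther center, which is what lets a cheap, randomized search stand in for an exact one.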

10.

Patent application
Tag Association with Image Regions (Status: Active)

Publication No.: US20120158721A1

Publication date: 2012-06-21

Application No.: US12972203

Filing date: 2010-12-17

Applicants: Xian-Sheng Hua, Kuiyuan Yang, Meng Wang, Hong-Jiang Zhang

Inventors: Xian-Sheng Hua, Kuiyuan Yang, Meng Wang, Hong-Jiang Zhang

IPC classification: G06F17/30

CPC classification: G06F17/30256, G06F17/30268

Abstract: A computing device configured to determine that one or more regions of an image are associated with a tag of the image is described herein. The computing device is further configured to determine one or more attribute tags describing at least one of the content or context of the one or more regions. Upon determining the attribute tags, the computing device associates the attribute tags with the tag to enable image searching based on the tag and attribute tags.
