Video concept detection using multi-layer multi-instance learning
    1.
    发明授权
    Video concept detection using multi-layer multi-instance learning 有权
    使用多层多实例学习的视频概念检测

    公开(公告)号:US08804005B2

    公开(公告)日:2014-08-12

    申请号:US12111202

    申请日:2008-04-29

    IPC分类号: G06K9/62 G06K9/34

    摘要: Visual concepts contained within a video clip are classified based upon a set of target concepts. The clip is segmented into shots and a multi-layer multi-instance (MLMI) structured metadata representation of each shot is constructed. A set of pre-generated trained models of the target concepts is validated using a set of training shots. An MLMI kernel is recursively generated which models the MLMI structured metadata representation of each shot by comparing prescribed pairs of shots. The MLMI kernel is subsequently utilized to generate a learned objective decision function which learns a classifier for determining if a particular shot (that is not in the set of training shots) contains instances of the target concepts. A regularization framework can also be utilized in conjunction with the MLMI kernel to generate modified learned objective decision functions. The regularization framework introduces explicit constraints which serve to maximize the precision of the classifier.

    摘要翻译: 视频剪辑中包含的视觉概念基于一组目标概念进行分类。 剪辑被分割成镜头,并且构建每个镜头的多层多实例(MLMI)结构化元数据表示。 使用一组训练镜头验证了一组预先生成的目标概念训练模型。 通过比较规定的拍摄对,递归地生成MLMI内核,以对每个镜头的MLMI结构化元数据表示进行建模。 MLMI内核随后被用于生成学习的客观决策函数,该函数学习用于确定特定镜头(不在该组训练镜头中)是否包含目标概念的实例的分类器。 正则化框架也可以与MLMI内核一起使用,以生成修改后的学习目标决策函数。 正则化框架引入明确的约束,用于最大化分类器的精度。

    VIDEO CONCEPT DETECTION USING MULTI-LAYER MULTI-INSTANCE LEARNING
    2.
    发明申请
    VIDEO CONCEPT DETECTION USING MULTI-LAYER MULTI-INSTANCE LEARNING 有权
    使用多层次多实例学习的视频概念检测

    公开(公告)号:US20090274434A1

    公开(公告)日:2009-11-05

    申请号:US12111202

    申请日:2008-04-29

    IPC分类号: G11B27/00

    摘要: Visual concepts contained within a video clip are classified based upon a set of target concepts. The clip is segmented into shots and a multi-layer multi-instance (MLMI) structured metadata representation of each shot is constructed. A set of pre-generated trained models of the target concepts is validated using a set of training shots. An MLMI kernel is recursively generated which models the MLMI structured metadata representation of each shot by comparing prescribed pairs of shots. The MLMI kernel is subsequently utilized to generate a learned objective decision function which learns a classifier for determining if a particular shot (that is not in the set of training shots) contains instances of the target concepts. A regularization framework can also be utilized in conjunction with the MLMI kernel to generate modified learned objective decision functions. The regularization framework introduces explicit constraints which serve to maximize the precision of the classifier.

    摘要翻译: 视频剪辑中包含的视觉概念基于一组目标概念进行分类。 剪辑被分割成镜头,并且构建每个镜头的多层多实例(MLMI)结构化元数据表示。 使用一组训练镜头验证了一组预先生成的目标概念训练模型。 通过比较规定的拍摄对,递归地生成MLMI内核,以对每个镜头的MLMI结构化元数据表示进行建模。 MLMI内核随后被用于生成学习的客观决策函数,该函数学习用于确定特定镜头(不在该组训练镜头中)是否包含目标概念的实例的分类器。 正则化框架也可以与MLMI内核一起使用,以生成修改后的学习目标决策函数。 正则化框架引入明确的约束,用于最大化分类器的精度。

    Enhancing photo browsing through music and advertising
    3.
    发明授权
    Enhancing photo browsing through music and advertising 有权
    通过音乐和广告加强照片浏览

    公开(公告)号:US08504422B2

    公开(公告)日:2013-08-06

    申请号:US12786020

    申请日:2010-05-24

    IPC分类号: G06Q30/00

    摘要: Techniques for recommending music and advertising to enhance a user's experience while photo browsing are described. In some instances, songs and ads are ranked for relevance to at least one photo from a photo album. The songs, ads and photo(s) from the photo album are then mapped to a style and mood ontology to obtain vector-based representations. The vector-based representations can include real valued terms, each term associated with a human condition defined by the ontology. A re-ranking process generates a relevancy term for each song and each ad indicating relevancy to the photo album. The relevancy terms can be calculated by summing weighted terms from the ranking and the mapping. Recommended music and ads may then be provided to a user, as the user browses a series of photos obtained from the photo album. The ads may be seamlessly embedded into the music in a nonintrusive manner.

    摘要翻译: 描述用于推荐音乐和广告以提高用户在照相浏览时体验的技术。 在某些情况下,歌曲和广告的排名与相册中的至少一张照片相关。 然后将相册中的歌曲,广告和照片映射到风格和心境本体以获得基于矢量的表示。 基于向量的表示可以包括实际值,每个术语与由本体定义的人类条件相关联。 重新排序过程产生每个歌曲的相关术语,每个广告指示相册的相关性。 可以通过从排名和映射求和加权项来计算相关项。 然后,当用户浏览从相册获得的一系列照片时,推荐的音乐和广告可以被提供给用户。 广告可以无缝地嵌入到音乐中。

    Intelligent overlay for video advertising
    4.
    发明授权
    Intelligent overlay for video advertising 有权
    视频广告的智能覆盖

    公开(公告)号:US08369686B2

    公开(公告)日:2013-02-05

    申请号:US12571373

    申请日:2009-09-30

    IPC分类号: H04N9/80

    摘要: Video advertising overlay technique embodiments are presented that generally detect a set of spatio-temporal nonintrusive positions within a series of consecutive video frames in shots of a digital video and then overlay contextually relevant ads on these positions. In one general embodiment, this is accomplished by decomposing the video into a series of shots, and then identifying a video advertisement for each of a selected set of the shots. The identified video advertisement is one that is determined to be the most relevant to the content of the shot. An overlay area is also identified in each of the shots, where the selected overlay area is the least intrusive among a plurality of prescribed areas to a viewer of the video. The video advertisements identified for the shots are then respectively scheduled to be overlaid in the identified overlay area of a shot, whenever the shot is played.

    摘要翻译: 提供了视频广告覆盖技术实施例,其通常在数字视频的拍摄中检测一系列连续视频帧内的一组时空非侵入位置,然后在这些位置上重叠相关的相关广告。 在一个一般实施例中,这通过将视频分解成一系列镜头,然后为所选择的一组拍摄中的每一个识别视频广告来实现。 所识别的视频广告是被确定为与拍摄内容最相关的广告。 在每个拍摄中还识别覆盖区域,其中所选覆盖区域在多个规定区域中对于视频的观看者是最小的侵入。 每当拍摄被拍摄时,为拍摄而识别的视频广告然后分别被调度为覆盖在所识别的拍摄的重叠区域中。

    Enhancing Photo Browsing through Music and Advertising
    5.
    发明申请
    Enhancing Photo Browsing through Music and Advertising 有权
    通过音乐和广告增强照片浏览

    公开(公告)号:US20110288929A1

    公开(公告)日:2011-11-24

    申请号:US12786020

    申请日:2010-05-24

    IPC分类号: G06Q30/00 G06F17/30 G06Q10/00

    摘要: Techniques for recommending music and advertising to enhance a user's experience while photo browsing are described. In some instances, songs and ads are ranked for relevance to at least one photo from a photo album. The songs, ads and photo(s) from the photo album are then mapped to a style and mood ontology to obtain vector-based representations. The vector-based representations can include real valued terms, each term associated with a human condition defined by the ontology. A re-ranking process generates a relevancy term for each song and each ad indicating relevancy to the photo album. The relevancy terms can be calculated by summing weighted terms from the ranking and the mapping. Recommended music and ads may then be provided to a user, as the user browses a series of photos obtained from the photo album. The ads may be seamlessly embedded into the music in a nonintrusive manner.

    摘要翻译: 描述用于推荐音乐和广告以提高用户在照相浏览时体验的技术。 在某些情况下,歌曲和广告的排名与相册中的至少一张照片相关。 然后将相册中的歌曲,广告和照片映射到风格和心境本体以获得基于矢量的表示。 基于向量的表示可以包括实际值,每个术语与由本体定义的人类条件相关联。 重新排序过程产生每个歌曲的相关术语,每个广告指示相册的相关性。 可以通过从排名和映射求和加权项来计算相关项。 然后,当用户浏览从相册获得的一系列照片时,推荐的音乐和广告可以被提供给用户。 广告可以无缝地嵌入到音乐中。

    Video Collage Presentation
    6.
    发明申请
    Video Collage Presentation 审中-公开
    视频拼贴演示

    公开(公告)号:US20090003712A1

    公开(公告)日:2009-01-01

    申请号:US12055267

    申请日:2008-03-25

    IPC分类号: G06K9/62

    摘要: A method, a computer-readable storage media, and a user interface describe techniques for creating a video collage synthesized from video content, selecting representative images from the video content, extracting and resizing regions of interest (ROI) from the representative images from the video content, and arranging the regions of interest on a canvas without seams while preserving a temporal structure of the video content. The described method, computer-readable storage, and user interface enhance the experience of the user in browsing a video collage that is compact.

    摘要翻译: 一种方法,计算机可读存储介质和用户界面描述了用于创建从视频内容合成的视频拼贴的技术,从视频内容中选择代表图像,从视频中提取和调整来自代表图像的感兴趣区域(ROI) 内容,并且在没有接缝的情况下在画布上排列感兴趣的区域,同时保留视频内容的时间结构。 所描述的方法,计算机可读存储和用户界面增强了用户浏览紧凑的视频拼贴的体验。

    SPONSORED MULTI-MEDIA BLOGGING
    7.
    发明申请
    SPONSORED MULTI-MEDIA BLOGGING 审中-公开
    赞助多媒体博客

    公开(公告)号:US20120109754A1

    公开(公告)日:2012-05-03

    申请号:US12939132

    申请日:2010-11-03

    IPC分类号: G06Q30/00 G06F15/16

    CPC分类号: G06Q30/0251 G06Q30/0267

    摘要: The sponsored multi-media blogging technique is an advertising-driven service on a computing device, such as a mobile phone, that makes the multi-media micro-blog or blog an effective carrier for advertising. The data collected while employing the sponsored multi-media blogging technique is used for user intent mining and increasing advertisement relevance for mobile advertising projects. The benefits to the sponsored multi-media blogging technique's users are a natural interface for composing multi-media micro-blogs/blogs and instant experience sharing, while the benefits to advertisers is the promoted brand impression from the contextual advertising in rich media micro-blogs/blogs.

    摘要翻译: 赞助的多媒体博客技术是在诸如手机之类的计算设备上的广告驱动服务,使得多媒体微博或博客成为广告的有效载体。 采用赞助的多媒体博客技术收集的数据用于用户意图挖掘和增加移动广告项目的广告相关性。 赞助的多媒体博客技术用户的好处是构成多媒体微博/博客和即时体验共享的自然界面,而对广告客户的好处是从富媒体微博上下文广告中提升的品牌印象 /博客。

    Incentive Selection of Region-of-Interest and Advertisements for Image Advertising
    8.
    发明申请
    Incentive Selection of Region-of-Interest and Advertisements for Image Advertising 审中-公开
    激励选择图像广告领域和广告

    公开(公告)号:US20120095825A1

    公开(公告)日:2012-04-19

    申请号:US12906899

    申请日:2010-10-18

    IPC分类号: G06Q30/00

    摘要: Techniques for image selection and region of interest analysis are described herein. A pair of two or more users is configured, and an image is displayed to the pair. The image can be a still image (i.e., a picture) or a moving image (i.e., video). In some instances, a plurality of advertisements is suggested for possible association with the image. Input is received from both users in the pair, indicating a positive or a negative association between each advertisement and the image. When the pair positively rates an advertisement, the advertisement is associated with the image. A plurality of regions of interest within the image may be suggested. In response, positive or negative input is received from the pair indicating whether each of the plurality of regions of interest is appropriately suggested for placement of an advertisement.

    摘要翻译: 本文描述了用于图像选择和感兴趣区域分析的技术。 配置一对两个以上的用户,并且一对图像被显示。 图像可以是静止图像(即,图像)或运动图像(即,视频)。 在一些情况下,建议多个广告以与图像可能的关联。 从该对中的两个用户接收到输入,指示每个广告和图像之间的正或负关联。 当该对对广告进行正比率时,该广告与图像相关联。 可以建议图像内的多个感兴趣区域。 作为响应,从该对接收到正或负输入,指示是否适当地建议多个感兴趣区域中的每一个用于放置广告。

    Capture-intention detection for video content analysis
    9.
    发明授权
    Capture-intention detection for video content analysis 有权
    视频内容分析的捕获意图检测

    公开(公告)号:US07773813B2

    公开(公告)日:2010-08-10

    申请号:US11263081

    申请日:2005-10-31

    IPC分类号: G06K9/62

    摘要: Systems and methods are described for detecting capture-intention in order to analyze video content. In one implementation, a system decomposes video structure into sub-shots, extracts intention-oriented features from the sub-shots, delineates intention units via the extracted features, and classifies the intention units into intention categories via the extracted features. A video library can be organized via the categorized intention units.

    摘要翻译: 描述了用于检测捕获意图以分析视频内容的系统和方法。 在一个实现中,系统将视频结构分解为子照片,从子拍摄中提取面向意图的特征,通过提取的特征描绘意图单元,并通过提取的特征将意图单元分类成意图类别。 可以通过分类意图单位组织视频库。

    MULTI-VIDEO SYNTHESIS
    10.
    发明申请
    MULTI-VIDEO SYNTHESIS 有权
    多视频合成

    公开(公告)号:US20100149419A1

    公开(公告)日:2010-06-17

    申请号:US12334231

    申请日:2008-12-12

    IPC分类号: H04N9/74

    CPC分类号: G11B27/036

    摘要: Embodiments that provide multi-video synthesis are disclosed. In accordance with one embodiment, multi-video synthesis includes breaking a main video into a plurality of main frames and break a supplementary video into a plurality of supplementary frames. The multi-video synthesis also includes assigning one or more supplementary frames into each of a plurality of states of a Hidden Markov Model (HMM), where each of the plurality of states corresponding to one or more main frames. The multi-video synthesis further includes determining optimal frames in the plurality of main frames for insertion of the plurality of supplementary frames based on the plurality of states and visual properties. The optimal frames include optimal insertion positions. The multi-video synthesis additionally includes inserting the plurality of supplementary frames into the optimal insertion positions to form a synthesized video.

    摘要翻译: 公开了提供多视频合成的实施例。 根据一个实施例,多视频合成包括将主视频分解成多个主帧并将辅助视频分解成多个补充帧。 多视频合成还包括将一个或多个补充帧分配给隐马尔可夫模型(HMM)的多个状态中的每个状态,其中多个状态中的每一个对应于一个或多个主帧。 多视频合成还包括基于多个状态和视觉属性来确定多个主帧中的最佳帧以插入多个补充帧。 最佳帧包括最佳插入位置。 多视频合成还包括将多个辅助帧插入最佳插入位置以形成合成视频。