Determining regions of interest in synthetic images
    32.
    发明申请
    Determining regions of interest in synthetic images 有权
    确定合成图像中的感兴趣区域

    公开(公告)号:US20060062456A1

    公开(公告)日:2006-03-23

    申请号:US10948823

    申请日:2004-09-23

    IPC分类号: G06K9/34 G06K9/46

    CPC分类号: G06K9/3233

    摘要: An algorithm for finding regions of interest (ROI) in synthetic images based on an information driven approach in which sub-blocks of a set of synthetic image are analyzed for information content or compressibility based on textural and color features. A DCT may be used to analyze the textural features of a set of images and a color histogram may be used to analyze the color features of the set of images. Sub-blocks of low compressibility are grouped into ROIs using a type of morphological technique. Unlike other algorithms that are geared for highly specific types of ROI (e.g. OCR text detection), the method of the present invention is generally applicable to arbitrary synthetic images. The present invention can be used with several other image applications, including Stained-Glass collages presentations.

    摘要翻译: 一种基于信息驱动方法在合成图像中发现感兴趣区域(ROI)的算法,其中基于纹理和颜色特征分析一组合成图像的子块的信息内容或可压缩性。 可以使用DCT来分析一组图像的纹理特征,并且可以使用颜色直方图来分析该组图像的颜色特征。 使用一种形态学技术将低压缩性的子块分组成ROI。 与专门针对高度特定类型的ROI(例如OCR文本检测)的其他算法不同,本发明的方法通常适用于任意合成图像。 本发明可以与数种其它图像应用一起使用,包括彩色玻璃拼贴画展示。

    Media browser using multimodal analysis
    33.
    发明授权
    Media browser using multimodal analysis 有权
    媒体浏览器采用多模态分析

    公开(公告)号:US06366296B1

    公开(公告)日:2002-04-02

    申请号:US09151285

    申请日:1998-09-11

    IPC分类号: G06F300

    摘要: A media browser, graphical user interface and method for browsing a media file wherein a user selects at least one feature in a media file and is provided with information regarding the existence of the selected feature in the media file. Based on the information, the user can identify and playback portions of interest in a media file. Features in a media file, such as a speaker's identity, applause, silence, motion, or video cuts, are preferably automatically time-wise evaluated in the media file using known methods. Metadata generated based on the time-wise feature evaluation are preferably mapped to confidence score values that represent a probability of a corresponding feature's existence in the media file. Confidence score information is preferably presented graphically to a user as part of a graphical user interface, and is used to interactively browse the media file.

    摘要翻译: 用于浏览媒体文件的媒体浏览器,图形用户界面和方法,其中用户选择媒体文件中的至少一个特征并且提供有关媒体文件中所选特征的存在的信息。 基于该信息,用户可以识别并播放媒体文件中感兴趣的部分。 媒体文件中的特征,例如扬声器的身份,掌声,静音,运动或视频剪辑,优选地使用已知方法在媒体文件中自动地逐时评估。 基于时间特征评估生成的元数据优选地映射到表示媒体文件中对应特征存在概率的置信度分数值。 信心分数信息优选地以图形方式呈现给用户,作为图形用户界面的一部分,并且用于交互地浏览媒体文件。

    Systems and methods for organizing files in a graph-based layout
    34.
    发明授权
    Systems and methods for organizing files in a graph-based layout 有权
    用于在基于图形的布局中组织文件的系统和方法

    公开(公告)号:US08832119B2

    公开(公告)日:2014-09-09

    申请号:US12137718

    申请日:2008-06-12

    CPC分类号: G06F17/30058

    摘要: An adaptive, interactive visual workspace for viewing groups of files based on their relationships. Relationships of files are visualized using iterative refinement of categories through a direct-manipulation graph-based layout. The visual workspace starts with a fully connected graph linking thumbnail images of related files that is then partitioned into neighborhoods in response to a user creating file stacks corresponding to different categories. Normalized spring lengths improve the overall quality of the layout. Different modes for membership in neighborhoods avoid confusing motion of files and help a user to manually organize the workspace. Additionally, retrieved files can be added without having to significantly move the previous files. Different visualization techniques indicate which files are related to each other. Different zoom rates are used for file location, and surrogate sizes allow users to increase the separation between files while still increasing the surrogate sizes.

    摘要翻译: 一个自适应的交互式视觉工作区,用于根据他们的关系来查看文件组。 文件的关系通过直接操作图形布局的类别迭代细化进行可视化。 可视化工作区以完全连接的图形开始,将图形相关联的文件链接起来,然后将其分成邻域,以响应用户创建对应于不同类别的文件堆栈。 标准化的弹簧长度提高了布局的整体质量。 用于居民区成员身份的不同模式可避免混淆文件运动,并帮助用户手动组织工作区。 此外,可以添加检索到的文件,而不必显着移动以前的文件。 不同的可视化技术指出哪些文件是相互关联的。 文件位置使用不同的缩放比例,替代尺寸允许用户增加文件之间的间隔,同时仍然增加代理大小。

    Annealing algorithm for non-rectangular shaped stained glass collages
    35.
    发明授权
    Annealing algorithm for non-rectangular shaped stained glass collages 有权
    非矩形彩色玻璃拼贴的退火算法

    公开(公告)号:US08144919B2

    公开(公告)日:2012-03-27

    申请号:US11525589

    申请日:2006-09-22

    IPC分类号: G06K9/00

    CPC分类号: H04N1/387

    摘要: The present invention relates to a method to make effective use of non rectangular display space for displaying a collage. In an embodiment of the invention, a heterogeneous set of images can be arranged to display the region of interest of the images to avoid overlapping regions of interest. The background gaps between the regions of interest can be filled by extending the regions of interest using a Voronoi technique. This produces a stained glass effect for the collage. In an embodiment of the present invention, the technique can be applied to irregular shapes including circular shapes with a hole in the middle. In an embodiment of the present invention, the technique can be used to print labels for disks.

    摘要翻译: 本发明涉及一种有效利用非矩形显示空间来显示拼贴画的方法。 在本发明的实施例中,可以布置异构的图像集合以显示图像的感兴趣区域以避免重叠的感兴趣区域。 可以使用Voronoi技术扩展感兴趣的区域来填补感兴趣区域之间的背景差距。 这产生了拼贴画的彩色玻璃效果。 在本发明的一个实施例中,该技术可以应用于包括在中间具有孔的圆形的不规则形状。 在本发明的一个实施例中,该技术可用于打印盘的标签。

    Annealing algorithm for non-rectangular shaped stained glass collages
    36.
    发明申请
    Annealing algorithm for non-rectangular shaped stained glass collages 有权
    非矩形彩色玻璃拼贴的退火算法

    公开(公告)号:US20080075390A1

    公开(公告)日:2008-03-27

    申请号:US11525589

    申请日:2006-09-22

    IPC分类号: G06K9/36 G06K9/32

    CPC分类号: H04N1/387

    摘要: The present invention relates to a method to make effective use of non rectangular display space for displaying a collage. In an embodiment of the invention, a heterogeneous set of images can be arranged to display the region of interest of the images to avoid overlapping regions of interest. The background gaps between the regions of interest can be filled by extending the regions of interest using a Voronoi technique. This produces a stained glass effect for the collage. In an embodiment of the present invention, the technique can be applied to irregular shapes including circular shapes with a hole in the middle. In an embodiment of the present invention, the technique can be used to print labels for disks.

    摘要翻译: 本发明涉及一种有效利用非矩形显示空间来显示拼贴画的方法。 在本发明的实施例中,可以布置异构的图像集合以显示图像的感兴趣区域以避免重叠的感兴趣区域。 可以使用Voronoi技术扩展感兴趣的区域来填补感兴趣区域之间的背景差距。 这产生了拼贴画的彩色玻璃效果。 在本发明的一个实施例中,该技术可以应用于包括在中间具有孔的圆形的不规则形状。 在本发明的一个实施例中,该技术可用于打印盘的标签。

    Extracting video regions of interest
    38.
    发明申请
    Extracting video regions of interest 有权
    提取感兴趣的视频区域

    公开(公告)号:US20050220348A1

    公开(公告)日:2005-10-06

    申请号:US10815389

    申请日:2004-03-31

    摘要: In one embodiment, the present invention extracts video regions of interest from one or more videos and generates a highly condensed visual summary of the videos. The video regions of interest are extracted based on to energy, movement, face or other object detection methods, associated data or external input, or some other feature of the video. In another embodiment, the present invention extracts regions of interest from images and generates highly condensed visual summaries of the images. The highly condensed visual summary is generated by laying out germs on a canvas and then filling the spaces between the germs. The result is a visual summary that resembles a stained glass window having cells of varying shape. The germs may be laid out by temporal order, color histogram, similarity, according to a desired pattern, size, or some other manner. The people, objects and other visual content in the germs appear larger and become easier to see. The visual summary of the present invention utilizes important regions within the key frames, leading to more condensed summaries that are well suitable for small screens.

    摘要翻译: 在一个实施例中,本发明从一个或多个视频中提取感兴趣的视频区域,并且生成高分辨率的视频概要。 基于能量,移动,面部或其他物体检测方法,相关联的数据或外部输入或视频的某些其他特征来提取感兴趣的视频区域。 在另一个实施例中,本发明从图像中提取感兴趣的区域并且产生图像的高度浓缩的视觉摘要。 高度浓缩的视觉总结是通过在画布上布置细菌,然后填充细菌之间的空间来产生的。 结果是类似于具有不同形状的细胞的彩色玻璃窗的视觉总结。 根据期望的图案,大小或某种其他方式,细菌可以通过时间顺序,颜色直方图,相似性布置。 细菌中的人,物和其他视觉内容看起来更大,变得更容易看到。 本发明的视觉总结利用关键帧内的重要区域,导致更加精简的摘要,其非常适合于小屏幕。

    Generating a highly condensed visual summary
    39.
    发明申请
    Generating a highly condensed visual summary 有权
    生成高度浓缩的视觉总结

    公开(公告)号:US20050220345A1

    公开(公告)日:2005-10-06

    申请号:US10815354

    申请日:2004-03-31

    IPC分类号: G06T7/00 G06K9/00 G06K9/66

    摘要: In one embodiment, the present invention extracts video regions of interest from one or more videos and generates a highly condensed visual summary of the videos. The video regions of interest are extracted based on to energy, movement, face or other object detection methods, associated data or external input, or some other feature of the video. In another embodiment, the present invention extracts regions of interest from images and generates highly condensed visual summaries of the images. The highly condensed visual summary is generated by laying out germs on a canvas and then filling the spaces between the germs. The result is a visual summary that resembles a stained glass window having cells of varying shape. The germs may be laid out by temporal order, color histogram, similarity, according to a desired pattern, size, or some other manner. The people, objects and other visual content in the germs appear larger and become easier to see. The visual summary of the present invention utilizes important regions within the key frames, leading to more condensed summaries that are well suitable for small screens.

    摘要翻译: 在一个实施例中,本发明从一个或多个视频中提取感兴趣的视频区域,并且生成高分辨率的视频概要。 基于能量,移动,面部或其他物体检测方法,相关联的数据或外部输入或视频的某些其他特征来提取感兴趣的视频区域。 在另一个实施例中,本发明从图像中提取感兴趣的区域并且产生图像的高度浓缩的视觉摘要。 高度浓缩的视觉总结是通过在画布上布置细菌,然后填充细菌之间的空间来产生的。 结果是类似于具有不同形状的细胞的彩色玻璃窗的视觉总结。 根据期望的图案,大小或某种其他方式,细菌可以通过时间顺序,颜色直方图,相似性布置。 细菌中的人,物和其他视觉内容看起来更大,变得更容易看到。 本发明的视觉总结利用关键帧内的重要区域,导致更加精简的摘要,其非常适合于小屏幕。