Methods for multisource color normalization
    31.
    Granted invention patent
    Methods for multisource color normalization (Lapsed)

    Publication number: US07260258B2

    Publication date: 2007-08-21

    Application number: US10459470

    Filing date: 2003-06-12

    Abstract: A method is provided for improving the consistency of color and brightness across the boundaries of multicamera and/or multidisplay overlapping or nearly overlapping composite images. The method reduces objectionable artifacts at overlapping image seams, adjusts color and brightness/intensity on either side of those seams, applies color correction across the composite image, and reduces color and intensity mismatches. Color is changed gradually across image seams, including outside the overlap regions, so that the seams are less discernible and sharp color changes across a composite or mosaic image are reduced or avoided. The color difference between two source images that form a composite is estimated by looking at color pixels from each source image region, determining the centroid of the cluster that the pixels of each source image form in a color space, and determining the difference between the cluster centroids for overlapping or nearly overlapping source image regions, which is a measure of the vector color difference between such regions. The vector color difference between at least two overlapping source image regions is interpolated or mapped across those images.
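
The centroid-difference idea in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the RGB color space, the linear ramp, the even split of the correction between the two images, and all function names are assumptions made for illustration.

```python
def centroid(pixels):
    """Mean of a list of (r, g, b) tuples: the cluster centroid in RGB space."""
    n = len(pixels)
    return tuple(sum(p[c] for p in pixels) / n for c in range(3))


def color_difference(overlap_a, overlap_b):
    """Vector color difference between the two overlap regions (B minus A)."""
    ca, cb = centroid(overlap_a), centroid(overlap_b)
    return tuple(cb[c] - ca[c] for c in range(3))


def correct_row(row, diff, seam_on_right, sign):
    """Apply half of diff at the seam, ramping linearly to zero at the far edge.

    sign is +1 for image A (moved toward B) and -1 for image B (moved toward A).
    The half-and-half split and the linear ramp are assumptions of this sketch.
    """
    w = len(row)
    out = []
    for x, pixel in enumerate(row):
        t = x / (w - 1) if w > 1 else 1.0
        weight = t if seam_on_right else 1.0 - t
        out.append(tuple(
            min(255.0, max(0.0, pixel[c] + sign * 0.5 * weight * diff[c]))
            for c in range(3)
        ))
    return out


# Image A lies left of the seam, image B lies right of it.
diff = color_difference([(100, 100, 100), (102, 98, 100)],
                        [(110, 100, 90), (112, 98, 90)])
row_a = correct_row([(100, 100, 100)] * 3, diff, seam_on_right=True, sign=+1)
row_b = correct_row([(110, 100, 90)] * 3, diff, seam_on_right=False, sign=-1)
```

With these inputs the two rows meet at the same color at the seam, while the pixels farthest from the seam are left unchanged, which is the gradual change the abstract describes.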

    Method, system and article of manufacture for linking a video to a scanned document
    32.
    Granted invention patent
    Method, system and article of manufacture for linking a video to a scanned document (In force)

    Publication number: US07051271B1

    Publication date: 2006-05-23

    Application number: US09584205

    Filing date: 2000-05-31

    Abstract: Video recordings of meetings and scanned paper documents are natural digital documents that come out of a meeting. These can be placed on the Internet for easy access, with links generated between them by matching scanned documents to a segment of the video referencing the scanned document. Furthermore, annotations made on the paper documents during the meeting can be extracted and used as indexes to the video. An orthonormal transform, such as a Digital Cosine Transform (DCT), is used to compare scanned documents to video frames.
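
As a rough illustration of comparing a scanned page with video frames through an orthonormal transform, the sketch below computes the low-frequency block of the orthonormal 2-D DCT-II for small grayscale images and picks the frame whose coefficients lie closest to the document's. The abstract does not say how coefficients are compared; the Euclidean distance, the number of retained coefficients, and the function names are assumptions.

```python
import math


def dct_signature(image, keep):
    """Low-frequency keep x keep block of the orthonormal 2-D DCT-II.

    image is a list of equal-length rows of grayscale values.
    """
    rows, cols = len(image), len(image[0])
    coeffs = []
    for u in range(keep):
        au = math.sqrt((1 if u == 0 else 2) / rows)
        for v in range(keep):
            av = math.sqrt((1 if v == 0 else 2) / cols)
            total = 0.0
            for y in range(rows):
                cy = math.cos(math.pi * (2 * y + 1) * u / (2 * rows))
                for x in range(cols):
                    cx = math.cos(math.pi * (2 * x + 1) * v / (2 * cols))
                    total += image[y][x] * cy * cx
            coeffs.append(au * av * total)
    return coeffs


def distance(a, b):
    """Euclidean distance between two coefficient vectors (assumed metric)."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def best_matching_frame(document, frames, keep=2):
    """Index of the frame whose DCT signature is closest to the document's."""
    doc_sig = dct_signature(document, keep)
    dists = [distance(doc_sig, dct_signature(f, keep)) for f in frames]
    return min(range(len(frames)), key=dists.__getitem__)


document = [[0, 0, 255, 255]] * 4               # dark left half, bright right half
frames = [
    [[128] * 4] * 4,                            # uniform gray
    [[10, 5, 250, 245]] * 4,                    # noisy view of the document
    [[0] * 4, [0] * 4, [255] * 4, [255] * 4],   # dark top half, bright bottom half
]
match = best_matching_frame(document, frames)
```

Keeping only a few low-frequency coefficients makes the comparison tolerant of small pixel-level noise, which is why the noisy view is selected here rather than the frames with a different layout.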

    Media browser using multimodal analysis
    33.
    Granted invention patent
    Media browser using multimodal analysis (In force)

    Publication number: US06366296B1

    Publication date: 2002-04-02

    Application number: US09151285

    Filing date: 1998-09-11

    Abstract: A media browser, graphical user interface and method for browsing a media file wherein a user selects at least one feature in a media file and is provided with information regarding the existence of the selected feature in the media file. Based on the information, the user can identify and play back portions of interest in a media file. Features in a media file, such as a speaker's identity, applause, silence, motion, or video cuts, are preferably automatically time-wise evaluated in the media file using known methods. Metadata generated based on the time-wise feature evaluation are preferably mapped to confidence score values that represent a probability of a corresponding feature's existence in the media file. Confidence score information is preferably presented graphically to a user as part of a graphical user interface, and is used to interactively browse the media file.
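
One way to picture the mapping from time-wise metadata to confidence scores, and the use of those scores for browsing, is sketched below. The logistic mapping, the threshold, the one-second step, and the function names are illustrative assumptions; the abstract does not specify them.

```python
import math


def to_confidence(raw_scores, midpoint, steepness=1.0):
    """Map raw per-time-step feature scores to values in (0, 1).

    A logistic curve is assumed here purely for illustration.
    """
    return [1.0 / (1.0 + math.exp(-steepness * (s - midpoint)))
            for s in raw_scores]


def segments_above(confidence, threshold, step_seconds=1.0):
    """Return (start, end) times of runs where confidence >= threshold."""
    segments, start = [], None
    for i, c in enumerate(confidence):
        if c >= threshold and start is None:
            start = i
        elif c < threshold and start is not None:
            segments.append((start * step_seconds, i * step_seconds))
            start = None
    if start is not None:
        segments.append((start * step_seconds, len(confidence) * step_seconds))
    return segments


# Hypothetical raw "applause" detector output, one value per second.
confidence = to_confidence([0, 0, 5, 6, 0, 7], midpoint=3)
portions = segments_above(confidence, threshold=0.5)
```

The returned intervals are the portions a browser could highlight or jump to when the user selects the corresponding feature.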

    System and method for detecting and ranking images in order of usefulness based on vignette score
    34.
    Granted invention patent
    System and method for detecting and ranking images in order of usefulness based on vignette score (In force)

    Publication number: US07492921B2

    Publication date: 2009-02-17

    Application number: US11032576

    Filing date: 2005-01-10

    CPC classification number: G06F17/30247

    Abstract: A system and method for detecting useful images and for ranking images in order of usefulness based on a vignette score describing how closely each one resembles a “vignette,” or a central object or image surrounded by a featureless or deemphasized background. Several methods for determining an image's vignette score are disclosed as examples. Variance ratio analysis entails calculation of the ratio of variance between the edge region of the image and the entire image. Statistical model analysis entails developing a statistical classifier capable of determining a statistical model of each image class based on pre-entered training data. Spatial frequency analysis involves estimating the energy at different spatial frequencies in the central and edge regions and in the image as a whole. A vignette score is calculated as the ratio of mid-frequency energies in the edge region to the mid-frequency energies of the entire image.
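
The variance ratio analysis mentioned in the abstract can be sketched as follows. The border width, the convention that a lower ratio means a more vignette-like image, the handling of a completely flat image, and the function names are assumptions of this sketch rather than details taken from the patent.

```python
def variance(values):
    """Population variance of a list of numbers."""
    n = len(values)
    mean = sum(values) / n
    return sum((v - mean) ** 2 for v in values) / n


def variance_ratio(image, border=1):
    """Variance of the edge region divided by variance of the whole image.

    image is a list of equal-length rows of grayscale values.
    """
    rows, cols = len(image), len(image[0])
    edge = [image[y][x]
            for y in range(rows) for x in range(cols)
            if y < border or y >= rows - border
            or x < border or x >= cols - border]
    whole = [v for row in image for v in row]
    total = variance(whole)
    # A flat image has no defined ratio; 1.0 is an arbitrary convention here.
    return variance(edge) / total if total > 0 else 1.0


def rank_by_vignette(images):
    """Image indices ordered from most to least vignette-like (assumed order)."""
    return sorted(range(len(images)), key=lambda i: variance_ratio(images[i]))


busy = [[255 * ((x + y) % 2) for x in range(4)] for y in range(4)]
vignette = [[0, 0, 0, 0],
            [0, 50, 200, 0],
            [0, 200, 50, 0],
            [0, 0, 0, 0]]
order = rank_by_vignette([busy, vignette])
```

An image with a featureless border and a detailed center yields a ratio near zero, whereas an image that is equally busy everywhere yields a ratio near one.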

    Method, apparatus, and system for remotely annotating a target
    35.
    Granted invention patent
    Method, apparatus, and system for remotely annotating a target (In force)

    Publication number: US07333135B2

    Publication date: 2008-02-19

    Application number: US10271133

    Filing date: 2002-10-15

    CPC classification number: H04N7/18

    Abstract: A system, method and apparatus for remotely annotating an object. An embodiment of the present invention includes a video camera projector that captures video images of a local object and projects annotations made by a user at a remote location onto said local object.

    Summarization of digital files
    36.
    Granted invention patent
    Summarization of digital files (In force)

    Publication number: US07284004B2

    Publication date: 2007-10-16

    Application number: US10271407

    Filing date: 2002-10-15

    Abstract: Embodiments of the present invention provide a method for producing a summary of a digital file on one or more computers. The method includes segmenting the digital file into a plurality of segments, clustering said segments into a plurality of clusters and selecting a cluster from said plurality of clusters wherein said selected cluster includes segments representative of said digital file. Upon selection of a cluster, a segment of the cluster is provided as a summary of said digital file.
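
The segment, cluster, select, and summarize steps can be sketched in a few lines. Everything specific here is an assumption for illustration: each segment is reduced to a single hypothetical feature value, clustering is a simple greedy split on sorted values, the largest cluster is treated as the representative one, and the segment nearest the cluster mean is returned.

```python
def cluster_segments(features, max_gap):
    """Greedy 1-D clustering of segment indices.

    Segments are sorted by feature value and a new cluster starts wherever
    two neighbours differ by more than max_gap. features must be non-empty.
    """
    order = sorted(range(len(features)), key=lambda i: features[i])
    clusters = [[order[0]]]
    for prev, cur in zip(order, order[1:]):
        if features[cur] - features[prev] > max_gap:
            clusters.append([cur])
        else:
            clusters[-1].append(cur)
    return clusters


def summarize(features, max_gap):
    """Index of the segment chosen as the summary.

    Assumes the largest cluster is the representative one and picks the
    segment closest to that cluster's mean feature value.
    """
    chosen = max(cluster_segments(features, max_gap), key=len)
    mean = sum(features[i] for i in chosen) / len(chosen)
    return min(chosen, key=lambda i: abs(features[i] - mean))


# Hypothetical per-segment feature values for a five-segment file.
segment_features = [0.1, 0.9, 0.15, 0.2, 0.95]
summary_index = summarize(segment_features, max_gap=0.2)
```

In this example segments 0, 2, and 3 form the largest cluster, and segment 2 sits closest to its mean, so it is returned as the summary.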

    Method for automatically producing optimal summaries of linear media

    Publication number: US07068723B2

    Publication date: 2006-06-27

    Application number: US10086817

    Filing date: 2002-02-28

    Abstract: Optimal summaries of a linear media source are automatically produced by parameterizing a linear media source. The parameterized linear media source is used to create a similarity array in which each array element includes the value of a similarity measurement between two portions of the parameterized media signal. A segment fitness function, adapted for measuring the similarity between a segment of the parameterized media signal and the entire parameterized media signal, is optimized to find an optimal segment location. The portion of the linear media source corresponding to the optimal segment location is selected as the optimal summary. This method produces optimal summaries of any type of linear media, such as video, audio, or text information.
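
A minimal sketch of the similarity array and segment fitness function follows. The scalar per-frame parameterization, the similarity measure 1 / (1 + |a - b|), the fixed summary length, and the exhaustive search are assumptions chosen for illustration, not details from the patent.

```python
def similarity_array(features):
    """S[i][j] is the similarity between portions i and j of the signal.

    The measure 1 / (1 + |a - b|) is an assumed stand-in.
    """
    return [[1.0 / (1.0 + abs(a - b)) for b in features] for a in features]


def best_summary_start(similarity, length):
    """Start index and fitness of the best segment of the given length.

    Fitness is the average similarity between the segment's portions and
    every portion of the whole signal; all start positions are tried.
    """
    n = len(similarity)
    row_sums = [sum(row) for row in similarity]
    best_start, best_score = 0, float("-inf")
    for start in range(n - length + 1):
        score = sum(row_sums[start:start + length]) / (length * n)
        if score > best_score:
            best_start, best_score = start, score
    return best_start, best_score


# Hypothetical one-value-per-frame parameterization of a short clip.
frame_features = [0, 0, 5, 5, 5, 5, 9]
start, fitness = best_summary_start(similarity_array(frame_features), length=2)
```

The search settles on the first window inside the long middle run, because that segment is on average most similar to the clip as a whole.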