Automated synchronization of video image sequences to new soundtracks
    1.
    Granted patent
    Status: Expired

    Publication No.: US5880788A

    Publication date: 1999-03-09

    Application No.: US620949

    Filing date: 1996-03-25

    Applicant: Christoph Bregler

    Inventor: Christoph Bregler

    Abstract: The synchronization of an existing video to a new soundtrack is carried out through the phonetic analysis of the original soundtrack and the new soundtrack. Individual speech sounds, such as phones, are identified in the soundtrack for the original video recording, and the images corresponding thereto are stored. The new soundtrack is similarly analyzed to identify individual speech sounds, which are used to select the stored images and create a new video sequence. The sequence of images is then smoothly fitted to one another, to provide a video stream that is synchronized to the new soundtrack. This approach permits a given video sequence to be synchronized to any arbitrary utterance. Furthermore, the matching of the video images to the new speech sounds can be carried out in a highly automated manner, thereby reducing required manual effort.
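
    The selection step described in this abstract can be illustrated with a minimal sketch. It assumes the phonetic analysis has already produced a phone label for every frame of the original video; the function names, the ARPAbet-style labels, and the "first stored frame wins" rule are hypothetical choices for illustration, not details taken from the patent. The smoothing step that fits neighbouring images to one another is omitted.

```python
from collections import defaultdict


def build_phone_index(aligned_frames):
    """Map each phone label to the ids of the frames recorded while it was spoken.

    aligned_frames: iterable of (frame_id, phone) pairs from the ORIGINAL video.
    """
    index = defaultdict(list)
    for frame_id, phone in aligned_frames:
        index[phone].append(frame_id)
    return index


def select_frames(index, new_phones):
    """Pick one stored frame per phone of the NEW soundtrack.

    Assumption: use the first stored frame for each phone. A phone that was
    never seen in the original video reuses the previously selected frame,
    and is skipped if nothing has been selected yet.
    """
    selected = []
    for phone in new_phones:
        candidates = index.get(phone)
        if candidates:
            selected.append(candidates[0])
        elif selected:
            selected.append(selected[-1])
    return selected


# Original video: four frames, each labelled with the phone being spoken.
original = [(0, "HH"), (1, "EH"), (2, "L"), (3, "OW")]
index = build_phone_index(original)

# New soundtrack uses the same phones in a different order.
new_sequence = select_frames(index, ["L", "OW", "HH", "EH"])
print(new_sequence)  # [2, 3, 0, 1]
```

    A practical system would choose among several candidate frames per phone (for example, by context) before smoothing; the dictionary lookup above only shows how phone labels can serve as the key that links the two soundtracks.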

    Visual tracking framework
    2.
    Granted patent
    Status: In force

    Publication No.: US08649555B1

    Publication date: 2014-02-11

    Application No.: US12607480

    Filing date: 2009-10-28

    IPC classification: G06K9/00 G06K9/62

    Abstract: A computer program product tangibly embodied in a computer-readable storage medium includes instructions that when executed by a processor perform a method. The method includes identifying a frame of a video sequence, transforming a model into an initial guess for how the region appears in the frame, performing an exhaustive search of the frame, and performing a plurality of optimization procedures, wherein at least one additional model parameter is taken into account as each subsequent optimization procedure is initiated. A system includes a computer readable storage medium, a graphical user interface, an input device, a model for texture and shape of the region, the model generated using the video sequence and stored in the computer readable storage medium, and a solver component.

    Principle component analysis of images for the automatic location of control points
    4.
    Granted patent
    Status: Expired

    Publication No.: US06188776B1

    Publication date: 2001-02-13

    Application No.: US08651108

    Filing date: 1996-05-21

    IPC classification: G06K900

    Abstract: The identification of hidden data, such as feature-based control points in an image, from a set of observable data, such as the image, is achieved through a two-stage approach. The first stage involves a learning process, in which a number of sample data sets, e.g. images, are analyzed to identify the correspondence between observable data, such as visual aspects of the image, and the desired hidden data, such as the control points. Two models are created. A feature appearance-only model is created from aligned examples of the feature in the observed data. In addition, each labeled data set is processed to generate a coupled model of the aligned observed data and the associated hidden data. In the image processing embodiment, these two models might be affine manifold models of an object's appearance and of the coupling between that appearance and a set of locations on the object's surface. In the second stage of the process, the modeled feature is located in an unmarked, unaligned data set, using the feature appearance-only model. This location is used as an alignment point and the coupled model is then applied to the aligned data, giving an estimate of the hidden data values for that data set. In the image processing example, the object's appearance model is compared to different image locations. The matching locations are then used as alignment points for estimating the locations on the object's surface from the appearance in that aligned image and from the coupled model.

    SYSTEM, METHOD AND COMPUTER-ACCESSIBLE MEDIUM FOR PROVIDING BODY SIGNATURE RECOGNITION
    5.
    Patent application
    Status: Under examination (published)

    Publication No.: US20100104018A1

    Publication date: 2010-04-29

    Application No.: US12539306

    Filing date: 2009-08-11

    IPC classification: H04N11/02

    CPC classification: G06K9/00342

    Abstract: Provided and described herein are, e.g., exemplary embodiments of systems, methods, procedures, devices, computer-accessible media, computing arrangements and processing arrangements in accordance with the present disclosure related to body signature recognition and acoustic speaker verification utilizing body language features. For example, certain exemplary embodiments can include a computer-accessible medium containing executable instructions thereon. When one or more computing arrangements execute the instructions, the computing arrangement(s) can be configured to perform certain exemplary procedures, including (i) receiving first information relating to one or more visual features from a video, (ii) determining second information relating to motion vectors as a function of the first information, and (iii) computing a statistical representation of a plurality of frames of the video based on the second information. Further, the computing arrangement(s) can be configured to provide the statistical representation to a display device and/or record the statistical representation on a computer-accessible medium, for example.
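
    Steps (i) to (iii) of this abstract can be sketched roughly as follows. The abstract does not say which visual features or which statistic are used, so the tracked (x, y) point positions, the function names, and the normalized direction histogram below are all assumptions chosen only to make the pipeline concrete.

```python
import math


def motion_vectors(tracks):
    """Step (ii): per-feature displacement between consecutive frames.

    tracks: list of frames; each frame is a list of (x, y) positions, with
    the same feature at the same list index in every frame (assumed input
    for step (i)).
    """
    vectors = []
    for prev, curr in zip(tracks, tracks[1:]):
        vectors.append(
            [(cx - px, cy - py) for (px, py), (cx, cy) in zip(prev, curr)]
        )
    return vectors


def direction_histogram(vectors, bins=4):
    """Step (iii): normalized histogram of motion directions over all frames.

    Assumption: direction-only statistics; zero-length vectors are ignored.
    """
    counts = [0] * bins
    for frame in vectors:
        for dx, dy in frame:
            if dx == 0 and dy == 0:
                continue
            angle = math.atan2(dy, dx) % (2 * math.pi)
            counts[int(angle / (2 * math.pi) * bins) % bins] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else counts


# Two features tracked over three frames: one drifts up-right, one up-left.
tracks = [
    [(0, 0), (10, 0)],
    [(1, 1), (9, 1)],
    [(2, 2), (8, 2)],
]
signature = direction_histogram(motion_vectors(tracks))
print(signature)  # [0.5, 0.5, 0.0, 0.0]
```

    The resulting list is one fixed-length summary of a whole clip, which is the kind of representation that could then be displayed or stored as the abstract describes.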
