-
Publication No.: US08085982B1
Publication date: 2011-12-27
Application No.: US12143590
Filing date: 2008-06-20
Applicant: Minyoung Kim, Sanjiv Kumar, Henry A. Rowley
Inventor: Minyoung Kim, Sanjiv Kumar, Henry A. Rowley
CPC classification: G06K9/00261, G06K9/6214, G06K9/6264, G06K9/6277
Abstract: Embodiments of the present invention relate to object tracking in video. In an embodiment, a computer-implemented method tracks an object in a frame of a video. An adaptive term value is determined based on an adaptive model and at least a portion of the frame. A pose constraint value is determined based on a pose model and at least a portion of the frame. An alignment confidence score is determined based on an alignment model and at least a portion of the frame. Based on the adaptive term value, the pose constraint value, and the alignment confidence score, an energy value is determined. Based on the energy value, a resultant tracking state is determined. The resultant tracking state defines a likely position of the object in the frame given the object's likely position in a set of previous frames in the video.
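The abstract above combines three per-frame quantities into one energy value and derives the tracking state from it. A minimal sketch of that idea follows. It assumes that the energy is a weighted sum, that a lower energy is better, and that the state is picked from a finite candidate set; none of this is stated in the abstract. The names `pick_tracking_state`, `State`, and `weights` are hypothetical.

```python
from typing import Callable, Iterable, Tuple

State = Tuple[float, float]  # hypothetical tracking state: an (x, y) position


def pick_tracking_state(
    candidates: Iterable[State],
    adaptive_term: Callable[[State], float],
    pose_constraint: Callable[[State], float],
    alignment_confidence: Callable[[State], float],
    weights: Tuple[float, float, float] = (1.0, 1.0, 1.0),
) -> Tuple[State, float]:
    """Return the candidate state with the lowest combined energy."""
    w_a, w_p, w_c = weights
    best_state, best_energy = None, float("inf")
    for s in candidates:
        # Assumption: a higher alignment confidence should lower the energy,
        # so it enters the sum with a negative sign.
        energy = (
            w_a * adaptive_term(s)
            + w_p * pose_constraint(s)
            - w_c * alignment_confidence(s)
        )
        if energy < best_energy:
            best_state, best_energy = s, energy
    if best_state is None:
        raise ValueError("no candidate states supplied")
    return best_state, best_energy
```

In practice the three callables would wrap the adaptive, pose, and alignment models evaluated on the current frame; here they are left as plain functions so the combination step stands on its own.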
-
Publication No.: US08477998B1
Publication date: 2013-07-02
Application No.: US13309999
Filing date: 2011-12-02
Applicant: Minyoung Kim, Sanjiv Kumar, Henry A. Rowley
Inventor: Minyoung Kim, Sanjiv Kumar, Henry A. Rowley
IPC classification: G06K9/00
CPC classification: G06K9/00261, G06K9/6214, G06K9/6264, G06K9/6277
Abstract: Embodiments of the present invention relate to object tracking in video. In an embodiment, a computer-implemented method tracks an object in a frame of a video. An adaptive term value is determined based on an adaptive model and at least a portion of the frame. A pose constraint value is determined based on a pose model and at least a portion of the frame. An alignment confidence score is determined based on an alignment model and at least a portion of the frame. Based on the adaptive term value, the pose constraint value, and the alignment confidence score, an energy value is determined. Based on the energy value, a resultant tracking state is determined. The resultant tracking state defines a likely position of the object in the frame given the object's likely position in a set of previous frames in the video.
-
Publication No.: US08781231B1
Publication date: 2014-07-15
Application No.: US12547303
Filing date: 2009-08-25
Applicant: Sanjiv Kumar, Henry A. Rowley, Ameesh Makadia
Inventor: Sanjiv Kumar, Henry A. Rowley, Ameesh Makadia
IPC classification: G06K9/54
CPC classification: G06F17/30274, G06F17/30247, G06K9/6215, G06K9/6224, G06K2209/27
Abstract: Methods, systems, and apparatus, including computer program products, for ranking search results for queries. The method includes calculating a visual similarity score for one or more pairs of images in a plurality of images based on visual features of images in each of the one or more pairs; building a graph of images by linking each of one or more images in the plurality of images to one or more nearest neighbor images based on the visual similarity scores; associating a respective score with each of one or more images in the graph based on data indicative of user behavior relative to the image as a search result for a query; and determining a new score for each of one or more images in the graph based on the respective score of the image, and the respective scores of one or more nearest neighbors to the image.
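The steps in the abstract above (pairwise similarity, nearest-neighbor graph, per-image score, neighbor-aware rescoring) can be illustrated with a small sketch. It assumes a precomputed similarity matrix and a simple blend of each image's own score with the mean score of its neighbors. The blend rule, the `alpha` parameter, and the function names are assumptions for illustration, not the patented scoring formula.

```python
from typing import List, Sequence


def nearest_neighbors(similarity: Sequence[Sequence[float]], k: int) -> List[List[int]]:
    """For each image, list the indices of its k most similar other images."""
    n = len(similarity)
    return [
        sorted((j for j in range(n) if j != i), key=lambda j: -similarity[i][j])[:k]
        for i in range(n)
    ]


def propagate_scores(
    scores: Sequence[float], neighbors: Sequence[Sequence[int]], alpha: float = 0.5
) -> List[float]:
    """Blend each image's own score with the mean score of its neighbors.

    alpha is a hypothetical mixing weight: 1.0 keeps the original score,
    0.0 uses only the neighbors' scores.
    """
    new_scores = []
    for i, nbrs in enumerate(neighbors):
        # An image without neighbors simply keeps its own score.
        nbr_mean = sum(scores[j] for j in nbrs) / len(nbrs) if nbrs else scores[i]
        new_scores.append(alpha * scores[i] + (1.0 - alpha) * nbr_mean)
    return new_scores
```

With this blend, an image that users rarely selected but that looks very much like a frequently selected image inherits part of that image's score, which is the effect the abstract describes.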
-
Publication No.: US08843478B1
Publication date: 2014-09-23
Application No.: US13617976
Filing date: 2012-09-14
IPC classification: G06F17/30
CPC classification: G06F17/30247, G06F17/30244, G06F17/30265, G06F17/30274, G06F17/3028, G06F17/30554, G06F17/30598, G06F17/30867
Abstract: This specification relates to presenting image search results. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of receiving an image query, the image query being a query for image search results; receiving ranked image search results responsive to the image query, the image search results each including an identification of a corresponding image resource; generating a similarity matrix for images identified by the image search results; generating a hierarchical grouping of the images using the similarity matrix; identifying a canonical image for each group in the hierarchical grouping using a ranking measure; and presenting a visual representation of the image search results based on the hierarchical grouping and the identified canonical images.
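The grouping and canonical-image steps in the abstract above can be sketched as follows. This is a sketch under stated assumptions: it uses average-linkage agglomerative merging and returns a single cut of the hierarchy at a similarity threshold, whereas the abstract does not name a clustering method. The names `group_images`, `canonical_images`, `threshold`, and `rank_score` are hypothetical.

```python
from typing import List, Sequence


def group_images(similarity: Sequence[Sequence[float]], threshold: float) -> List[List[int]]:
    """Merge the two most similar groups repeatedly (average linkage).

    Merging stops once no pair of groups has a mean pairwise similarity of
    at least `threshold`, which yields one level of the hierarchy.
    """
    groups = [[i] for i in range(len(similarity))]

    def linkage(a: List[int], b: List[int]) -> float:
        return sum(similarity[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(groups) > 1:
        best, (p, q) = max(
            (linkage(groups[p], groups[q]), (p, q))
            for p in range(len(groups))
            for q in range(p + 1, len(groups))
        )
        if best < threshold:
            break
        groups[p] = groups[p] + groups[q]
        del groups[q]
    return groups


def canonical_images(groups: Sequence[Sequence[int]], rank_score: Sequence[float]) -> List[int]:
    """Pick the image with the highest ranking measure in each group."""
    return [max(g, key=lambda i: rank_score[i]) for g in groups]
```

Running `group_images` at several thresholds would give coarser and finer levels of the same hierarchy, and `canonical_images` can then supply the representative thumbnail for each group at each level.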
-
Publication No.: US08611689B1
Publication date: 2013-12-17
Application No.: US12968825
Filing date: 2010-12-15
Applicant: Jay Yagnik, Henry A. Rowley, Sergey Ioffe
Inventor: Jay Yagnik, Henry A. Rowley, Sergey Ioffe
CPC classification: G06K9/00711, H04N21/23418
Abstract: A method and system generate and compare fingerprints for videos in a video library. The video fingerprints provide a compact representation of the spatial and sequential characteristics of the video that can be used to quickly and efficiently identify video content. Because the fingerprints are based on spatial and sequential characteristics rather than exact bit sequences, visual content of videos can be effectively compared even when there are small differences between the videos in compression factors, source resolutions, start and stop times, frame rates, and so on. Comparison of video fingerprints can be used, for example, to search for and remove copyright-protected videos from a video library. Further, duplicate videos can be detected and discarded in order to preserve storage space.
-
Publication No.: US08363949B2
Publication date: 2013-01-29
Application No.: US13345311
Filing date: 2012-01-06
Applicant: Henry A. Rowley, Franz Och, Yang Li
Inventor: Henry A. Rowley, Franz Och, Yang Li
CPC classification: G06F3/017, G06F3/04883, G06F17/276, G06K9/00436, G06K9/033, G06K9/222, G06K2209/01
Abstract: Techniques described herein may recognize handwritten characters that are written at least partially over the top of one another that are input to a computing device. The handwritten characters may be formed of one or more strokes. A user may write characters or parts of words over approximately the same area of a graphical user interface (i.e., on top of each other) without having to wait for a timeout between character input and without having to select a button or provide another input indicating the character is complete before entering input for another character. Once a character is at least partially recognized, a graphical indication corresponding to the user input displayed on a screen may be altered. Such alterations may include fading or changing size or location of the graphical indication.
-
Publication No.: US08094941B1
Publication date: 2012-01-10
Application No.: US13158795
Filing date: 2011-06-13
Applicant: Henry A. Rowley, Franz Och, Yang Li
Inventor: Henry A. Rowley, Franz Och, Yang Li
IPC classification: G06K9/00
CPC classification: G06F3/017, G06F3/04883, G06F17/276, G06K9/00436, G06K9/033, G06K9/222, G06K2209/01
Abstract: Techniques described herein may recognize handwritten characters that are written at least partially over the top of one another that are input to a computing device. The handwritten characters may be formed of one or more strokes. A user may write characters or parts of words over approximately the same area of a graphical user interface (i.e., on top of each other) without having to wait for a timeout between character input and without having to select a button or provide another input indicating the character is complete before entering input for another character. Once a character is at least partially recognized, a graphical indication corresponding to the user input displayed on a screen may be altered. Such alterations may include fading or changing size or location of the graphical indication.
-
Publication No.: US07978882B1
Publication date: 2011-07-12
Application No.: US12785021
Filing date: 2010-05-21
IPC classification: G06K9/00, G06F15/173
CPC classification: G06K9/6293
Abstract: A system identifies an image and determines whether the image contains inappropriate content based on first data associated with the image, second data associated with a document that contains the image or refers to the image, and/or third data associated with a group of documents with which the image is associated.
-
Publication No.: US20150161147A1
Publication date: 2015-06-11
Application No.: US13098362
Filing date: 2011-04-29
Applicant: Ming Zhao, Yang Song, Hartwig Adam, Ullas Gargi, Yushi Jing, Henry A. Rowley
Inventor: Ming Zhao, Yang Song, Hartwig Adam, Ullas Gargi, Yushi Jing, Henry A. Rowley
CPC classification: G06F17/3005, G06F17/10, G06F17/3002, G06F17/30029, G06F17/3079, G06F17/30864, G06F19/00, G06K9/00751, G06K9/629
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for associating still images and videos. One method includes receiving a plurality of images and a plurality of videos and determining whether the images are related to the videos. The determining includes, for an image and a video, extracting features from the image and extracting features from frames of the video, and comparing the features to determine whether the image is related to the video. The method further includes maintaining a data store storing data associating each image with each video determined to be related to the image.
-
Publication No.: US08374400B1
Publication date: 2013-02-12
Application No.: US13460539
Filing date: 2012-04-30
CPC classification: G06K9/6293
Abstract: A system identifies an image and determines whether the image contains inappropriate content based on first data associated with the image, second data associated with a document that contains the image or refers to the image, and/or third data associated with a group of documents with which the image is associated.
-