发明授权
- 专利标题: Methods and apparatuses for interactive similarity searching, retrieval and browsing of video
- 专利标题(中): 视频互动相似搜索,检索和浏览的方法和装置
-
申请号: US10859832申请日: 2004-06-03
-
公开(公告)号: US07246314B2公开(公告)日: 2007-07-17
- 发明人: Jonathan T. Foote , Andreas Girgensohn , Lynn Wilcox
- 申请人: Jonathan T. Foote , Andreas Girgensohn , Lynn Wilcox
- 申请人地址: JP Tokyo US CT Stamford
- 专利权人: Fuji Xerox Co., Ltd.,Xerox Corporation
- 当前专利权人: Fuji Xerox Co., Ltd.,Xerox Corporation
- 当前专利权人地址: JP Tokyo US CT Stamford
- 代理机构: Fliesler Meyer LLP
- 主分类号: G06F15/00
- IPC分类号: G06F15/00 ; G06F14/00
摘要:
Methods for interactive selecting video queries consisting of training images from a video for a video similarity search and for displaying the results of the similarity search are disclosed. The user selects a time interval in the video as a query definition of training images for training an image class statistical model. Time intervals can be as short as one frame or consist of disjoint segments or shots. A statistical model of the image class defined by the training images is calculated on-the-fly from feature vectors extracted from transforms of the training images. For each frame in the video, a feature vector is extracted from the transform of the frame, and a similarity measure is calculated using the feature vector and the image class statistical model. The similarity measure is derived from the likelihood of a Gaussian model producing the frame. The similarity is then presented graphically, which allows the time structure of the video to be visualized and browsed. Similarity can be rapidly calculated for other video files as well, which enables content-based retrieval by example. A content-aware video browser featuring interactive similarity measurement is presented. A method for selecting training segments involves mouse click-and-drag operations over a time bar representing the duration of the video; similarity results are displayed as shades in the time bar. Another method involves selecting periodic frames of the video as endpoints for the training segment.