Video summarization using audio and visual cues

发明授权

US10134440B2 Video summarization using audio and visual cues 有权

请登陆查看更多内容

专利标题： Video summarization using audio and visual cues
申请号： US13099391

申请日： 2011-05-03
公开(公告)号： US10134440B2

公开(公告)日： 2018-11-20
发明人: Wei Jiang , Alexander C. Loui , Courtenay Cotton
申请人： Wei Jiang , Alexander C. Loui , Courtenay Cotton
申请人地址： US NY Rochester
专利权人： KODAK ALARIS INC.
当前专利权人： KODAK ALARIS INC.
当前专利权人地址： US NY Rochester
代理机构： Hogan Lovells US LLP
主分类号： G11B27/034
IPC分类号： G11B27/034 ; G11B27/11

Video summarization using audio and visual cues

摘要：

A method for producing an audio-visual slideshow for a video sequence having an audio soundtrack and a corresponding video track including a time sequence of image frames, comprising: segmenting the audio soundtrack into a plurality of audio segments; subdividing the audio segments into a sequence of audio frames; determining a corresponding audio classification for each audio frame; automatically selecting a subset of the audio segments responsive to the audio classification for the corresponding audio frames; for each of the selected audio segments automatically analyzing the corresponding image frames to select one or more key image frames; merging the selected audio segments to form an audio summary; forming an audio-visual slideshow by combining the selected key frames with the audio summary, wherein the selected key frames are displayed synchronously with their corresponding audio segment; and storing the audio-visual slideshow in a processor-accessible storage memory.

公开/授权文献

US20120281969A1 VIDEO SUMMARIZATION USING AUDIO AND VISUAL CUES 公开/授权日：2012-11-08

信息查询

Espacenet

IPC分类:

G	物理
G11	信息存储
G11B	基于记录载体和换能器之间的相对运动而实现的信息存储（以不需要通过换能器重现记录值的方式记录测量值的入G01D9/00；利用有机械标记的带子，例如，穿孔纸带或利用单元记录卡，如穿孔卡片或具有磁性标记的卡片的记录或重现设备入G06K；将数据从记录载体的一种类型转移到另一种类型上的入G06K1/18；将重放装置的输出耦合到无线电接收机上去的电路入H04B1/20；唱机拾音器之类的声音机电传感器或为此所用的电路入H04R）
G11B27/00	编辑；索引；寻址；定时或同步；监控；磁带行程的测量
G11B27/02	.编辑，例如，改变记录在记录载体上或从记录载体上重现的信息信号的次序
G11B27/031	..数字模拟信息信号的电子编辑，如音频或视频信号
G11B27/034	...盘上的（G11B27/036，G11B27/038优先）