Publication No.: US07383509B2
Publication date: 2008-06-03
Application No.: US10243220
Filing date: 2002-09-13
IPC classification: G06F3/00
CPC classification: G09B7/00, G09B5/00, G10L15/26, G10L2021/105
Abstract: The present invention provides a system and method for automatically combining image and audio data to create a multimedia presentation. In one embodiment, the system receives audio and image data. The audio data includes a list of events that correspond to points of interest in an audio file, and may also include the audio file or an audio stream itself. The received images are then matched to the audio file or stream using the event times. In one embodiment, the events represent times within the audio file or stream at which a certain feature or characteristic occurs. The audio event list may be processed to remove, sort, predict, or otherwise generate audio events. Image processing may also occur, and may include analyzing images to determine how they match the event list, deleting images, and processing images to incorporate effects. Image effects may include cropping, panning, zooming, and other visual effects.
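The matching step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the names `Slide`, `clean_events`, and `match_images`, the `min_gap` threshold, and the rule "change image at each kept event" are all assumptions made for illustration. The sketch sorts the event times, removes events that fall too close together, and assigns one image to each resulting interval.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Slide:
    image: str    # image path or identifier (hypothetical)
    start: float  # seconds into the audio
    end: float    # seconds into the audio


def clean_events(events: Sequence[float], min_gap: float) -> List[float]:
    """Sort event times and drop any event closer than min_gap to the last kept one."""
    kept: List[float] = []
    for t in sorted(events):
        if not kept or t - kept[-1] >= min_gap:
            kept.append(t)
    return kept


def match_images(images: Sequence[str], events: Sequence[float],
                 duration: float, min_gap: float = 1.0) -> List[Slide]:
    """Assign images, in order, to the intervals between kept event times.

    Assumption: an image change happens at every kept event. Images beyond
    the number of intervals are dropped; if images run out first, the last
    image is held until the audio ends.
    """
    times = [t for t in clean_events(events, min_gap) if 0.0 < t < duration]
    boundaries = [0.0] + times + [duration]
    slides = [Slide(img, start, end)
              for img, start, end in zip(images, boundaries, boundaries[1:])]
    if slides:
        slides[-1].end = duration  # hold the final image to the end of the audio
    return slides


if __name__ == "__main__":
    for slide in match_images(["a.jpg", "b.jpg", "c.jpg"],
                              [12.0, 4.0, 4.3, 30.0], duration=20.0):
        print(slide)
```

In this example the event at 4.3 s is removed because it follows the 4.0 s event by less than `min_gap`, and the event at 30.0 s is ignored because it lies past the end of the 20-second audio. Effects such as cropping, panning, or zooming would be applied per slide and are left out of the sketch.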