Granted invention patent
- Patent title: System and method for inserting a description of images into audio recordings
- Patent title (Chinese): 将图像描述插入音频记录的系统和方法
- Application number: US11866495
- Filing date: 2007-10-03
- Publication number: US07996227B2
- Publication date: 2011-08-09
- Inventors: Peter C. Boyle, Yu Zhang
- Applicants: Peter C. Boyle, Yu Zhang
- Applicant address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: US NY Armonk
- Agency: Hoffman Warnick LLC
- Priority: CA2567505 20061109
- Main classification: G10L11/00
- IPC classification: G10L11/00; G10L15/26; G06F17/27; G06K9/72
Abstract:
There is disclosed a system and method for interpreting and describing graphic images. In an embodiment, the method of inserting a description of an image into an audio recording includes: interpreting an image and producing a word description of the image including at least one image keyword; parsing an audio recording into a plurality of audio clips, and producing a transcription of each audio clip, each audio clip transcription including at least one audio keyword; calculating a similarity distance between the at least one image keyword and the at least one audio keyword of each audio clip; and selecting the audio clip transcription having a shortest similarity distance to the at least one image keyword as a location to insert the word description of the image. The word description of the image can then be appended to the selected audio clip to produce an augmented audio recording including the interpreted word description of the image.
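The selection step described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the abstract does not define how the similarity distance is computed, so Jaccard distance over keyword sets is used here purely as an assumption. The names `AudioClip`, `similarity_distance`, `select_clip`, and `insert_description` are hypothetical, and image interpretation and speech transcription are assumed to have already produced the keywords and text passed in.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List


@dataclass
class AudioClip:
    """One parsed audio clip: its transcription and its audio keywords (hypothetical type)."""
    transcription: str
    keywords: FrozenSet[str]


def similarity_distance(image_keywords: Iterable[str], audio_keywords: Iterable[str]) -> float:
    """Jaccard distance between two keyword sets.

    Assumption: the abstract does not specify the metric; Jaccard distance
    is a stand-in. 0.0 means identical sets, 1.0 means no overlap.
    """
    a = {k.lower() for k in image_keywords}
    b = {k.lower() for k in audio_keywords}
    union = a | b
    if not union:
        return 1.0
    return 1.0 - len(a & b) / len(union)


def select_clip(image_keywords: Iterable[str], clips: List[AudioClip]) -> int:
    """Return the index of the clip with the shortest distance (first clip wins ties)."""
    if not clips:
        raise ValueError("at least one audio clip is required")
    image_keywords = list(image_keywords)
    return min(
        range(len(clips)),
        key=lambda i: similarity_distance(image_keywords, clips[i].keywords),
    )


def insert_description(description: str, image_keywords: Iterable[str],
                       clips: List[AudioClip]) -> List[str]:
    """Append the image's word description right after the best-matching clip's text."""
    best = select_clip(image_keywords, clips)
    augmented = []
    for i, clip in enumerate(clips):
        augmented.append(clip.transcription)
        if i == best:
            augmented.append(description)
    return augmented
```

In a real system the returned text sequence would still need to be rendered back to audio (for example by speech synthesis) to obtain the augmented recording; that step is outside this sketch.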