发明授权
US5649060A Automatic indexing and aligning of audio and text using speech
recognition
失效
使用语音识别自动索引和对齐音频和文本
- 专利标题: Automatic indexing and aligning of audio and text using speech recognition
- 专利标题(中): 使用语音识别自动索引和对齐音频和文本
-
申请号: US547113申请日: 1995-10-23
-
公开(公告)号: US5649060A公开(公告)日: 1997-07-15
- 发明人: Hamed A. Ellozy , Dimitri Kanevsky , Michelle Y. Kim , David Nahamoo , Michael Alan Picheny , Wlodek Wlodzimierz Zadrozny
- 申请人: Hamed A. Ellozy , Dimitri Kanevsky , Michelle Y. Kim , David Nahamoo , Michael Alan Picheny , Wlodek Wlodzimierz Zadrozny
- 申请人地址: NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: NY Armonk
- 主分类号: G03B31/00
- IPC分类号: G03B31/00 ; G06F17/30 ; G10L15/00 ; G10L15/18 ; G10L15/22 ; G10L15/26 ; G11B27/028 ; G11B27/10 ; G11B27/28 ; H04N5/91 ; G10L9/00
摘要:
A method of automatically aligning a written transcript with speech in video and audio clips. The disclosed technique involves as a basic component an automatic speech recognizer. The automatic speech recognizer decodes speech (recorded on a tape) and produces a file with a decoded text. This decoded text is then matched with the original written transcript via identification of similar words or clusters of words. The results of this matching is an alignment of the speech with the original transcript. The method can be used (a) to create indexing of video clips, (b) for "teleprompting" (i.e. showing the next portion of text when someone is reading from a television screen), or (c) to enhance editing of a text that was dictated to a stenographer or recorded on a tape for its subsequent textual reproduction by a typist.
公开/授权文献
- USD318151S Litter removal tool or the like 公开/授权日:1991-07-09
信息查询