Automatic indexing and aligning of audio and text using speech recognition

发明授权

US5649060A Automatic indexing and aligning of audio and text using speech recognition 失效

标题翻译：使用语音识别自动索引和对齐音频和文本

请登陆查看更多内容

专利标题： Automatic indexing and aligning of audio and text using speech recognition
专利标题（中）： 使用语音识别自动索引和对齐音频和文本
申请号： US547113

申请日： 1995-10-23
公开(公告)号： US5649060A

公开(公告)日： 1997-07-15
发明人: Hamed A. Ellozy , Dimitri Kanevsky , Michelle Y. Kim , David Nahamoo , Michael Alan Picheny , Wlodek Wlodzimierz Zadrozny
申请人： Hamed A. Ellozy , Dimitri Kanevsky , Michelle Y. Kim , David Nahamoo , Michael Alan Picheny , Wlodek Wlodzimierz Zadrozny
申请人地址： NY Armonk
专利权人： International Business Machines Corporation
当前专利权人： International Business Machines Corporation
当前专利权人地址： NY Armonk
主分类号： G03B31/00
IPC分类号： G03B31/00 ; G06F17/30 ; G10L15/00 ; G10L15/18 ; G10L15/22 ; G10L15/26 ; G11B27/028 ; G11B27/10 ; G11B27/28 ; H04N5/91 ; G10L9/00

Automatic indexing and aligning of audio and text using speech
recognition

摘要：

A method of automatically aligning a written transcript with speech in video and audio clips. The disclosed technique involves as a basic component an automatic speech recognizer. The automatic speech recognizer decodes speech (recorded on a tape) and produces a file with a decoded text. This decoded text is then matched with the original written transcript via identification of similar words or clusters of words. The results of this matching is an alignment of the speech with the original transcript. The method can be used (a) to create indexing of video clips, (b) for "teleprompting" (i.e. showing the next portion of text when someone is reading from a television screen), or (c) to enhance editing of a text that was dictated to a stenographer or recorded on a tape for its subsequent textual reproduction by a typist.

摘要（中）：

自动将书面誊本与视频和音频剪辑中的语音对齐的方法。所公开的技术涉及作为自动语音识别器的基本组件。自动语音识别器解码语音（记录在磁带上）并产生具有解码文本的文件。然后，通过识别类似的单词或单词集合，将该解码的文本与原始的书面记录相匹配。这种匹配的结果是语音与原始誊本的一致。该方法可用于（a）创建视频剪辑的索引，（b）“电视提示”（即，当有人从电视屏幕读取时显示文本的下一部分），或（c）增强文本的编辑这是由速记员决定的，或者录制在磁带上，以便打字员随后进行文字复制。

公开/授权文献

USD318151S Litter removal tool or the like 公开/授权日：1991-07-09

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G03	摄影术；电影术；利用了光波以外其他波的类似技术；电记录术；全息摄影术
G03B	摄影、放映或观看用的装置或设备；利用了光波以外其他波的类似技术的装置或设备；以及有关的附件（这些装置的光学部分入G02B；照相用的感光材料或加工方法入G03C；加工曝光后的照相材料的设备入G03D）
G03B31/00	摄影（照相）机或放映机同录音机或放音机的协同工作