-
公开(公告)号:US20150279390A1
公开(公告)日:2015-10-01
申请号:US14224511
申请日:2014-03-25
申请人: YAHOO! INC.
发明人: Inderjeet Mani
CPC分类号: G10L25/48 , G06F17/30719 , G10L15/18 , G10L15/26
摘要: A multimedia content item is summarized based on its audio track and a desired compression budget. The audio track is extracted and processed by an automatic speech recognizer to obtain a time-aligned text transcript. The text-transcript is partitioned into a plurality of segment sequences. An informativeness score based on a salience score and a diversity score is computed for each of the segments. A coherence score is also computed for the segments in the plurality of sequences. A subsequence of one of the segment sequences that optimizes for informativeness and coherence is selected for generating a new content item summarizing the multimedia content item.
摘要翻译: 基于其音频轨道和期望的压缩预算来总结多媒体内容项目。 音频轨道由自动语音识别器提取和处理,以获得时间对齐的文本记录。 文本转录本被分割成多个片段序列。 针对每个段计算基于显着性分数和多样性分数的信息分数。 还针对多个序列中的片段计算相干分数。 选择优化信息性和一致性的段序列之一的子序列,用于生成总结多媒体内容项的新内容项。