SYSTEMS AND METHODS FOR MANIPULATING ELECTRONIC CONTENT BASED ON SPEECH RECOGNITION
    3.
    发明申请
    SYSTEMS AND METHODS FOR MANIPULATING ELECTRONIC CONTENT BASED ON SPEECH RECOGNITION 审中-公开
    基于语音识别来处理电子内容的系统和方法

    公开(公告)号:US20160182957A1

    公开(公告)日:2016-06-23

    申请号:US15057414

    申请日:2016-03-01

    申请人: AOL Inc.

    摘要: Systems and methods are disclosed for displaying electronic multimedia content to a user. One computer-implemented method for manipulating electronic multimedia content includes generating, using a processor, a speech model and at least one speaker model of an individual speaker. The method further includes receiving electronic media content over a network; extracting an audio track from the electronic media content; and detecting speech segments within the electronic media content based on the speech model. The method further includes detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model.

    摘要翻译: 公开了用于向用户显示电子多媒体内容的系统和方法。 一种用于操纵电子多媒体内容的计算机实现的方法包括使用处理器生成单个扬声器的语音模型和至少一个扬声器模型。 该方法还包括通过网络接收电子媒体内容; 从电子媒体内容中提取音轨; 以及基于所述语音模型检测所述电子媒体内容内的语音段。 该方法还包括检测电子媒体内容内的扬声器段,并且基于至少一个扬声器模型来计算涉及单个扬声器的检测到的扬声器段的概率。