SYSTEMS AND METHODS FOR TEXTUAL CONTENT CREATION FROM SOURCES OF AUDIO THAT CONTAIN SPEECH
    1.
    Invention application
    Status: Under examination (published)

    Publication number: WO2015008162A2

    Publication date: 2015-01-22

    Application number: PCT/IB2014/002304

    Filing date: 2014-07-14

    Abstract: A system and method of creating textual content from audio streams is presented. The system can include a computing device configured to receive audio streams containing speech and to identify the different speakers in the speech. Using speaker diarization, the system splits an audio stream into separate audio streams, each of which is sent separately to a speech-to-text transcriber. Each separate stream contains the speech of only a single speaker, which the speech-to-text transcriber can convert into text more easily. The resulting text streams can be assembled into a transcript of the speech portions of the audio stream, and a web page of the transcript can be published. High-frequency words in the transcript can be tagged in the metadata of the web page to assist search engines and increase the value of the web page.
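
The assembly and tagging steps described in the abstract can be illustrated with a minimal Python sketch. It is not the applicant's implementation: it assumes diarization and speech-to-text have already run and produced per-speaker text segments with start times. The `Segment` type, the function names, the stop-word list, and the use of a `keywords` meta tag are all hypothetical choices made for illustration.

```python
import re
from collections import Counter
from dataclasses import dataclass
from html import escape

# Hypothetical stop-word list; a real system would likely use a fuller one.
STOP_WORDS = {"the", "and", "that", "this", "was", "are", "for", "did", "with"}


@dataclass
class Segment:
    speaker: str  # label assumed to come from the diarization step
    start: float  # seconds from the start of the original audio stream
    text: str     # text assumed to come from the speech-to-text step


def assemble_transcript(segments):
    """Interleave per-speaker text streams into one transcript by start time."""
    ordered = sorted(segments, key=lambda s: s.start)
    return [f"{s.speaker}: {s.text}" for s in ordered]


def high_frequency_words(segments, top_n=5):
    """Return the most frequent non-stop words across all segments."""
    joined = " ".join(s.text for s in segments).lower()
    words = re.findall(r"[a-z']+", joined)
    counts = Counter(w for w in words if len(w) > 2 and w not in STOP_WORDS)
    return [word for word, _ in counts.most_common(top_n)]


def render_page(title, segments, top_n=5):
    """Build a simple HTML page with the frequent words in a meta tag."""
    keywords = ", ".join(high_frequency_words(segments, top_n))
    body = "\n".join(
        f"<p>{escape(line)}</p>" for line in assemble_transcript(segments)
    )
    return (
        "<html><head>"
        f"<title>{escape(title)}</title>"
        f'<meta name="keywords" content="{escape(keywords)}">'
        "</head><body>\n" + body + "\n</body></html>"
    )
```

Calling `render_page("Budget hearing", segments)` on a list of `Segment` values would return the page as a string; how the page is published, and which metadata fields a search engine actually honors, are outside the scope of this sketch.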


    SYSTEMS AND METHODS FOR TEXTUAL CONTENT CREATION FROM SOURCES OF AUDIO THAT CONTAIN SPEECH

    Publication number: WO2015008162A3

    Publication date: 2015-01-22

    Application number: PCT/IB2014/002304

    Filing date: 2014-07-14

    Abstract: A system and method of creating textual content from audio streams is presented. The system can include a computing device configured to receive audio streams containing speech and to identify the different speakers in the speech. Using speaker diarization, the system splits an audio stream into separate audio streams, each of which is sent separately to a speech-to-text transcriber. Each separate stream contains the speech of only a single speaker, which the speech-to-text transcriber can convert into text more easily. The resulting text streams can be assembled into a transcript of the speech portions of the audio stream, and a web page of the transcript can be published. High-frequency words in the transcript can be tagged in the metadata of the web page to assist search engines and increase the value of the web page.
