Automatic collection of speaker name pronunciations
    1.
    Granted patent
    Automatic collection of speaker name pronunciations (status: in force)

    Publication number: US09240181B2

    Publication date: 2016-01-19

    Application number: US13970850

    Filing date: 2013-08-20

    Abstract: An audio stream is segmented into a plurality of time segments using speaker segmentation and recognition (SSR), with each time segment corresponding to the speaker's name, producing an SSR transcript. The audio stream is transcribed into a plurality of word regions using automatic speech recognition (ASR), with each of the word regions having a measure of the confidence in the accuracy of the translation, producing an ASR transcript. Word regions with a relatively low confidence in the accuracy of the translation are identified. The low confidence regions are filtered using named entity recognition (NER) rules to identify low confidence regions that are likely names. The NER rules associate a region that is identified as a likely name with the name of the speaker corresponding to the current, the previous, or the next time segment. All of the likely name regions associated with that speaker's name are selected.
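
The pipeline in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Segment` and `Word` types, the `CUES` table, the 0.5 confidence threshold, and the function names are all assumptions made for illustration. The cue phrases stand in for the NER rules, each mapping a low-confidence word region to the speaker of the current, previous, or next SSR time segment.

```python
from bisect import bisect_right
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Segment:
    """One SSR time segment labelled with a speaker name."""
    start: float
    end: float
    speaker: str


@dataclass
class Word:
    """One ASR word region with its confidence score."""
    start: float
    end: float
    text: str
    confidence: float


# Hypothetical NER rules: cue words immediately before a low-confidence
# region, mapped to a segment offset (0 current, -1 previous, +1 next).
CUES = {
    ("my", "name", "is"): 0,
    ("thank", "you"): -1,
    ("over", "to"): +1,
}


def segment_index(segments, t):
    """Index of the segment containing time t (segments sorted by start)."""
    starts = [s.start for s in segments]
    return max(bisect_right(starts, t) - 1, 0)


def collect_name_regions(segments, words, threshold=0.5):
    """Return {speaker name: [(start, end), ...]} of likely-name regions."""
    regions = defaultdict(list)
    for i, word in enumerate(words):
        if word.confidence >= threshold:
            continue  # only low-confidence regions are candidates
        for cue, offset in CUES.items():
            preceding = tuple(
                w.text.lower() for w in words[max(i - len(cue), 0):i]
            )
            if preceding != cue:
                continue
            j = segment_index(segments, word.start) + offset
            if 0 <= j < len(segments):
                regions[segments[j].speaker].append((word.start, word.end))
    return dict(regions)
```

Calling `collect_name_regions(segments, words)` yields, for each speaker name, the time spans of the audio where that name was probably spoken; those spans could then be cut from the stream as pronunciation samples. A low-confidence region with no matching cue is dropped, which mirrors the filtering step in the abstract.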

    SYSTEM AND METHOD FOR DETERMINATION OF AN INTERACTION MAP
    2.
    Patent application
    SYSTEM AND METHOD FOR DETERMINATION OF AN INTERACTION MAP (status: in force)

    Publication number: US20150012844A1

    Publication date: 2015-01-08

    Application number: US13936690

    Filing date: 2013-07-08

    CPC classification number: H04L65/403 H04L65/1096

    Abstract: An example method is provided and includes receiving recorded meeting information, selecting a meeting participant from the recorded meeting information, determining at least one of meeting participant emotion information, meeting participant speaker role information, or meeting participant engagement information based, at least in part, on the meeting information, and determining an interaction map associated with the meeting participant based, at least in part, on at least one of the meeting participant emotion information, the meeting participant speaker role information, or the meeting participant engagement information.

    Crowd Sourcing Audio Transcription Via Re-Speaking
    3.
    Patent application
    Crowd Sourcing Audio Transcription Via Re-Speaking (status: in force)

    Publication number: US20150199966A1

    Publication date: 2015-07-16

    Application number: US14156032

    Filing date: 2014-01-15

    Abstract: Speech audio that is intended for transcription into textual form is received. The received speech audio is divided into first speech segments. A plurality of speakers is identified. A speaker is configured for repeating in spoken form a first speech segment that the speaker has listened to. A subset of speakers is determined for sending each first speech segment. Each first speech segment is sent to the subset of speakers determined for the particular first speech segment. The second speech segments are received from the speakers. The second speech segment is a re-spoken version of a first speech segment that has been generated by a speaker by repeating in spoken form the first speech segment. The second speech segments are processed to generate partial transcripts. The partial transcripts are combined to generate a complete transcript that is a textual representation corresponding to the received speech audio.
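
The flow described in this abstract (divide, assign, collect, combine) can be sketched in Python as below. This is only an illustration under stated assumptions: the round-robin assignment, the majority vote used to merge partial transcripts, and all function names are hypothetical choices, not the method claimed in the application. Recognition of the re-spoken audio is left out; the partial transcripts are taken as already-recognized strings.

```python
from collections import Counter
from itertools import cycle, islice


def split_audio(samples, segment_len):
    """Divide the received audio samples into fixed-length first segments."""
    return [
        samples[i:i + segment_len]
        for i in range(0, len(samples), segment_len)
    ]


def assign_speakers(num_segments, speakers, per_segment=2):
    """Pick a subset of re-speakers for each segment, round-robin (assumed)."""
    per_segment = min(per_segment, len(speakers))
    pool = cycle(speakers)
    return {i: list(islice(pool, per_segment)) for i in range(num_segments)}


def combine(partials_by_segment):
    """Merge partial transcripts: most common text per segment, in order."""
    chosen = []
    for index in sorted(partials_by_segment):
        text, _count = Counter(partials_by_segment[index]).most_common(1)[0]
        chosen.append(text)
    return " ".join(chosen)
```

Sending each segment to more than one re-speaker is what makes the vote in `combine` possible; with a single re-speaker per segment, the function simply concatenates the partial transcripts in segment order.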

    Automatic Collection of Speaker Name Pronunciations
    4.
    Patent application
    Automatic Collection of Speaker Name Pronunciations (status: in force)

    Publication number: US20150058005A1

    Publication date: 2015-02-26

    Application number: US13970850

    Filing date: 2013-08-20

    Abstract: An audio stream is segmented into a plurality of time segments using speaker segmentation and recognition (SSR), with each time segment corresponding to the speaker's name, producing an SSR transcript. The audio stream is transcribed into a plurality of word regions using automatic speech recognition (ASR), with each of the word regions having a measure of the confidence in the accuracy of the translation, producing an ASR transcript. Word regions with a relatively low confidence in the accuracy of the translation are identified. The low confidence regions are filtered using named entity recognition (NER) rules to identify low confidence regions that are likely names. The NER rules associate a region that is identified as a likely name with the name of the speaker corresponding to the current, the previous, or the next time segment. All of the likely name regions associated with that speaker's name are selected.

    SYSTEM FOR GENERATING MEANINGFUL TOPIC LABELS AND IMPROVING AUTOMATIC TOPIC SEGMENTATION
    5.
    Patent application
    SYSTEM FOR GENERATING MEANINGFUL TOPIC LABELS AND IMPROVING AUTOMATIC TOPIC SEGMENTATION (status: under examination, published)

    Publication number: US20140325335A1

    Publication date: 2014-10-30

    Application number: US13870467

    Filing date: 2013-04-25

    CPC classification number: G06F17/2745 G06F16/685 G06F16/7834 G06F17/2241

    Abstract: In one embodiment, a method includes obtaining a text representation, and identifying a current topic structure for the text representation. The first topic structure is initially identified as an initial first topic structure. The method also includes identifying at least a first document that has a first document topic structure that is similar to the current first topic structure, refining the current first topic structure based on the first document topic structure, and introducing topic labels in the text representation based on the current first topic structure.

    Crowd sourcing audio transcription via re-speaking
    6.
    Granted patent
    Crowd sourcing audio transcription via re-speaking (status: in force)

    Publication number: US09418660B2

    Publication date: 2016-08-16

    Application number: US14156032

    Filing date: 2014-01-15

    Abstract: Speech audio that is intended for transcription into textual form is received. The received speech audio is divided into first speech segments. A plurality of speakers is identified. A speaker is configured for repeating in spoken form a first speech segment that the speaker has listened to. A subset of speakers is determined for sending each first speech segment. Each first speech segment is sent to the subset of speakers determined for the particular first speech segment. The second speech segments are received from the speakers. The second speech segment is a re-spoken version of a first speech segment that has been generated by a speaker by repeating in spoken form the first speech segment. The second speech segments are processed to generate partial transcripts. The partial transcripts are combined to generate a complete transcript that is a textual representation corresponding to the received speech audio.

    System and method for determination of an interaction map
    7.
    Granted patent
    System and method for determination of an interaction map (status: in force)

    Publication number: US09338199B2

    Publication date: 2016-05-10

    Application number: US13936690

    Filing date: 2013-07-08

    CPC classification number: H04L65/403 H04L65/1096

    Abstract: An example method is provided and includes receiving recorded meeting information, selecting a meeting participant from the recorded meeting information, determining at least one of meeting participant emotion information, meeting participant speaker role information, or meeting participant engagement information based, at least in part, on the meeting information, and determining an interaction map associated with the meeting participant based, at least in part, on at least one of the meeting participant emotion information, the meeting participant speaker role information, or the meeting participant engagement information.
