Automated removal of private information

    公开(公告)号:US10747797B2

    公开(公告)日:2020-08-18

    申请号:US15728616

    申请日:2017-10-10

    Abstract: Systems, methods, and media for the automated removal of private information are provided herein. In an example implementation, a method for automatic removal of private information may include: receiving a transcript of communication data; applying a private information rule to the transcript in order to identify private information in the transcript; tagging the identified private information with a tag comprising an identification of the private information; applying a complicate rule to the tagged transcript in order to evaluate a compliance of the transcript with privacy standards; removing the identified private information from the transcript to produce a redacted transaction; and storing the redacted transcript.

    AUTOMATED REMOVAL OF PRIVATE INFORMATION
    12.
    发明申请

    公开(公告)号:US20180089313A1

    公开(公告)日:2018-03-29

    申请号:US15728616

    申请日:2017-10-10

    CPC classification number: G06F16/335

    Abstract: Systems, methods, and media for the automated removal of private information are provided herein. In an example implementation, a method for automatic removal of private information may include: receiving a transcript of communication data; applying a private information rule to the transcript in order to identify private information in the transcript; tagging the identified private information with a tag comprising an identification of the private information; applying a complicate rule to the tagged transcript in order to evaluate a compliance of the transcript with privacy standards; removing the identified private information from the transcript to produce a redacted transaction; and storing the redacted transcript.

    Themes surfacing for communication data analysis

    公开(公告)号:US09697246B1

    公开(公告)日:2017-07-04

    申请号:US14501519

    申请日:2014-09-30

    CPC classification number: G06F17/30616 G06F17/30734

    Abstract: An embodiment of the method of processing communication data to identify one or more themes within the communication data includes identifying terms in a set of communication data, wherein a term is a word or short phrase, and defining relations in the set of communication data based on the terms, wherein the relation is a pair of terms that appear in proximity to one another. The method further includes identifying themes in the set of communication data based on the relations, wherein a theme is a group of one or more relations that have similar meanings, and storing the terms, the relations, and the themes in the database.

    SYSTEM AND METHOD OF TEXT ZONING
    14.
    发明申请

    公开(公告)号:US20220122609A1

    公开(公告)日:2022-04-21

    申请号:US17567491

    申请日:2022-01-03

    Abstract: A method of zoning a transcription of audio data includes separating the transcription of audio data into a plurality of utterances. A that each word in an utterances is a meaning unit boundary is calculated. The utterance is split into two new utterances at a work with a maximum calculated probability. At least one of the two new utterances that is shorter than a maximum utterance threshold is identified as a meaning unit.

    SYSTEM AND METHOD OF TREND IDENTIFICATION

    公开(公告)号:US20210398149A1

    公开(公告)日:2021-12-23

    申请号:US17360025

    申请日:2021-06-28

    Abstract: Improved systems and method as disclosed herein, provide automated analysis tools for more refined trend analysis and evaluation of identified trends. Communication data may be recognized as either audio or textual data which may be processed and analyzed in real-time (as in the case of streaming audio data) or processed at a time apart from the acquisition of the communication data. If the communication data is audio data, then the audio data, may undergo a transcription, which may employ the exemplary technique of large vocabulary continuous speech recognition (LVCSR) or other known speech-to-text algorithms or techniques. Alternatively, the communication data may already be in the form of a transcription or the communication data may have originated as textual data, exemplarily the communication data is from an internet web chat, email, text message, or social media.

    SYSTEM AND METHOD OF TEXT ZONING
    16.
    发明申请

    公开(公告)号:US20200090660A1

    公开(公告)日:2020-03-19

    申请号:US16553451

    申请日:2019-08-28

    Abstract: A method of zoning a transcription of audio data includes separating the transcription of audio data into a plurality of utterances. A that each word in an utterances is a meaning unit boundary is calculated. The utterance is split into two new utterances at a work with a maximum calculated probability. At least one of the two new utterances that is shorter than a maximum utterance threshold is identified as a meaning unit.

    AUTOMATED ONTOLOGY DEVELOPMENT
    17.
    发明申请

    公开(公告)号:US20190325324A1

    公开(公告)日:2019-10-24

    申请号:US16458482

    申请日:2019-07-01

    Abstract: Systems and methods of automated ontology development include a corpus of communication data. The corpus of communication data includes communication data from a plurality of interactions and is processed. A plurality of terms are extracted from the corpus. Each term of the plurality is a plurality of words that identify a single concept within the corpus. An ontology is automatedly generated from the extracted terms.

    System and Method of Trend Identification
    18.
    发明申请
    System and Method of Trend Identification 审中-公开
    趋势识别系统与方法

    公开(公告)号:US20150220946A1

    公开(公告)日:2015-08-06

    申请号:US14610232

    申请日:2015-01-30

    CPC classification number: G06Q30/0202

    Abstract: Improved systems and method as disclosed herein, provide automated analysis tools for more refined trend analysis and evaluation of identified trends. Communication data may be recognized as either audio or textual data which may be processed and analyzed in real-time (as in the case of streaming audio data) or processed at a time apart from the acquisition of the communication data. If the communication data is audio data, then the audio data, may undergo a transcription, which may employ the exemplary technique of large vocabulary continuous speech recognition (LVCSR) or other known speech-to-text algorithms or techniques. Alternatively, the communication data may already be in the form of a transcription or the communication data may have originated as textual data, exemplarily the communication data is from an internet web chat, email, text message, or social media.

    Abstract translation: 改进的系统和方法如本文所公开的,提供用于更精确的趋势分析和鉴定趋势的评估的自动分析工具。 通信数据可以被识别为可以被实时地处理和分析的音频或文本数据(如在流式传输音频数据的情况下)或除了获取通信数据之外的处理。 如果通信数据是音频数据,则音频数据可以经历转录,其可以采用大词汇连续语音识别(LVCSR)或其他已知的语音到文本算法或技术的示例性技术。 或者,通信数据可以已经是转录的形式,或者通信数据可以源自文本数据,例如通信数据来自互联网聊天,电子邮件,文本消息或社交媒体。

    TAGGING RELATIONS WITH N-BEST
    19.
    发明申请
    TAGGING RELATIONS WITH N-BEST 审中-公开
    与N-BEST的标签关系

    公开(公告)号:US20150220618A1

    公开(公告)日:2015-08-06

    申请号:US14608737

    申请日:2015-01-29

    Abstract: Systems, methods, and media for developing ontologies and analyzing communication data are provided herein. In an example implementation, the method includes: identifying terms in in a set of communication data; identifying a list of possible relations of the identified terms; scoring the possible relations according to a set of predefined merits; ranking the possible relations into a list of possible relations in descending order according to their score; and tagging relations in the set of communication data. The relations may be tagged by identifying the possible relations in the communication data in order corresponding with the list of possible relations. The possible relations that have lower rankings that conflict with higher ranking relations are not tagged. The conflicts may be determined by a predefined set of conflict criteria.

    Abstract translation: 本文提供了用于开发本体和分析通信数据的系统,方法和媒体。 在示例实现中,该方法包括:识别一组通信数据中的术语; 确定所识别术语的可能关系的列表; 根据一组预定义的优点对可能的关系进行评分; 根据他们的得分将可能的关系排列成可能的关系列表,降序排列; 并在该组通信数据中标记关系。 可以通过根据可能的关系列表来确定通信数据中的可能关系来标记关系。 与较高排名关系冲突的排名较低的可能关系未被标记。 冲突可以通过预定义的一组冲突标准来确定。

    System and Method of Text Zoning
    20.
    发明申请
    System and Method of Text Zoning 审中-公开
    文本分区系统与方法

    公开(公告)号:US20150066506A1

    公开(公告)日:2015-03-05

    申请号:US14467783

    申请日:2014-08-25

    CPC classification number: G10L15/26 G10L15/04 G10L15/18 G10L15/1822

    Abstract: A method of zoning a transcription of audio data includes separating the transcription of audio data into a plurality of utterances. A that each word in an utterances is a meaning unit boundary is calculated. The utterance is split into two new utterances at a work with a maximum calculated probability. At least one of the two new utterances that is shorter than a maximum utterance threshold is identified as a meaning unit.

    Abstract translation: 对音频数据的转录进行分区的方法包括将音频数据的转录分离成多个话语。 一个话语中的每个单词都是一个意义单位边界的计算。 在最大计算概率的工作中,话语分为两个新的话语。 两个新语句中的至少一个短于最大话语阈值被识别为意义单元。

Patent Agency Ranking