Data obfuscation of text data using entity detection and replacement
    1.
    Granted patent
    Data obfuscation of text data using entity detection and replacement (lapsed)

    Publication No.: US08649552B2

    Publication date: 2014-02-11

    Application No.: US12061783

    Application date: 2008-04-03

    IPC classification: G06K9/00

    Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities to be obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.
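
    The tag-then-substitute flow in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the `[ENT_…]` token format, and the use of a truncated SHA-256 digest as the hash-table key are all assumptions made for illustration, and the configuration is reduced to a plain collection of word-like entity strings.

```python
import hashlib
import re

def tag_entities(text, entities):
    """Return (start, end) spans of the configured entities found in text.

    Assumes entities are word-like strings, since matching uses \\b boundaries.
    """
    if not entities:
        return []
    # Try longer entities first so "John Smith" is preferred over "John".
    ordered = sorted(entities, key=len, reverse=True)
    pattern = re.compile(r"\b(?:" + "|".join(re.escape(e) for e in ordered) + r")\b")
    return [m.span() for m in pattern.finditer(text)]

def substitute(text, spans, table):
    """Replace each tagged span with a token and record token -> original in table."""
    pieces, pos = [], 0
    for start, end in spans:
        original = text[start:end]
        # Hypothetical token scheme: 8 hex digits of SHA-256. A real system
        # would need to handle collisions between truncated digests.
        digest = hashlib.sha256(original.encode("utf-8")).hexdigest()[:8]
        token = f"[ENT_{digest}]"
        table[token] = original
        pieces.append(text[pos:start])
        pieces.append(token)
        pos = end
    pieces.append(text[pos:])
    return "".join(pieces)
```

    Keeping tagging and substitution as separate steps mirrors the order given in the abstract, and the `table` dictionary plays the role of the hash table from which the obfuscating text is derived.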

    DATA OBFUSCATION OF TEXT DATA USING ENTITY DETECTION AND REPLACEMENT
    2.
    Patent application
    DATA OBFUSCATION OF TEXT DATA USING ENTITY DETECTION AND REPLACEMENT (lapsed)

    Publication No.: US20080118150A1

    Publication date: 2008-05-22

    Application No.: US11562559

    Application date: 2006-11-22

    IPC classification: G06K9/34

    Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities to be obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.

    Data obfuscation of text data using entity detection and replacement
    3.
    Granted patent
    Data obfuscation of text data using entity detection and replacement (lapsed)

    Publication No.: US07724918B2

    Publication date: 2010-05-25

    Application No.: US11562559

    Application date: 2006-11-22

    IPC classification: G06K9/00

    Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities to be obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.
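
    The reverse obfuscation mentioned in the last sentence of this abstract can be sketched as a lookup against the stored table. This is an illustrative sketch only: it assumes obfuscating tokens of the hypothetical form `[ENT_xxxxxxxx]` (eight hex digits) and a table that maps each token back to its original text. Neither detail comes from the patent.

```python
import re

# Hypothetical token format: "[ENT_" + 8 lowercase hex digits + "]".
TOKEN = re.compile(r"\[ENT_[0-9a-f]{8}\]")

def restore(obfuscated, table):
    """Swap each known token back to its original text.

    Tokens that are missing from the table are left untouched, so a
    partially matching table never corrupts the document.
    """
    return TOKEN.sub(lambda m: table.get(m.group(0), m.group(0)), obfuscated)
```

    Restoration only works for a holder of the table, which is why the table has to be stored separately from the obfuscated document.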

    DATA OBFUSCATION OF TEXT DATA USING ENTITY DETECTION AND REPLACEMENT
    4.
    Patent application
    DATA OBFUSCATION OF TEXT DATA USING ENTITY DETECTION AND REPLACEMENT (lapsed)

    Publication No.: US20080181396A1

    Publication date: 2008-07-31

    Application No.: US12061783

    Application date: 2008-04-03

    IPC classification: H04L9/28 G06F17/00

    Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities to be obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.

    System and method for focused re-crawling of web sites
    6.
    Granted patent
    System and method for focused re-crawling of web sites (lapsed)

    Publication No.: US07882099B2

    Publication date: 2011-02-01

    Application No.: US12054482

    Application date: 2008-03-25

    IPC classification: G06F17/30

    Abstract: A method (100) of crawling the Web (620) is disclosed. The method (100) crawls (120) Web pages on the Web starting from a given (110) set of seed Universal Resource Locators (URLs). Crawled Web pages are partitioned (140) into sets of relevant and irrelevant pages. A set of exclusion and/or inclusion patterns is discovered (150) from the sets of relevant and irrelevant pages, and subsequent crawling of the Web is restricted through the set of exclusion and/or inclusion patterns.
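
    One way to picture the pattern-discovery step is the Python sketch below. The abstract does not say what form the patterns take, so this sketch assumes a deliberately simple one: a pattern is a host plus the first path segment, and a prefix becomes an exclusion pattern when it appears at least `min_count` times among irrelevant pages and never among relevant ones. The function names and the threshold are hypothetical.

```python
from urllib.parse import urlparse

def first_segment(url):
    """Return host plus first path segment, e.g. 'example.com/forum'."""
    parts = urlparse(url)
    segments = [s for s in parts.path.split("/") if s]
    return parts.netloc + "/" + (segments[0] if segments else "")

def discover_exclusions(relevant, irrelevant, min_count=2):
    """Find prefixes seen only on irrelevant pages, at least min_count times."""
    relevant_prefixes = {first_segment(u) for u in relevant}
    counts = {}
    for url in irrelevant:
        prefix = first_segment(url)
        counts[prefix] = counts.get(prefix, 0) + 1
    return {p for p, n in counts.items()
            if n >= min_count and p not in relevant_prefixes}

def should_crawl(url, exclusions):
    """Restrict a later crawl: skip URLs that match an exclusion pattern."""
    return first_segment(url) not in exclusions
```

    On a re-crawl, `should_crawl` would be consulted before each fetch, so that sections of a site already found to be irrelevant are not downloaded again.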

    System and a method for focused re-crawling of Web sites
    7.
    Granted patent
    System and a method for focused re-crawling of Web sites (in force)

    Publication No.: US07379932B2

    Publication date: 2008-05-27

    Application No.: US11314432

    Application date: 2005-12-21

    IPC classification: G06F17/30

    Abstract: A method (100) of crawling the Web (620) is disclosed. The method (100) crawls (120) Web pages on the Web starting from a given (110) set of seed Universal Resource Locators (URLs). Crawled Web pages are partitioned (140) into sets of relevant and irrelevant pages. A set of exclusion and/or inclusion patterns is discovered (150) from the sets of relevant and irrelevant pages, and subsequent crawling of the Web is restricted through the set of exclusion and/or inclusion patterns.

    Method and System for Hybrid Call Handling
    8.
    Patent application
    Method and System for Hybrid Call Handling (under examination, published)

    Publication No.: US20080086690A1

    Publication date: 2008-04-10

    Application No.: US11534000

    Application date: 2006-09-21

    IPC classification: G06F15/177

    CPC classification: H04L12/66

    Abstract: The present invention provides a hybrid call handling method and system. The method comprises navigating a plurality of received calls from a plurality of callers. The method further comprises monitoring a call health status for each of the calls being navigated for the entire call duration and notifying a human agent of a bad call health status of a monitored call so that at least one rectification action can be employed. The call health status is determined by monitoring and measuring one or more call parameters. The invention provides a system for call handling and navigation by an automated system, with a human agent assisting the automated system in rectifying calls with a bad call health status. Once a call with a bad health status is transferred to the human agent, the agent assists the automated system either by communicating directly with the caller or by communicating through a machine interface.
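
    The monitor-and-notify loop in this abstract can be sketched in Python as follows. The abstract does not name the call parameters or any thresholds, so the fields of `CallMetrics`, the limits in `health_status`, and the `notify_agent` callback are all hypothetical stand-ins chosen for illustration.

```python
from dataclasses import dataclass

@dataclass
class CallMetrics:
    """Hypothetical per-call measurements; the patent does not list them."""
    call_id: str
    recognition_failures: int   # failed speech-recognition attempts so far
    repeated_prompts: int       # times the same prompt was replayed
    seconds_in_menu: float      # time spent in the current menu

def health_status(m, max_failures=3, max_repeats=2, max_seconds=120.0):
    """Classify a call as 'good' or 'bad' from its measured parameters."""
    bad = (m.recognition_failures >= max_failures
           or m.repeated_prompts > max_repeats
           or m.seconds_in_menu > max_seconds)
    return "bad" if bad else "good"

def monitor(calls, notify_agent):
    """Notify a human agent about every call whose health status is bad."""
    flagged = []
    for m in calls:
        if health_status(m) == "bad":
            notify_agent(m.call_id)
            flagged.append(m.call_id)
    return flagged
```

    In a live system `monitor` would run repeatedly over the calls in progress, since the abstract requires monitoring for the entire call duration rather than a single check.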

    SYSTEM AND METHOD FOR FOCUSED RE-CRAWLING OF WEB SITES
    10.
    Patent application
    SYSTEM AND METHOD FOR FOCUSED RE-CRAWLING OF WEB SITES (lapsed)

    Publication No.: US20080168041A1

    Publication date: 2008-07-10

    Application No.: US12054482

    Application date: 2008-03-25

    IPC classification: G06F17/30

    Abstract: A method (100) of crawling the Web (620) is disclosed. The method (100) crawls (120) Web pages on the Web starting from a given (110) set of seed Universal Resource Locators (URLs). Crawled Web pages are partitioned (140) into sets of relevant and irrelevant pages. A set of exclusion and/or inclusion patterns is discovered (150) from the sets of relevant and irrelevant pages, and subsequent crawling of the Web is restricted through the set of exclusion and/or inclusion patterns.