-
1.
Publication No.: US08649552B2
Publication Date: 2014-02-11
Application No.: US12061783
Filing Date: 2008-04-03
IPC Classification: G06K9/00
CPC Classification: G06F17/2276, G06F21/62, G06F21/6209
Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities desired to be obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.
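The tag, substitute, and restore flow described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names `obfuscate` and `restore`, the `ENT_` token prefix, and the truncated SHA-256 digest are all assumptions. The abstract only states that the obfuscating text can be derived from a hash table and that the table allows the original data to be restored.

```python
import hashlib
import re


def obfuscate(text, entities, table=None):
    """Replace each configured entity with a token and record token -> original."""
    table = {} if table is None else table
    if not entities:
        return text, table
    # Try longer entities first so multi-word entities win over their substrings.
    ordered = sorted(entities, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(e) for e in ordered) + r")\b")

    def substitute(match):
        original = match.group(0)
        # Hypothetical token scheme: a short digest of the original text.
        digest = hashlib.sha256(original.encode("utf-8")).hexdigest()[:8]
        token = "ENT_" + digest
        table[token] = original
        return token

    return pattern.sub(substitute, text), table


def restore(text, table):
    """Reverse the obfuscation by looking each token up in the table."""
    pattern = re.compile(r"\bENT_[0-9a-f]{8}\b")
    return pattern.sub(lambda m: table.get(m.group(0), m.group(0)), text)


document = "Alice Smith works at IBM."
obfuscated, mapping = obfuscate(document, {"Alice Smith", "IBM"})
recovered = restore(obfuscated, mapping)
```

Because the same entity always maps to the same token, repeated mentions stay consistent in the obfuscated document. Truncating the digest to eight hex characters keeps tokens short, at the cost of a small collision risk that a real system would need to handle.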
-
2.
Publication No.: US20080118150A1
Publication Date: 2008-05-22
Application No.: US11562559
Filing Date: 2006-11-22
IPC Classification: G06K9/34
CPC Classification: G06F17/2276, G06F21/62, G06F21/6209
Abstract: Data obfuscation of text data using entity detection and replacement. A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities desired to be obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.
-
3.
Publication No.: US07724918B2
Publication Date: 2010-05-25
Application No.: US11562559
Filing Date: 2006-11-22
IPC Classification: G06K9/00
CPC Classification: G06F17/2276, G06F21/62, G06F21/6209
Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities desired to be obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.
-
4.
Publication No.: US20080181396A1
Publication Date: 2008-07-31
Application No.: US12061783
Filing Date: 2008-04-03
CPC Classification: G06F17/2276, G06F21/62, G06F21/6209
Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities desired to be obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.
-
Publication No.: US07464031B2
Publication Date: 2008-12-09
Application No.: US10724536
Filing Date: 2003-11-28
Applicant: Scott E. Axelrod, Sreeram Viswanath Balakrishnan, Stanley F. Chen, Yuging Gao, Ramesh A. Gopinath, Hong-Kwang Kuo, Benoit Maison, David Nahamoo, Michael Alan Picheny, George A. Saon, Geoffrey G. Zweig
Inventor: Scott E. Axelrod, Sreeram Viswanath Balakrishnan, Stanley F. Chen, Yuging Gao, Ramesh A. Gopinath, Hong-Kwang Kuo, Benoit Maison, David Nahamoo, Michael Alan Picheny, George A. Saon, Geoffrey G. Zweig
CPC Classification: G10L15/063, G10L15/02, G10L15/14, G10L2015/085
Abstract: In a speech recognition system, the combination of a log-linear model with a multitude of speech features is provided to recognize unknown speech utterances. The speech recognition system models the posterior probability of linguistic units relevant to speech recognition using a log-linear model. The posterior model captures the probability of the linguistic unit given the observed speech features and the parameters of the posterior model. The posterior model may be determined using the probability of the word sequence hypotheses given a multitude of speech features. Log-linear models are used with features derived from sparse or incomplete data. The speech features that are utilized may include asynchronous, overlapping, and statistically non-independent speech features. Not all features used in training need to appear in testing/recognition.
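A log-linear posterior of the kind the abstract describes can be illustrated with a short sketch. Everything concrete here is an assumption made for illustration: the function name `posterior`, the feature names, and the weight values do not come from the patent. The sketch only shows the general form, in which each candidate unit's score is a weighted sum of the features that were actually observed, normalized across candidates, so a feature seen in training but absent at recognition time contributes nothing.

```python
import math


def posterior(candidates, observed, weights):
    """Return P(unit | observed features) under a log-linear model.

    weights maps (unit, feature_name) -> weight; observed maps
    feature_name -> value. Missing weights default to zero.
    """
    scores = {}
    for unit in candidates:
        # Only features present in this observation contribute to the score.
        scores[unit] = sum(
            weights.get((unit, name), 0.0) * value
            for name, value in observed.items()
        )
    # Normalize with a softmax; subtracting the maximum avoids overflow.
    top = max(scores.values())
    exp_scores = {unit: math.exp(score - top) for unit, score in scores.items()}
    total = sum(exp_scores.values())
    return {unit: value / total for unit, value in exp_scores.items()}


# Hypothetical weights, as if learned in training.
weights = {
    ("yes", "pitch_rise"): 1.0,
    ("no", "pitch_rise"): -1.0,
    ("yes", "energy"): 0.5,  # trained on, but not observed below
}
probs = posterior(["yes", "no"], {"pitch_rise": 1.0}, weights)
```

The features are not required to be independent or to cover the same time span, which is the property the abstract highlights when it mentions asynchronous and overlapping features.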
-
Publication No.: US07882099B2
Publication Date: 2011-02-01
Application No.: US12054482
Filing Date: 2008-03-25
IPC Classification: G06F17/30
CPC Classification: G06F17/30864, Y10S707/99931, Y10S707/99932, Y10S707/99933, Y10S707/99935
Abstract: A method (100) of crawling the Web (620) is disclosed. The method (100) crawls (120) Web pages on the Web starting from a given (110) set of seed Universal Resource Locators (URLs). Crawled Web pages are partitioned (140) into sets of relevant and irrelevant pages. A set of exclusion and/or inclusion patterns is discovered (150) from the sets of relevant and irrelevant pages, and subsequent crawling of the Web is restricted through the set of exclusion and/or inclusion patterns.
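The crawl, partition, and restrict loop in the abstract can be sketched over an in-memory link graph. This is an illustration under stated assumptions, not the patented method: the pattern definition (host plus first path segment), the helper names `first_segment`, `discover_exclusions`, and `crawl`, and the example URLs are all hypothetical, and the relevant/irrelevant partition is supplied by hand rather than computed by a classifier.

```python
from collections import deque
from urllib.parse import urlparse


def first_segment(url):
    """Return host plus first path segment, used here as a crude URL pattern."""
    parts = urlparse(url)
    segment = parts.path.strip("/").split("/")[0]
    return f"{parts.netloc}/{segment}"


def discover_exclusions(relevant, irrelevant):
    """Patterns seen among irrelevant pages and never among relevant ones."""
    bad = {first_segment(url) for url in irrelevant}
    good = {first_segment(url) for url in relevant}
    return bad - good


def crawl(seeds, links, excluded=frozenset(), limit=100):
    """Breadth-first crawl of an in-memory link graph, skipping excluded patterns."""
    seen, queue, visited = set(seeds), deque(seeds), []
    while queue and len(visited) < limit:
        url = queue.popleft()
        if first_segment(url) in excluded:
            continue  # restricted: neither visited nor expanded
        visited.append(url)
        for target in links.get(url, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return visited


# Hypothetical link graph standing in for the Web.
links = {
    "http://a.com/": ["http://a.com/docs/1", "http://a.com/ads/1"],
    "http://a.com/ads/1": ["http://a.com/ads/2"],
}
first_pass = crawl(["http://a.com/"], links)
exclusions = discover_exclusions(
    relevant=["http://a.com/docs/1"],
    irrelevant=["http://a.com/ads/1"],
)
second_pass = crawl(["http://a.com/"], links, exclusions)
```

Skipping an excluded page before expanding its links means that whole subtrees reachable only through irrelevant pages are never fetched, which is where the saving in a restricted crawl would come from.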
-
Publication No.: US07379932B2
Publication Date: 2008-05-27
Application No.: US11314432
Filing Date: 2005-12-21
IPC Classification: G06F17/30
CPC Classification: G06F17/30864, Y10S707/99931, Y10S707/99932, Y10S707/99933, Y10S707/99935
Abstract: A method (100) of crawling the Web (620) is disclosed. The method (100) crawls (120) Web pages on the Web starting from a given (110) set of seed Universal Resource Locators (URLs). Crawled Web pages are partitioned (140) into sets of relevant and irrelevant pages. A set of exclusion and/or inclusion patterns is discovered (150) from the sets of relevant and irrelevant pages, and subsequent crawling of the Web is restricted through the set of exclusion and/or inclusion patterns.
-
Publication No.: US20080086690A1
Publication Date: 2008-04-10
Application No.: US11534000
Filing Date: 2006-09-21
IPC Classification: G06F15/177
CPC Classification: H04L12/66
Abstract: The present invention provides a hybrid call handling method and system. The method comprises navigating a plurality of received calls from a plurality of callers. The method further comprises monitoring a call health status for each of the plurality of calls being navigated for the entire call duration and notifying a human agent of a bad call health status of the monitored call so that at least one rectification action can be employed. The call health status is determined by monitoring and measuring one or more call parameters. The invention provides a system for call handling and navigation by an automated system, with a human agent assisting the automated system in rectifying calls with bad call health status. Once a call with bad health status is transferred to the human agent, the agent assists the automated system either by communicating directly with the caller or by communicating through a machine interface.
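The monitoring step in the abstract, in which call parameters are measured and a human agent is notified when a call's health is bad, can be sketched as below. The abstract does not name the call parameters or any thresholds, so `asr_confidence`, `reprompts`, and the limits used here are hypothetical, as are the names `CallStatus`, `is_unhealthy`, and `monitor`.

```python
from collections import namedtuple

# Hypothetical call parameters: recognizer confidence in [0, 1] and the
# number of times the automated system had to re-prompt the caller.
CallStatus = namedtuple("CallStatus", ["call_id", "asr_confidence", "reprompts"])


def is_unhealthy(status, min_confidence=0.5, max_reprompts=2):
    """Flag a call whose measured parameters fall outside assumed limits."""
    return status.asr_confidence < min_confidence or status.reprompts > max_reprompts


def monitor(statuses, notify):
    """Check every active call and notify the human agent about unhealthy ones."""
    flagged = []
    for status in statuses:
        if is_unhealthy(status):
            notify(status)  # hand the call to a human agent for rectification
            flagged.append(status.call_id)
    return flagged


alerts = []
flagged = monitor(
    [
        CallStatus("c1", 0.9, 0),
        CallStatus("c2", 0.3, 1),  # low recognizer confidence
        CallStatus("c3", 0.8, 4),  # too many re-prompts
    ],
    alerts.append,
)
```

In a live system this check would run repeatedly for the whole duration of each call, as the abstract requires, rather than once over a fixed list.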
-
Publication No.: US20080312921A1
Publication Date: 2008-12-18
Application No.: US12195123
Filing Date: 2008-08-20
Applicant: Scott E. Axelrod, Sreeram Viswanath Balakrishnan, Stanley F. Chen, Yuging Gao, Rameah A. Gopinath, Hong-Kwang Kuo, Benoit Maison, David Nahamoo, Michael Alan Picheny, George A. Saon, Geoffrey G. Zweig
Inventor: Scott E. Axelrod, Sreeram Viswanath Balakrishnan, Stanley F. Chen, Yuging Gao, Rameah A. Gopinath, Hong-Kwang Kuo, Benoit Maison, David Nahamoo, Michael Alan Picheny, George A. Saon, Geoffrey G. Zweig
CPC Classification: G10L15/063, G10L15/02, G10L15/14, G10L2015/085
Abstract: In a speech recognition system, the combination of a log-linear model with a multitude of speech features is provided to recognize unknown speech utterances. The speech recognition system models the posterior probability of linguistic units relevant to speech recognition using a log-linear model. The posterior model captures the probability of the linguistic unit given the observed speech features and the parameters of the posterior model. The posterior model may be determined using the probability of the word sequence hypotheses given a multitude of speech features. Log-linear models are used with features derived from sparse or incomplete data. The speech features that are utilized may include asynchronous, overlapping, and statistically non-independent speech features. Not all features used in training need to appear in testing/recognition.
-
Publication No.: US20080168041A1
Publication Date: 2008-07-10
Application No.: US12054482
Filing Date: 2008-03-25
IPC Classification: G06F17/30
CPC Classification: G06F17/30864, Y10S707/99931, Y10S707/99932, Y10S707/99933, Y10S707/99935
Abstract: A method (100) of crawling the Web (620) is disclosed. The method (100) crawls (120) Web pages on the Web starting from a given (110) set of seed Universal Resource Locators (URLs). Crawled Web pages are partitioned (140) into sets of relevant and irrelevant pages. A set of exclusion and/or inclusion patterns is discovered (150) from the sets of relevant and irrelevant pages, and subsequent crawling of the Web is restricted through the set of exclusion and/or inclusion patterns.