-
1.
Publication No.: US10073839B2
Publication date: 2018-09-11
Application No.: US13930660
Filing date: 2013-06-28
IPC classification: G10L21/00, G06F17/27, G06F17/30, G06Q50/00, G10L21/0208
CPC classification: G06F17/2795, G06F16/374, G06F16/951, G06F17/2785, G06Q50/01, G10L21/0208
Abstract: Arrangements described herein relate to language enhancement. Source text can be automatically gathered from a plurality of text sources, the plurality of text sources including at least one social media website, and storing the source text to a thesaurus data infrastructure. Subject text being exposed to thesaurus processing can be received, a context of the subject text can be identified, and the thesaurus data infrastructure can be accessed while the thesaurus queries previously acquired source texts or documents having similar context to identify source text having context similar to the context of the subject text. The identified source text can be analyzed to identify at least one candidate word or phrase contained in the source text to recommend as a replacement for at least one word or phrase contained in the subject text. The identified at least one candidate word or phrase can be recommended as the replacement for the at least one word or phrase contained in the subject text.
-
2.
Publication No.: US20150006149A1
Publication date: 2015-01-01
Application No.: US13930660
Filing date: 2013-06-28
IPC classification: G06F17/27
CPC classification: G06F17/2795, G06F17/2785, G06F17/30737, G06F17/30864, G06Q50/01, G10L21/0208
Abstract: Arrangements described herein relate to language enhancement. Source text can be automatically gathered from a plurality of text sources, the plurality of text sources including at least one social media website, and storing the source text to a thesaurus data infrastructure. Subject text being exposed to thesaurus processing can be received, a context of the subject text can be identified, and the thesaurus data infrastructure can be accessed to identify source text having context similar to the context of the subject text. The identified source text can be analyzed to identify at least one candidate word or phrase contained in the source text to recommend as a replacement for at least one word or phrase contained in the subject text. The identified at least one candidate word or phrase can be recommended as the replacement for the at least one word or phrase contained in the subject text.
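Illustrative sketch (not part of the patent record): the abstract describes identifying the context of a subject text, finding stored source text with similar context, and recommending a word from that source text as a replacement. The Python below is one minimal way such a flow could look, under loud assumptions: context is approximated by a bag-of-words vector, similarity by cosine similarity, and a candidate is any word that follows the same preceding word as the target. The names `tokenize`, `cosine`, `recommend`, and `min_similarity` are hypothetical and do not come from the patent, which does not specify these techniques in its abstract.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def recommend(subject_text, target, source_texts, min_similarity=0.5):
    """Suggest replacements for `target` drawn from similar-context source texts.

    Assumption: a word is a candidate if it follows the same preceding word
    as `target` does in the subject text. This is a stand-in heuristic, not
    the method claimed in the patent.
    """
    subject_tokens = tokenize(subject_text)
    if target not in subject_tokens:
        return []
    idx = subject_tokens.index(target)
    prev_word = subject_tokens[idx - 1] if idx > 0 else None
    # The subject's context is every word except the one being replaced.
    subject_ctx = Counter(t for t in subject_tokens if t != target)

    scores = Counter()
    for source in source_texts:
        tokens = tokenize(source)
        similarity = cosine(subject_ctx, Counter(tokens))
        if similarity < min_similarity:
            continue  # context too different; skip this source text
        for i, tok in enumerate(tokens):
            before = tokens[i - 1] if i > 0 else None
            if before == prev_word and tok != target and tok not in subject_ctx:
                scores[tok] += similarity
    # Highest accumulated similarity first.
    return [word for word, _ in scores.most_common()]


if __name__ == "__main__":
    # Hypothetical stored source texts, standing in for gathered posts.
    sources = [
        "the movie was awesome tonight",
        "that movie was awesome",
        "the stock market was volatile today",
    ]
    print(recommend("the movie was good", "good", sources))
```

A real system would need a far richer notion of context and candidate selection; the threshold of 0.5 is arbitrary and chosen only so that unrelated source text is filtered out in the small example.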
-