-
公开(公告)号:US08909516B2
公开(公告)日:2014-12-09
申请号:US13313034
申请日:2011-12-07
CPC分类号: G06F17/27
摘要: Computing functionality converts an input linguistic item into a normalized linguistic item, representing a normalized counterpart of the input linguistic item. In one environment, the input linguistic item corresponds to a complaint by a person receiving medical care, and the normalized linguistic item corresponds to a definitive and error-free version of that complaint. In operation, the computing functionality uses plural reference resources to expand the input linguistic item, creating an expanded linguistic item. The computing functionality then forms a graph based on candidate tokens that appear in the expanded linguistic item, and then finds a shortest path through the graph; that path corresponds to the normalized linguistic item. The computing functionality may use a statistical language model to assign weights to edges in the graph, and to determine whether the normalized linguistic incorporates two or more component linguistic items.
摘要翻译: 计算功能将输入语言项目转换为归一化语言项目,表示输入语言项目的归一化对应项。 在一个环境中,输入语言项目对应于接受医疗护理的人的投诉,而归一化语言项目对应于该投诉的确定和无错误的版本。 在操作中,计算功能使用多个参考资源来扩展输入语言项,创建扩展的语言项。 然后,计算功能基于出现在扩展语言项目中的候选令牌形成图形,然后找到通过图形的最短路径; 该路径对应于归一化语言项。 计算功能可以使用统计语言模型来向图中的边缘分配权重,并且确定归一化语言是否包含两个或多个组件语言项。
-
公开(公告)号:US20130110497A1
公开(公告)日:2013-05-02
申请号:US13313034
申请日:2011-12-07
IPC分类号: G06F17/27
CPC分类号: G06F17/27
摘要: Functionality is described herein for converting an input linguistic item into a normalized linguistic item, representing a normalized counterpart of the input linguistic item. In one environment, the input linguistic item corresponds to a complaint by a person receiving medical care, and the normalized linguistic item corresponds to a definitive and error-free version of that complaint. In operation, the functionality uses plural reference resources to expand the input linguistic item, creating an expanded linguistic item. The functionality then forms a graph based on candidate tokens that appear in the expanded linguistic item, and then finds a shortest path through the graph; that path corresponds to the normalized linguistic item. The functionality may use a statistical language model to assign weights to edges in the graph, and to determine whether the normalized linguistic incorporates two or more component linguistic items.
摘要翻译: 这里描述了将输入语言项转换成标准化语言项的功能,表示输入语言项的归一化对应物。 在一个环境中,输入语言项目对应于接受医疗护理的人的投诉,而归一化的语言项目对应于该投诉的明确且无错误的版本。 在操作中,功能使用多个参考资源来扩展输入语言项,创建扩展的语言项。 然后,功能基于出现在扩展语言项目中的候选令牌形成图形,然后通过图形找到最短路径; 该路径对应于归一化语言项。 功能可以使用统计语言模型来向图中的边缘分配权重,并且确定归一化语言是否包含两个或多个组件语言项。
-