-
公开(公告)号:US20120130705A1
公开(公告)日:2012-05-24
申请号:US13298941
申请日:2011-11-17
申请人: Jian Sun , Lei Hou , Jing Ming Tang , Min Chu , Xiao Ling Liao , Bing Jing Xu , Ren Gang Peng , Yang Yang
发明人: Jian Sun , Lei Hou , Jing Ming Tang , Min Chu , Xiao Ling Liao , Bing Jing Xu , Ren Gang Peng , Yang Yang
IPC分类号: G06F17/27
CPC分类号: G06F17/2863 , G06F17/271 , G06F17/277 , G06F17/2785 , G06F17/28
摘要: Text processing includes: segmenting received text based on a lexicon of smallest semantic units to obtain medium-grained segmentation results; merging the medium-grained segmentation results to obtain coarse-grained segmentation results, the coarse-grained segmentation results having coarser granularity than the medium-grained segmentation results; looking up in the lexicon of smallest semantic units respective search elements that correspond to segments in the medium-grained segmentation results; and forming fine-grained segmentation results based on the respective search elements, the fine-grained segmentation results having finer granularity than the medium-grained segmentation results.
摘要翻译: 文本处理包括:基于最小语义单元的词典对接收的文本进行分段,以获得中粒度分割结果; 合并中粒度分割结果以获得粗粒度分割结果,粗粒度分割结果具有比中粒度分割结果更粗糙的粒度; 在最小语义单元的词典中查找与中粒度分割结果中的段对应的相应搜索元素; 并且基于相应的搜索元素形成细粒度分割结果,细粒度分割结果具有比中粒度分割结果更精细的粒度。
-
公开(公告)号:US08892420B2
公开(公告)日:2014-11-18
申请号:US13298941
申请日:2011-11-17
申请人: Jian Sun , Lei Hou , Jing Ming Tang , Min Chu , Xiao Ling Liao , Bing Jing Xu , Ren Gang Peng , Yang Yang
发明人: Jian Sun , Lei Hou , Jing Ming Tang , Min Chu , Xiao Ling Liao , Bing Jing Xu , Ren Gang Peng , Yang Yang
IPC分类号: G06F17/27
CPC分类号: G06F17/2863 , G06F17/271 , G06F17/277 , G06F17/2785 , G06F17/28
摘要: Text processing includes: segmenting received text based on a lexicon of smallest semantic units to obtain medium-grained segmentation results; merging the medium-grained segmentation results to obtain coarse-grained segmentation results, the coarse-grained segmentation results having coarser granularity than the medium-grained segmentation results; looking up in the lexicon of smallest semantic units respective search elements that correspond to segments in the medium-grained segmentation results; and forming fine-grained segmentation results based on the respective search elements, the fine-grained segmentation results having finer granularity than the medium-grained segmentation results.
摘要翻译: 文本处理包括:基于最小语义单元的词典对接收的文本进行分段,以获得中粒度分割结果; 合并中粒度分割结果以获得粗粒度分割结果,粗粒度分割结果具有比中粒度分割结果更粗糙的粒度; 在最小语义单元的词典中查找与中粒度分割结果中的段对应的相应搜索元素; 并且基于相应的搜索元素形成细粒度分割结果,细粒度分割结果具有比中粒度分割结果更精细的粒度。
-