-
Publication No.: US09223779B2
Publication date: 2015-12-29
Application No.: US14514279
Application date: 2014-10-14
Applicant: Alibaba Group Holding Limited
Inventor: Jian Sun, Lei Hou, Jing Ming Tang, Min Chu, Xiao Ling Liao, Bing Jing Xu, Ren Gang Peng, Yang Yang
CPC classification number: G06F17/2863, G06F17/271, G06F17/277, G06F17/2785, G06F17/28
Abstract: Text processing includes: segmenting received text based on a lexicon of smallest semantic units to obtain medium-grained segmentation results; merging the medium-grained segmentation results to obtain coarse-grained segmentation results, the coarse-grained segmentation results having coarser granularity than the medium-grained segmentation results; looking up in the lexicon of smallest semantic units respective search elements that correspond to segments in the medium-grained segmentation results; and forming fine-grained segmentation results based on the respective search elements, the fine-grained segmentation results having finer granularity than the medium-grained segmentation results.
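The three granularities described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the toy `LEXICON` and `MERGE_PHRASES` data, the function names, and the use of forward maximum matching for the medium-grained step (the abstract does not name a matching method or a merging model).

```python
# Hypothetical lexicon of smallest semantic units. Each entry maps a unit
# to its finer-grained "search elements" (illustrative data only).
LEXICON = {
    "new york": ["new", "york"],
    "city hall": ["city", "hall"],
    "tour": ["tour"],
}

# Hypothetical table of adjacent medium-grained segments that may be merged
# into one coarser unit. A real system would likely use a trained model.
MERGE_PHRASES = {("new york", "city hall")}


def segment_medium(tokens, lexicon, max_len=3):
    """Forward maximum matching of token runs against the lexicon (assumed method)."""
    out, i = [], 0
    while i < len(tokens):
        # Try the longest candidate first; a single token is always accepted.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            cand = " ".join(tokens[i:i + n])
            if cand in lexicon or n == 1:
                out.append(cand)
                i += n
                break
    return out


def merge_coarse(medium, phrases):
    """Greedily merge adjacent medium-grained segments listed in `phrases`."""
    out, i = [], 0
    while i < len(medium):
        if i + 1 < len(medium) and (medium[i], medium[i + 1]) in phrases:
            out.append(medium[i] + " " + medium[i + 1])
            i += 2
        else:
            out.append(medium[i])
            i += 1
    return out


def split_fine(medium, lexicon):
    """Replace each medium-grained segment with its search elements from the lexicon."""
    out = []
    for seg in medium:
        out.extend(lexicon.get(seg, [seg]))
    return out


text = "new york city hall tour"
medium = segment_medium(text.split(), LEXICON)
coarse = merge_coarse(medium, MERGE_PHRASES)
fine = split_fine(medium, LEXICON)
```

In this sketch the medium-grained result is the pivot: the coarse result is built by merging it, and the fine result is built by looking up each of its segments in the same lexicon, which mirrors the order of steps in the abstract.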
-
Publication No.: US20170206897A1
Publication date: 2017-07-20
Application No.: US15404855
Application date: 2017-01-12
Applicant: Alibaba Group Holding Limited
Inventor: Huixing Jiang, Jian Sun, Min Chu
CPC classification number: G10L15/22, G06F17/271, G06F17/2765, G06F17/30654, G06F17/30864, G10L13/08, G10L15/04, G10L15/1815, G10L25/51
Abstract: Analyzing textual data is disclosed, including by: receiving textual data; determining that the textual data is a candidate for analogy analysis based at least in part on at least a portion of the textual data matching an analogical question template; extracting a source substantive from the textual data; using the source substantive to determine a target substantive from a word vector model that is trained on a set of training data; and generating an answer including the target substantive based at least in part on an analogical answer template corresponding to the analogical question template.
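The pipeline in the abstract (template match, source extraction, word-vector lookup, templated answer) can be sketched as follows. The question and answer templates, the toy three-dimensional `WORD_VECTORS`, and the choice of nearest cosine neighbor as the way to pick the target substantive are all assumptions for illustration; the abstract only states that a trained word vector model is used.

```python
import math
import re

# Hypothetical analogical question template and its paired answer template.
QUESTION_TEMPLATE = re.compile(r"^what is (?P<source>\w+) like\?$", re.IGNORECASE)
ANSWER_TEMPLATE = "{source} is like {target}."

# Toy stand-in for a trained word vector model (illustrative values only).
WORD_VECTORS = {
    "cat": (0.9, 0.1, 0.0),
    "tiger": (0.8, 0.3, 0.1),
    "car": (0.0, 0.2, 0.9),
    "truck": (0.1, 0.1, 0.8),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def answer_analogy(text):
    """Return a templated answer, or None if the text is not a candidate."""
    match = QUESTION_TEMPLATE.match(text.strip())
    if match is None:
        return None  # text does not match the analogical question template
    source = match.group("source").lower()
    if source not in WORD_VECTORS:
        return None  # no vector available for the source substantive
    # Assumed selection rule: the nearest other word by cosine similarity.
    target = max(
        (w for w in WORD_VECTORS if w != source),
        key=lambda w: cosine(WORD_VECTORS[source], WORD_VECTORS[w]),
    )
    return ANSWER_TEMPLATE.format(source=source, target=target)


reply = answer_analogy("What is cat like?")
```

Text that does not match the template returns `None`, which corresponds to the abstract's step of first deciding whether the input is a candidate for analogy analysis at all.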
-
Publication No.: US10176804B2
Publication date: 2019-01-08
Application No.: US15404855
Application date: 2017-01-12
Applicant: Alibaba Group Holding Limited
Inventor: Huixing Jiang, Jian Sun, Min Chu
Abstract: Analyzing textual data is disclosed, including by: receiving textual data; determining that the textual data is a candidate for analogy analysis based at least in part on at least a portion of the textual data matching an analogical question template; extracting a source substantive from the textual data; using the source substantive to determine a target substantive from a word vector model that is trained on a set of training data; and generating an answer including the target substantive based at least in part on an analogical answer template corresponding to the analogical question template.
-
Publication No.: US20160132492A1
Publication date: 2016-05-12
Application No.: US14881927
Application date: 2015-10-13
Applicant: Alibaba Group Holding Limited
Inventor: Jian Sun, Lei Hou, Jing Ming Tang, Min Chu, Xiao Ling Liao, Bing Jing Xu, Ren Gang Peng, Yang Yang
CPC classification number: G06F17/2863, G06F17/271, G06F17/277, G06F17/2785, G06F17/28
Abstract: Text processing includes: segmenting received text based on a lexicon of smallest semantic units to obtain medium-grained segmentation results; merging the medium-grained segmentation results to obtain coarse-grained segmentation results, the coarse-grained segmentation results having coarser granularity than the medium-grained segmentation results; looking up in the lexicon of smallest semantic units respective search elements that correspond to segments in the medium-grained segmentation results; and forming fine-grained segmentation results based on the respective search elements, the fine-grained segmentation results having finer granularity than the medium-grained segmentation results.
-
Publication No.: US20150100307A1
Publication date: 2015-04-09
Application No.: US14514279
Application date: 2014-10-14
Applicant: Alibaba Group Holding Limited
Inventor: Jian Sun, Lei Hou, Jing Ming Tang, Min Chu, Xiao Ling Liao, Bing Jing Xu, Ren Gang Peng, Yang Yang
CPC classification number: G06F17/2863, G06F17/271, G06F17/277, G06F17/2785, G06F17/28
Abstract: Text processing includes: segmenting received text based on a lexicon of smallest semantic units to obtain medium-grained segmentation results; merging the medium-grained segmentation results to obtain coarse-grained segmentation results, the coarse-grained segmentation results having coarser granularity than the medium-grained segmentation results; looking up in the lexicon of smallest semantic units respective search elements that correspond to segments in the medium-grained segmentation results; and forming fine-grained segmentation results based on the respective search elements, the fine-grained segmentation results having finer granularity than the medium-grained segmentation results.
-