发明申请
- 专利标题: Matching text sets
- 专利标题(中): 匹配文本集
-
申请号: US13200123申请日: 2011-09-19
-
公开(公告)号: US20120072220A1公开(公告)日: 2012-03-22
- 发明人: Xu Zhang , Ningjun Su , Haijie Gu , Jiancheng Qi
- 申请人: Xu Zhang , Ningjun Su , Haijie Gu , Jiancheng Qi
- 专利权人: Alibaba Group Holding Limited
- 当前专利权人: Alibaba Group Holding Limited
- 优先权: CN201010290693.4 20100920
- 主分类号: G10L15/00
- IPC分类号: G10L15/00
摘要:
Matching text sets is disclosed, including: extracting a text set from data associated with a current period; storing the text set with a plurality of text sets; extracting a keyword from the text set; determining a weight value associated with the keyword associated with the text set; determining a degree of similarity between the text set and another text set based at least in part on a weight value associated with the keyword associated with the text set and a weight value associated with a keyword associated with the other text set; and determining whether the text set is related to the other text set based at least in part on the determined degree of similarity.
信息查询