发明申请
US20130144875A1 SET EXPANSION PROCESSING DEVICE, SET EXPANSION PROCESSING METHOD, PROGRAM AND NON-TRANSITORY MEMORY MEDIUM
有权
SET扩展处理设备,扩展处理方法,程序和非终端记忆介质
- 专利标题: SET EXPANSION PROCESSING DEVICE, SET EXPANSION PROCESSING METHOD, PROGRAM AND NON-TRANSITORY MEMORY MEDIUM
- 专利标题(中): SET扩展处理设备,扩展处理方法,程序和非终端记忆介质
-
申请号: US13700898申请日: 2012-02-22
-
公开(公告)号: US20130144875A1公开(公告)日: 2013-06-06
- 发明人: Masato Hagiwara
- 申请人: Masato Hagiwara
- 申请人地址: JP Shinagawa-ku, Tokyo
- 专利权人: RAKUTEN, INC.
- 当前专利权人: RAKUTEN, INC.
- 当前专利权人地址: JP Shinagawa-ku, Tokyo
- 优先权: JP2011-048124 20110304
- 国际申请: PCT/JP2012/054211 WO 20120222
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
A receiving unit (101) receives a seed string. A search unit (102) obtains snippets of documents containing the seed string. A segment acquisition unit (103) obtains segments by partitioning those snippets using a segment partition string. A segment component acquisition unit (104) obtains segment components by partitioning the segments using a segment component partition string. A segment score computation unit (105) computes a segment score for a segment from the standard deviation of the lengths of the segment components. A segment component score computation unit (106) computes a segment component score for a segment component from the segment score and the distance between the position of the seed string and the position of the segment component. A selection unit (107) selects any of the segment components as candidates for instances contained in the expanded set of the seed string based on the segment component scores.