Patent Application
US20130144875A1 SET EXPANSION PROCESSING DEVICE, SET EXPANSION PROCESSING METHOD, PROGRAM AND NON-TRANSITORY MEMORY MEDIUM (In force)

  • Title: SET EXPANSION PROCESSING DEVICE, SET EXPANSION PROCESSING METHOD, PROGRAM AND NON-TRANSITORY MEMORY MEDIUM
  • Application No.: US13700898
    Filing date: 2012-02-22
  • Publication No.: US20130144875A1
    Publication date: 2013-06-06
  • Inventor: Masato Hagiwara
  • Applicant: Masato Hagiwara
  • Applicant address: JP Shinagawa-ku, Tokyo
  • Assignee: RAKUTEN, INC.
  • Current assignee: RAKUTEN, INC.
  • Current assignee address: JP Shinagawa-ku, Tokyo
  • Priority: JP2011-048124 2011-03-04
  • International application: PCT/JP2012/054211 WO 2012-02-22
  • Main classification: G06F17/30
  • IPC classification: G06F17/30
Abstract:
A receiving unit (101) receives a seed string. A search unit (102) obtains snippets of documents containing the seed string. A segment acquisition unit (103) obtains segments by partitioning those snippets using a segment partition string. A segment component acquisition unit (104) obtains segment components by partitioning the segments using a segment component partition string. A segment score computation unit (105) computes a segment score for a segment from the standard deviation of the lengths of the segment components. A segment component score computation unit (106) computes a segment component score for a segment component from the segment score and the distance between the position of the seed string and the position of the segment component. A selection unit (107) selects any of the segment components as candidates for instances contained in the expanded set of the seed string based on the segment component scores.
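
The flow of units 101 to 107 can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not give the scoring formulas, so the ones used here are assumptions chosen only to match its description (a lower standard deviation of component lengths gives a higher segment score, and a smaller distance from the seed gives a higher component score). The function name `expand_set`, the default partition strings, the use of component index as "position", and the rule of skipping segments that do not contain the seed are all hypothetical. Snippets are passed in directly in place of a real search unit.

```python
from statistics import pstdev


def expand_set(seed, snippets, segment_sep="\n", component_sep=",", top_k=5):
    """Return up to top_k candidate instances for the set containing `seed`.

    Hypothetical sketch; the scoring formulas are assumptions, not taken
    from the patent text.
    """
    scores = {}
    for snippet in snippets:
        # Search unit (102) stand-in: keep only snippets containing the seed.
        if seed not in snippet:
            continue
        # Segment acquisition unit (103): partition the snippet into segments.
        for segment in snippet.split(segment_sep):
            # Segment component acquisition unit (104): partition the segment.
            parts = [p.strip() for p in segment.split(component_sep)]
            components = [p for p in parts if p]
            # Assumption: only score segments that list the seed itself.
            if seed not in components or len(components) < 2:
                continue
            # Segment score unit (105), assumed formula: components of
            # similar length (low standard deviation) score higher.
            lengths = [len(c) for c in components]
            segment_score = 1.0 / (1.0 + pstdev(lengths))
            seed_pos = components.index(seed)
            for pos, component in enumerate(components):
                if component == seed:
                    continue
                # Component score unit (106), assumed formula: the segment
                # score, discounted by the index distance from the seed.
                distance = abs(pos - seed_pos)
                score = segment_score / (1.0 + distance)
                scores[component] = max(scores.get(component, 0.0), score)
    # Selection unit (107): highest scores first, ties broken alphabetically.
    ranked = sorted(scores, key=lambda c: (-scores[c], c))
    return ranked[:top_k]


if __name__ == "__main__":
    demo_snippets = [
        "apple, banana, cherry, mango\nClick here to read our privacy policy, ok"
    ]
    print(expand_set("banana", demo_snippets))
```

In this sketch the second line of the demo snippet is ignored because it does not contain the seed as a component; a variant that measures distance in characters across the whole snippet would also be consistent with the abstract.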