Published Patent Application
- Patent title: INDEX GENERATING PROGRAM AND SEARCH PROGRAM
- Patent title (Chinese): 指数生成程序和搜索程序
- Application number: EP12877979.0
- Filing date: 2012-05-31
- Publication number: EP2857986A1
- Publication date: 2015-04-08
- Inventors: KATAOKA, Masahiro; MURATA, Takahiro; OHTA, Takafumi
- Applicant: Fujitsu Limited
- Applicant address: 1-1, Kamikondanaka 4-chome Nakahara-ku Kawasaki-shi, Kanagawa 211-8588 JP
- Patentee: Fujitsu Limited
- Current patentee: Fujitsu Limited
- Current patentee address: 1-1, Kamikondanaka 4-chome Nakahara-ku Kawasaki-shi, Kanagawa 211-8588 JP
- Agent: Hoffmann Eitle
- International publication: WO2013179348 20131205
- Main classification: G06F17/30
- IPC classification: G06F17/30
Abstract:
[Problem to be solved] In one aspect of an embodiment, an object of the invention is to suppress the narrow-down noise that arises when search targets are narrowed down during a string search on document data.
[Solution] According to one aspect of an embodiment, a computer selects how data in a document file is assigned to a plurality of blocks, depending on whether the document file contains a document element that has a predetermined number of child elements. The assignment is made either for each document element in the hierarchy of the child elements, or for each document element in the hierarchy of that document element or in a higher hierarchy. The computer then divides the document file into the plurality of blocks and generates, for each piece of data obtained by the division, index information that indicates whether that piece of data includes predetermined character information.
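The block division and index generation described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the document file is XML, that "has a predetermined number of child elements" means "at least `min_children`", that such an element causes one block per child element, and that otherwise the whole root element forms a single block. The function names `split_into_blocks`, `build_index`, and `candidate_blocks` are hypothetical, and text outside the chosen elements is ignored for brevity.

```python
import xml.etree.ElementTree as ET


def split_into_blocks(xml_text, min_children=3):
    """Divide a document into text blocks (assumed rule, see lead-in)."""
    root = ET.fromstring(xml_text)
    # Look for the first element, in document order, with enough child elements.
    wide = next((e for e in root.iter() if len(e) >= min_children), None)
    # Found: one block per child element. Not found: one block for the root.
    units = list(wide) if wide is not None else [root]
    return ["".join(unit.itertext()) for unit in units]


def build_index(blocks, characters):
    """For each character, record per block whether the block contains it."""
    return {ch: [ch in block for block in blocks] for ch in characters}


def candidate_blocks(index, n_blocks, query):
    """Return block numbers whose flags are set for every character of query."""
    absent = [False] * n_blocks  # characters missing from the index match nothing
    return [
        i for i in range(n_blocks)
        if all(index.get(ch, absent)[i] for ch in query)
    ]


xml = "<doc><item>apple pie</item><item>grape jam</item><item>plum tart</item></doc>"
blocks = split_into_blocks(xml, min_children=3)
index = build_index(blocks, "ajm")
hits = candidate_blocks(index, len(blocks), "jam")
```

An index of this kind only filters: a block can contain every character of the query without containing the query string itself, so candidate blocks still need to be checked with an actual string match. Such false candidates are one plausible reading of the narrow-down noise mentioned in the problem statement, and smaller, element-aligned blocks tend to produce fewer of them.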