Invention Grant
- Patent Title: Information collection apparatus, search engine, information collection method, and program
- Patent Title (中): 信息收集设备,搜索引擎,信息收集方法和程序
-
Application No.: US13003875Application Date: 2009-08-14
-
Publication No.: US08676782B2Publication Date: 2014-03-18
- Inventor: Seiji Hamada , Makoto Yamamoto
- Applicant: Seiji Hamada , Makoto Yamamoto
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Konrad Raynes Davda & Victor LLP
- Agent David W. Victor
- Priority: JP2008-261848 20081008
- International Application: PCT/JP2009/064362 WO 20090814
- International Announcement: WO2010/041517 WO 20100415
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
The present invention provides an information collection apparatus, an information collection method, and a program capable of collecting information from information resources on a network effectively as well as a search engine that searches the information resources collected. An information collection apparatus of the present invention that collects information from information resources on a network includes an extraction unit that acquires data from an information resource via the network to extract a link-destination address included in the data, a calculation unit that calculates, by comparing each link-destination address with a collection rule describing a set of addresses qualified for a collection target, a score for each link-destination address that reflects a distance from the set to a link-destination information resource indicated by the link-destination address, and a judgment unit that judges whether the link-destination information resource is to be included in the collection target or not in accordance with the score calculated for the link-destination information resource.
Public/Granted literature
- US20110119263A1 INFORMATION COLLECTION APPARATUS, SEARCH ENGINE, INFORMATION COLLECTION METHOD, AND PROGRAM Public/Granted day:2011-05-19
Information query