Integrated retrieval scheme for retrieving semi-structured documents
    Type: Granted invention patent
    Status: Expired

    Publication No.: US06424980B1

    Publication date: 2002-07-23

    Application No.: US09328944

    Filing date: 1999-06-09

    IPC classification: G06F1500

    Abstract: An integrated retrieval scheme retrieves data from a plurality of semi-structured documents scattered over open networks, collecting the required information item by item through a unified interface regardless of differences in the documents' structures, presentation styles, and elements. The scheme receives from a user a query consisting of search items and search conditions. Using location data that specifies the location of each semi-structured document, it finds the location of each document that contains all of the search items. If necessary, it converts the item presentation styles of the entered query into those of the located documents according to style conversion data. It then forms queries for the located documents, transmits them to the found locations, and obtains the documents. It extracts item data from the obtained documents according to structure data, which delimits a document into items, and attribute data, which is used for conditional retrieval, and prepares a search result. Finally, if necessary, it converts the item presentation styles of the search result into those of the user according to the style conversion data.
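The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Source` record, the `search` function, the regex-based structure data, the in-memory `DOCS` stand-in for networked documents, and the dollars/cents style conversion are all hypothetical names and simplifications chosen for illustration.

```python
import operator
import re
from dataclasses import dataclass, field

OPS = {"<": operator.lt, "<=": operator.le, "==": operator.eq,
       ">=": operator.ge, ">": operator.gt}


def identity(value):
    return value


@dataclass
class Source:
    """Hypothetical per-document metadata, loosely following the abstract."""
    location: str       # location data: where the document lives
    record: str         # structure data: regex with one named group per item
    attributes: dict    # attribute data: item name -> type used in conditions
    to_source: dict = field(default_factory=dict)  # style conversion, user -> source
    to_user: dict = field(default_factory=dict)    # style conversion, source -> user


def search(items, conditions, sources, fetch):
    """items: item names to return; conditions: (item, op, value) triples."""
    needed = set(items) | {item for item, _, _ in conditions}
    results = []
    for src in sources:
        # 1. Keep only documents that contain every requested item.
        if not needed <= set(src.attributes):
            continue
        # 2. Convert condition values into the document's presentation style.
        local = [(item, OPS[op], src.to_source.get(item, identity)(value))
                 for item, op, value in conditions]
        # 3. Obtain the document from its location.
        text = fetch(src.location)
        # 4. Delimit the document into items and apply the conditions.
        for match in re.finditer(src.record, text):
            row = {k: src.attributes[k](match.group(k)) for k in needed}
            if all(cmp(row[item], value) for item, cmp, value in local):
                # 5. Convert the result back into the user's presentation style.
                results.append({k: src.to_user.get(k, identity)(row[k])
                                for k in items})
    return results


# Toy documents standing in for pages on an open network (assumed data).
DOCS = {
    "shop-a": "<li>Widget | 12.50</li><li>Gadget | 40.00</li>",
    "shop-b": "name=Widget;cents=1100\nname=Gizmo;cents=9900\n",
    "shop-c": "<p>Widget</p>",  # has no price item, so price queries skip it
}
SOURCES = [
    Source("shop-a", r"<li>(?P<name>[^|<]+?) \| (?P<price>[\d.]+)</li>",
           {"name": str, "price": float}),
    Source("shop-b", r"name=(?P<name>\w+);cents=(?P<price>\d+)",
           {"name": str, "price": int},
           to_source={"price": lambda dollars: round(dollars * 100)},
           to_user={"price": lambda cents: cents / 100}),
    Source("shop-c", r"<p>(?P<name>\w+)</p>", {"name": str}),
]

hits = search(["name", "price"], [("price", "<", 20.0)], SOURCES, DOCS.__getitem__)
print(hits)
```

Passing `fetch` as a parameter keeps the sketch runnable without network access; a real system would presumably substitute a function that transmits the formed query to the document's location.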
