Thematic web corpus
    1.
    Granted invention patent

    Publication No.: US10783196B2

    Publication date: 2020-09-22

    Application No.: US15354870

    Filing date: 2016-11-17

    Abstract: The invention notably relates to a computer-implemented method, performed by a server storing an index of a search engine, for sending, to a client, the URLs of pages of a Web corpus that relates to a theme. The method comprises receiving, from the client, a structured query that corresponds to the theme, the structured query consisting of a disjunction of at least one keyword; determining in the index the group that consists of the URLs of all pages that match the query; and sending to the client the URLs of the group as a stream. Such a method improves the building of a thematic Web corpus.
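The three steps in the abstract (receive a disjunctive keyword query, look up all matching URLs in the index, stream them to the client) can be illustrated with a minimal Python sketch. This is not the patented implementation: the in-memory inverted index, the function name `stream_matching_urls`, and the example URLs are all assumptions made for illustration, and a generator stands in for the network stream.

```python
from typing import Dict, Iterable, Iterator, Set

# Hypothetical inverted index: keyword -> set of URLs of pages containing it.
Index = Dict[str, Set[str]]


def stream_matching_urls(index: Index, keywords: Iterable[str]) -> Iterator[str]:
    """Lazily yield every URL that matches at least one keyword (a disjunction).

    Each URL is yielded once, even if several keywords match the same page.
    """
    seen: Set[str] = set()
    for keyword in keywords:
        # Sort only to make the illustrative output deterministic.
        for url in sorted(index.get(keyword.lower(), ())):
            if url not in seen:
                seen.add(url)
                yield url


# Illustrative data only; these URLs are placeholders.
index: Index = {
    "corpus": {"http://example.com/a", "http://example.com/b"},
    "crawler": {"http://example.com/b", "http://example.com/c"},
}

# The structured query is the disjunction "corpus OR crawler".
urls = list(stream_matching_urls(index, ["corpus", "crawler"]))
```

Because the function is a generator, a caller can start consuming URLs before the whole group has been computed, which is the property the abstract attributes to sending the group "as a stream".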

    THEMATIC WEB CORPUS
    2.
    Invention application
    THEMATIC WEB CORPUS (status: under examination, published)

    Publication No.: US20170140055A1

    Publication date: 2017-05-18

    Application No.: US15354870

    Filing date: 2016-11-17

    Abstract: The invention notably relates to a computer-implemented method, performed by a server storing an index of a search engine, for sending, to a client, the URLs of pages of a Web corpus that relates to a theme. The method comprises receiving, from the client, a structured query that corresponds to the theme, the structured query consisting of a disjunction of at least one keyword; determining in the index the group that consists of the URLs of all pages that match the query; and sending to the client the URLs of the group as a stream. Such a method improves the building of a thematic Web corpus.
