Invention Grant
- Patent Title: Thematic web corpus
- Application No.: US15354870
- Application Date: 2016-11-17
- Publication No.: US10783196B2
- Publication Date: 2020-09-22
- Inventor: Xavier Grehant, Morgan Champenois
- Applicant: DASSAULT SYSTEMES
- Applicant Address: FR Velizy Villacoublay
- Assignee: DASSAULT SYSTEMES
- Current Assignee: DASSAULT SYSTEMES
- Current Assignee Address: FR Velizy Villacoublay
- Agency: Oblon, McClelland, Maier & Neustadt, L.L.P.
- Main IPC: G06F16/00
- IPC: G06F16/00 ; G06F16/9535 ; G06F16/955 ; G06F16/951 ; H04L29/08 ; H04L29/06

Abstract:
The invention notably relates to a computer-implemented method, performed by a server storing an index of a search engine, for sending, to a client, the URLs of pages of a Web corpus that relates to a theme. The method comprises receiving, from the client, a structured query that corresponds to the theme, the structured query consisting of a disjunction of at least one keyword; determining in the index the group that consists of the URLs of all pages that match the query; and sending to the client the URLs of the group as a stream. Such a method improves the building of a thematic Web corpus.
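The three steps in the abstract (receive a disjunctive keyword query, determine the matching URLs in the index, send them as a stream) can be illustrated with a minimal Python sketch. It is not the patented implementation. The toy in-memory inverted index, the function name `stream_matching_urls`, and the example URLs are all assumptions made for illustration, and a generator stands in for the network stream to the client.

```python
from typing import Dict, Iterable, Iterator, Set

# Hypothetical toy inverted index: keyword -> URLs of the pages containing it.
# A real search-engine index is far richer; this shape is an assumption.
InvertedIndex = Dict[str, Set[str]]


def stream_matching_urls(index: InvertedIndex,
                         keywords: Iterable[str]) -> Iterator[str]:
    """Yield, one at a time, the URL of every page matching the query.

    The query is a disjunction (logical OR) of keywords, so a page matches
    if it is indexed under at least one of them. Each URL is yielded once.
    """
    seen: Set[str] = set()
    for keyword in keywords:
        # Sorting only makes the output order deterministic for the example.
        for url in sorted(index.get(keyword, ())):
            if url not in seen:
                seen.add(url)
                yield url


# Example: a two-keyword disjunction over a three-page index.
example_index: InvertedIndex = {
    "cad": {"http://a.example/1", "http://b.example/2"},
    "plm": {"http://b.example/2", "http://c.example/3"},
}

for url in stream_matching_urls(example_index, ["cad", "plm"]):
    print(url)
```

Because the function is a generator, the caller can start consuming URLs before the whole group has been computed, which loosely mirrors sending the group to the client as a stream rather than as one complete response.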
Public/Granted literature
- US20170140055A1 THEMATIC WEB CORPUS Public/Granted day: 2017-05-18