-
公开(公告)号:US11886229B1
公开(公告)日:2024-01-30
申请号:US17182083
申请日:2021-02-22
Applicant: Tanium Inc.
Inventor: Naveen Goela , Joshua F. Stoddard , John R. Coates , Christian L. Hunt , Adam Mustafa
IPC: G06F16/14 , G06F16/13 , G06F16/93 , G06F16/182 , G06F18/22
CPC classification number: G06F16/156 , G06F16/137 , G06F16/144 , G06F16/182 , G06F16/93 , G06F18/22
Abstract: In a distributed system that includes a collection of machines, a server system generates a global dictionary from sampling responses received from machines in the collection of machine, at least a subject of the sampling responses including information indicating one or more terms in a corpus of information stored at a respective machine in the collection of machines. The global dictionary includes global document frequency values corresponding to the document frequencies of terms in the corpora of information stored in the collection of machines. The server system generates a similarity search query for a target document, the similarity search query including identifiers of terms in the target document and optionally document frequency information for those terms, obtained from the global dictionary, and sends, through one or more linear communication orbits, the similarity search query to one or more respective machines in the collection of machines.