-
Publication No.: US10747750B2
Publication Date: 2020-08-18
Application No.: US15787804
Filing Date: 2017-10-19
Applicant: Oath Inc.
Inventor: Edward Bortnikov, Ohad Shacham, Dmitry Basin, Gali Sheffi, Yonatan Gottesman
IPC: G06F16/93, G06F16/248, G06F16/9535, G06F16/2457, G06F16/23, G06F16/22
Abstract: Systems and methods for indexing content in a personal search system are disclosed. In one embodiment, the method comprises receiving a content item from a data source; extracting, from the content item, at least one document appearing in the content item, the at least one document including a first timestamp; storing the document in a content index; retrieving an event associated with the document from an event index, the event including a second timestamp and one or more attributes; generating an updated document based on the one or more attributes, the updated document including the second timestamp and the one or more attributes; and replacing the document stored in the content index with the updated document if the second timestamp is later than the first timestamp.
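The timestamp-guarded replacement described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Document` and `Event` classes, the `apply_event` function, the use of a plain dict as the content index, and the choice to merge event attributes over the stored ones are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    doc_id: str
    timestamp: int                      # the "first timestamp"
    attributes: dict = field(default_factory=dict)


@dataclass
class Event:
    doc_id: str
    timestamp: int                      # the "second timestamp"
    attributes: dict


def apply_event(content_index: dict, event: Event) -> bool:
    """Replace the stored document only if the event is strictly newer.

    Returns True when the content index was updated.
    """
    stored = content_index.get(event.doc_id)
    if stored is None or event.timestamp <= stored.timestamp:
        return False  # unknown document or stale event: keep the index as-is
    # Assumption: the updated document merges event attributes over old ones.
    content_index[event.doc_id] = Document(
        doc_id=stored.doc_id,
        timestamp=event.timestamp,
        attributes={**stored.attributes, **event.attributes},
    )
    return True
```

With an index holding a document at timestamp 10, an event at timestamp 20 replaces it, while a later-arriving event at timestamp 15 is ignored, so out-of-order events cannot overwrite newer state.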
-
Publication No.: US20200210434A1
Publication Date: 2020-07-02
Application No.: US16235204
Filing Date: 2018-12-28
Applicant: Oath Inc.
Inventor: Eshcar Hillel, Edward Bortnikov, Idit Keidar, Alexander Spiegelman, Lee Rhodes
IPC: G06F16/2455, G06F7/08, G06F16/27, G06F16/22
Abstract: The present teaching relates to a method and system for generating a data sketch. A plurality of local data sketches are generated, wherein for each local data sketch, it is determined whether a propagation condition is satisfied. In response to the condition being satisfied, the local data sketch is processed when a synchronization parameter satisfies a criterion. The synchronization parameter is set to a predetermined value to enable propagation of the local data sketch to the data sketch. Upon the local data sketch being propagated to the data sketch, the above steps are repeated.
-
Publication No.: US11030263B2
Publication Date: 2021-06-08
Application No.: US15977492
Filing Date: 2018-05-11
Applicant: Oath Inc.
Inventor: Edward Bortnikov, Yonatan Gottesman, Gali Sheffi, Dmitry Basin, Nandaja Ananthanarayanan, Rakesh Chhabra
IPC: G06F16/00, G06F16/9535, G06F17/18, G06F16/23, G06F16/31, G06F16/38, G06F16/17, G06F40/30
Abstract: Methods, systems, and programming for updating a search index are described herein. In a non-limiting embodiment, event data representing actions performed on a plurality of content items managed via the search index may be obtained. The event data may be filtered based on pre-determined criteria to generate updated event data, where the updated event data represents relevant actions performed on at least one of the plurality of content items and are associated with the search index. An updated search index based on the updated event data may then be generated.
-
Publication No.: US20200333968A1
Publication Date: 2020-10-22
Application No.: US16386505
Filing Date: 2019-04-17
Applicant: Oath Inc.
Inventor: Edward Bortnikov, Eshcar Hillel, Anastasia Braginsky, Eran Gilad, Idit Keidar, Yonatan Gottesman
IPC: G06F3/06, G06F16/9035, G06F16/903, G06F16/901
Abstract: The present teaching relates to a method, system and programming for operating a data storage. The data storage comprises different portions including: a first portion having a plurality of metadata objects stored therein, each of the metadata objects being associated with a filter and corresponding to a range of keys, wherein at least one of the metadata objects is associated with a data structure, and a second portion having a plurality of files stored therein, each of the plurality of files being associated with one of the plurality of metadata objects. The data storage synchronizes a scan request with respect to one or more write requests based on a parameter associated with the scan request and each of the one or more write requests.
-
Publication No.: US11048423B2
Publication Date: 2021-06-29
Application No.: US16386505
Filing Date: 2019-04-17
Applicant: Oath Inc.
Inventor: Edward Bortnikov, Eshcar Hillel, Anastasia Braginsky, Eran Gilad, Idit Keidar, Yonatan Gottesman
IPC: G06F12/00, G06F3/06, G06F16/901, G06F16/9035, G06F16/903
Abstract: The present teaching relates to a method, system and programming for operating a data storage. The data storage comprises different portions including: a first portion having a plurality of metadata objects stored therein, each of the metadata objects being associated with a filter and corresponding to a range of keys, wherein at least one of the metadata objects is associated with a data structure, and a second portion having a plurality of files stored therein, each of the plurality of files being associated with one of the plurality of metadata objects. The data storage synchronizes a scan request with respect to one or more write requests based on a parameter associated with the scan request and each of the one or more write requests.
-
Publication No.: US20200334258A1
Publication Date: 2020-10-22
Application No.: US16386496
Filing Date: 2019-04-17
Applicant: Oath Inc.
Inventor: Edward Bortnikov, Eshcar Hillel, Anastasia Braginsky, Eran Gilad, Idit Keidar, Yonatan Gottesman
IPC: G06F16/2455, G06F16/2458
Abstract: The present teaching relates to a method, system and programming for searching a data storage. A key is extracted from a request and a metadata object associated with the key is identified. Further, a determination is made as to whether the metadata object is associated with a data structure stored in a first portion of the data storage. In response to a successful determination, the data structure is searched to retrieve a value associated with the key from the first portion. In response to an unsuccessful determination, a cache is searched to retrieve the value associated with the key, and in response to the key being absent in the cache, a file associated with the metadata object is searched to retrieve the value associated with the key, wherein the file is stored in a second portion of the data storage.
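The three-tier lookup in this abstract (in-memory data structure, then cache, then file) can be sketched roughly as below. Everything here is a hypothetical stand-in: `Chunk` plays the role of a metadata object covering a key range, plain dicts stand in for the in-memory data structure, the cache, and the on-disk file, and populating the cache after a file read is an assumption the abstract does not state.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import Optional


@dataclass
class Chunk:
    """Hypothetical metadata object covering keys >= min_key."""
    min_key: str
    file: dict                       # stand-in for the file in the second portion
    memory: Optional[dict] = None    # in-memory data structure, if one is attached


class Store:
    def __init__(self, chunks):
        self.chunks = sorted(chunks, key=lambda c: c.min_key)
        self.min_keys = [c.min_key for c in self.chunks]
        self.cache = {}

    def get(self, key):
        # Identify the metadata object whose key range contains the key.
        i = bisect_right(self.min_keys, key) - 1
        if i < 0:
            return None
        chunk = self.chunks[i]
        if chunk.memory is not None:
            return chunk.memory.get(key)   # served from the first portion
        if key in self.cache:
            return self.cache[key]         # served from the cache
        value = chunk.file.get(key)        # fall back to the file
        if value is not None:
            self.cache[key] = value        # assumption: cache file reads
        return value
```

A chunk with an attached in-memory structure answers reads directly; a chunk without one goes through the cache and only touches its file on a cache miss.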
-
Publication No.: US10510017B2
Publication Date: 2019-12-17
Application No.: US14945221
Filing Date: 2015-11-18
Applicant: Oath Inc.
Inventor: Edward Bortnikov, Alexander Libov, Ohad Shacham
Abstract: In one embodiment, a processor of a computing device receives a query. The computing device may compare a centroid of each of a plurality of clusters to the query such that a subset of the plurality of clusters is selected, each of the plurality of clusters having a set of data points. An assignment of the subset of the plurality of clusters may be communicated to a hardware accelerator of the computing device. A plurality of threads of the hardware accelerator of the computing device may generate one or more distance tables that store results of intermediate computations corresponding to the query and the subset of the plurality of clusters. The distance tables may be stored in shared memory of the hardware accelerator. A plurality of threads of the hardware accelerator may determine a plurality of data points using the distance tables. The processor may provide query results pertaining to at least a portion of the plurality of data points.
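The cluster-selection step in this abstract can be illustrated with a small CPU-only sketch. It covers only the comparison of the query against cluster centroids and the search inside the selected clusters; the hardware accelerator, its threads, and the shared-memory distance tables are not modeled. The function names, the `n_probe` parameter, and the use of Euclidean distance are assumptions for the example.

```python
import math


def nearest_clusters(query, centroids, n_probe):
    """Return the indices of the n_probe clusters whose centroids are closest."""
    order = sorted(
        range(len(centroids)),
        key=lambda i: math.dist(query, centroids[i]),
    )
    return order[:n_probe]


def search(query, centroids, clusters, n_probe, k):
    """Scan only the selected clusters and return the k closest points.

    Each result is a (distance, point) tuple, sorted by increasing distance.
    """
    candidates = []
    for c in nearest_clusters(query, centroids, n_probe):
        for point in clusters[c]:
            candidates.append((math.dist(query, point), point))
    return sorted(candidates)[:k]
```

Probing fewer clusters reduces the number of distance computations at the cost of possibly missing a true nearest neighbor that lies in an unselected cluster.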
-
Publication No.: US10459959B2
Publication Date: 2019-10-29
Application No.: US15345277
Filing Date: 2016-11-07
Applicant: Oath Inc.
Inventor: David Carmel, Guy Gueta, Edward Bortnikov
Abstract: Methods and apparatus for performing top-k query processing include pruning a list of documents to identify a subset of the list of documents, where pruning includes, for other query terms in the set of query terms, skipping a document in the list of documents based, at least in part, on the contribution of the query term to the score of the corresponding document and the term upper bound for each other query term, in the set of query terms, that matches the document.
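The pruning rule in this abstract can be sketched as an upper-bound check before full scoring. This is a simplified illustration under stated assumptions, not the claimed method: posting lists are plain dicts mapping document to term score, a document's score is the sum of its term scores, each term's upper bound is the maximum score in its (non-empty) posting list, and `top_k` is a hypothetical name.

```python
import heapq


def top_k(postings, k):
    """Return the k best (score, doc) pairs, highest score first.

    postings maps each query term to a non-empty dict {doc: term_score}.
    """
    upper = {term: max(plist.values()) for term, plist in postings.items()}
    heap = []      # min-heap holding the current top-k as (score, doc)
    seen = set()
    for term, plist in postings.items():
        for doc, contribution in plist.items():
            if doc in seen:
                continue
            seen.add(doc)
            # Other query terms that match this document.
            others = [u for u in postings if u != term and doc in postings[u]]
            # Optimistic bound: real contribution of this term plus the
            # upper bound of every other matching term.
            bound = contribution + sum(upper[u] for u in others)
            if len(heap) == k and bound <= heap[0][0]:
                continue  # cannot beat the current k-th score: skip the document
            score = contribution + sum(postings[u][doc] for u in others)
            if len(heap) < k:
                heapq.heappush(heap, (score, doc))
            elif score > heap[0][0]:
                heapq.heapreplace(heap, (score, doc))
    return sorted(heap, reverse=True)
```

Because the bound never underestimates a document's true score, a skipped document could not have displaced any current top-k entry; only ties with the k-th score may be resolved differently from an exhaustive scan.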
-
Publication No.: US20190294691A1
Publication Date: 2019-09-26
Application No.: US15928723
Filing Date: 2018-03-22
Applicant: Oath Inc.
Inventor: Edward Bortnikov, David Carmel, Gali Sheffi, Idit Keidar, Dmitry Basin
IPC: G06F17/30
Abstract: Methods, systems, and programming for retrieving content items for a search are described herein. In a non-limiting embodiment, a query including a plurality of terms may be received. For each of the plurality of terms, a posting list of one or more content items may be obtained. The posting list may include a ranked list of term scores corresponding to the one or more content items, each of the term scores being indicative of a level of relevance of a corresponding content item to a term associated with the posting list. A list of relevant content items for the query may be determined based on the term scores in each posting list for the one or more content items identified with respect to each term. At least one of the relevant content items may be provided as a response to the query.
-
Publication No.: US20180219870A1
Publication Date: 2018-08-02
Application No.: US15925625
Filing Date: 2018-03-19
Applicant: Oath Inc.
Inventor: Edward Bortnikov, Ohad Shacham, Idit Keidar, Eshcar Hillel, Sameer Paranjpye
CPC classification number: H04L63/108, G06F9/466, G06F16/2322, H04L2463/121
Abstract: Briefly, embodiments disclosed herein relate to managing transactions in a distributed content transaction system.