Updating data models for streaming data

    公开(公告)号:US10579623B2

    公开(公告)日:2020-03-03

    申请号:US15143138

    申请日:2016-04-29

    Abstract: Dynamically updating a ridge regression data model of a continuous stream of data is disclosed. New data chunks corresponding to a current data accumulation point are received and the data values in the new data chunks are transformed via standardization methods. A ridge estimator for the standardized data that includes data chunks received up to a penultimate data accumulation point to include the new data chunks is dynamically updated. The cumulative observations received up to the current data accumulation point are updated and stored. Predictions for the continuous data stream are generated based on the updated ridge estimator.

    Data stream analytics
    2.
    发明授权

    公开(公告)号:US11599561B2

    公开(公告)日:2023-03-07

    申请号:US15142504

    申请日:2016-04-29

    Abstract: Examples disclosed herein involve data stream analytics. In examples herein, a data stream may be analyzed by computing a set of hashes of a real-valued vector, the real-valued vector corresponding to a sample data object of a data stream; generating a list of data objects from a database corresponding to the sample data object based on the set of hashes, the list of data objects ordered based on similarity of the data objects to the sample data object of the data stream; and updating a data structure representative of activity of the sample data object in the data stream based on the list of data objects, the data structure to provide incremental analysis corresponding to the sample data object.

    Incremental automatic update of ranked neighbor lists based on k-th nearest neighbors

    公开(公告)号:US10810458B2

    公开(公告)日:2020-10-20

    申请号:US16073891

    申请日:2015-12-03

    Abstract: Incremental automatic update of ranked neighbor lists based on k-th nearest neighbors is disclosed. One example is a system including an indexing module to retrieve an incoming data stream, and retrieve ranked neighbor lists for received data objects. An evaluator determines similarity measures between the received data objects and their respective k-th nearest neighbors. A threshold determination module determines a statistical distribution based on the determined similarity measures, and a threshold based on the statistical distribution. The evaluator determines additional similarity measures between a new data object in the data stream and the received data objects. A neighbor update module automatically selects a sub-plurality of the received data objects by comparing the additional similarity measures to the threshold, and determines, for each selected data object, if the respective retrieved neighbor list is to be incrementally updated based on neighborhood comparisons for the new data object and the selected data object.

    SECURE INFORMATION RETRIEVAL BASED ON HASH TRANSFORMS

    公开(公告)号:US20170163424A1

    公开(公告)日:2017-06-08

    申请号:US15327015

    申请日:2014-08-29

    Abstract: Secure information retrieval is disclosed. One example is a system including an information retriever comprising a collection of nodes that receive a hash count from a first dataset, the first dataset including a first data term, and provide the hash count to a second dataset, the second dataset including a plurality of second data terms. A hash transformer transforms the data terms based on the hash count. A modifier modifies, for a given node, the transformed data terms. An evaluator evaluates, for each node, a similarity value between the first data term and each given second data term based on shared data elements between the modified first data term and a given modified second data term associated with the given second data term. The information retriever provides to the first dataset, at least one term identifier associated with a second data term.

    Automatic selection of neighbor lists to be incrementally updated

    公开(公告)号:US10803053B2

    公开(公告)日:2020-10-13

    申请号:US16073836

    申请日:2015-12-03

    Abstract: Automatic selection of neighbor lists to be incrementally updated is disclosed. One example is a system including an indexing module to receive an incoming data stream, and retrieve neighbor lists for received data objects. An evaluator determines similarity measures between pairs of the received data objects. A threshold determination module determines distributions of order statistics based on the determined similarity measures and retrieved neighbor lists, and a threshold based on the distributions of order statistics. The evaluator determines additional similarity measures between a new data object in the data stream and the received data objects. A neighbor update module automatically selects a sub-plurality of the received data objects by comparing the additional similarity measures to the threshold, and determines, for each selected data object, if the respective retrieved neighbor list is to be incrementally updated based on neighborhood comparisons for the new data object and the selected data object.

    AUTOMATIC SELECTION OF NEIGHBOR LISTS TO BE INCREMENTALLY UPDATED

    公开(公告)号:US20190034479A1

    公开(公告)日:2019-01-31

    申请号:US16073836

    申请日:2015-12-03

    Abstract: Automatic selection of neighbor lists to be incrementally updated is disclosed. One example is a system including an indexing module to receive an incoming data stream, and retrieve neighbor lists for received data objects. An evaluator determines similarity measures between pairs of the received data objects. A threshold determination module determines distributions of order statistics based on the determined similarity measures and retrieved neighbor lists, and a threshold based on the distributions of order statistics. The evaluator determines additional similarity measures between a new data object in the data stream and the received data objects. A neighbor update module automatically selects a sub-plurality of the received data objects by comparing the additional similarity measures to the threshold, and determines, for each selected data object, if the respective retrieved neighbor list is to be incrementally updated based on neighborhood comparisons for the new data object and the selected data object.

    SECURE MULTI-PARTY INFORMATION RETRIEVAL
    8.
    发明申请

    公开(公告)号:US20180114028A1

    公开(公告)日:2018-04-26

    申请号:US15567531

    申请日:2015-05-01

    Abstract: Secure multi-party information retrieval is disclosed. One example is a system including a query processor to request secure retrieval of candidate terms similar to a query term. A collection of information processors, where a given information processor receives the request and generates a random permutation. A plurality of data processors, where a given data processor generates clusters of a plurality of terms in a given dataset, where the clusters are based on similarity scores for pairs of terms, and selects a representative term from each cluster. The given information processor determines similarity scores between a secured query term received from the query processor and secured representative terms received from the given data processor, where the secured terms are based on the permutation, and the given data processor filters, without knowledge of the query term, the candidate terms of the plurality of terms based on the determined similarity scores.

    DATA STREAM ANALYTICS
    9.
    发明申请

    公开(公告)号:US20170316081A1

    公开(公告)日:2017-11-02

    申请号:US15142504

    申请日:2016-04-29

    CPC classification number: G06F16/285 G06F16/2228 G06F16/24568 G06F16/289

    Abstract: Examples disclosed herein involve data stream analytics. In examples herein, a data stream may be analyzed by computing a set of hashes of a real-valued vector, the real-valued vector corresponding to a sample data object of a data stream; generating a list of data objects from a database corresponding to the sample data object based on the set of hashes, the list of data objects ordered based on similarity of the data objects to the sample data object of the data stream; and updating a data structure representative of activity of the sample data object in the data stream based on the list of data objects, the data structure to provide incremental analysis corresponding to the sample data object.

Patent Agency Ranking