DATABASE SYSTEMS AND USER INTERFACES FOR PROCESSING DISCRETE DATA ITEMS WITH STATISTICAL MODELS ASSOCIATED WITH CONTINUOUS PROCESSES

    公开(公告)号:US20230214690A1

    公开(公告)日:2023-07-06

    申请号:US18182240

    申请日:2023-03-10

    CPC classification number: G06N5/04 G06N20/00 G06F30/20

    Abstract: A computer-implemented method is provided to predict one or more expected quantities using a machine learning model. The method system may comprise steps to receive a set of data items associated with one or more characteristics, generate or train a machine learning model using the set of data items and associated characteristics, receive one or more sets of simulation parameters from a user indicating a hypothetical scenario and a time period, and generate user interface data. The user interface data may comprise a time-based chart illustrating the respective time periods. The computing system may further apply machine learning model to the set of simulation parameters to predict a set of expected quantities based on the simulation parameters, aggregate one or more types of expected quantities from the set of expected quantities to determine one or more combined quantities, and include in the user interface indications of the one or more combined quantities. The computing system may then cause the user interface to be presented. In some implementations of the method as disclosed herein, receiving the data items may comprise retrieving one or more discrete events from a data source, and converting the one or more discrete events into one or more continuous quantities.

    Systems and methods for managing custom code in a data computing platform

    公开(公告)号:US11689530B2

    公开(公告)日:2023-06-27

    申请号:US16700153

    申请日:2019-12-02

    Inventor: James Ding

    CPC classification number: H04L63/101 H04L63/0281 H04L63/20

    Abstract: A system for managing custom code within a data computing platform determines that a request for one or more uniform resource identifiers external to the platform is being made by custom code executing in the platform. In response to the determination, the system checks a whitelist of allowable external URIs against the requested one or more URIs and allows access to the requested one or more URIs if a match is detected with the whitelist, otherwise access by the custom code to the requested one or more URIs is denied. In addition, or alternatively, the system checks a blacklist of disallowed external URIs against the requested one or more URIs and denies access to the requested one or more URIs if a match is detected with the blacklist, otherwise access by the custom code to the requested one or more URIs is allowed. The blacklist can override the whitelist.

    Time-series data storage and processing database system

    公开(公告)号:US11687543B2

    公开(公告)日:2023-06-27

    申请号:US16805257

    申请日:2020-02-28

    CPC classification number: G06F16/2477 G06F16/248 G06F16/2428

    Abstract: A database system is described that includes components for storing time-series data and executing custom, user-defined computational expressions in substantially real-time such that the results can be provided to a user device for display in an interactive user interface. For example, the database system may process stored time-series data in response to requests from a user device. The request may include a start time, an end time, a period, and/or a computational expression. The database system may retrieve the time-series data identified by the computational expression and, for each period, perform the arithmetic operation(s) identified by the computational expression on data values corresponding to times within the start time and the end time. Once all new data values have been generated, the database system may transmit the new data values to the user device for display in the interactive user interface.

    Performing database joins in distributed data processing systems

    公开(公告)号:US11687532B2

    公开(公告)日:2023-06-27

    申请号:US17557883

    申请日:2021-12-21

    CPC classification number: G06F16/24544 G06F16/2282 G06F16/2456 G06F16/24532

    Abstract: A computer-implemented method for efficiently performing a database join in a distributed data processing system comprising multiple computational nodes, the method comprising determining a first set of one or more columns of a first database table and a second set of one or more columns of a second database table on which the join is to be performed; estimating a size of the rows of the first table which have a particular combination of values in the first set of columns; computing a salt factor n based on the estimated size of rows and further based on a processing capacity of a computational node of the distributed data processing system; assigning one of n different salt values to each row of the first table having the particular combination of values in the first set of columns; for each row of the second table having the particular combination of values in the second set of columns into n rows, expanding the row into n row, and assigning to each expanded row a different one of the n salt values; and performing a join operation on the modified first and second tables, wherein the rows of the first and second tables have the same combination of values in the first and second sets of columns and the same salt value are joined on the same computational node.

    Systems and methods for grouping and enriching data items accessed from one or more databases for presentation in a user interface

    公开(公告)号:US11681694B2

    公开(公告)日:2023-06-20

    申请号:US17445878

    申请日:2021-08-25

    Inventor: Luke Tomlin

    CPC classification number: G06F16/2423 G06F3/0484 G06F16/24578

    Abstract: Embodiments of the present disclosure relate to a data analysis system for grouping and enriching data items for presentation to an analyst through a user interface. Data items from one or more data sources are combined into memory-efficient clustered data structures, which may be stored as one or more data tables in a database. Analysis and scoring of those clustered data structures can be performed by utilizing various criteria or rules to generate scores, reports, alerts, or conclusions that may aid an analyst in evaluating the clustered data structures. The analysis and scoring may also be added to the clustered data structures which are stored as one or more data tables in a database. The analyst may be prompted to create a dossier format or specification and to additional enrichments to be performed on the raw data items in the clustered data structures. The system can also perform versioning on the raw data items in the one or more data tables, wherein the versioning is performed at least by aggregating a subset of raw data items from the one or more data tables. The system may then search, group, or filter the raw data items based on the analyst-defined dossier format, as well as add enrichments to the data. Some examples of enrichments include changing the way the data is displayed, inserting data located in a separate reference table, or ordering data to help construct timelines, histograms, and/or other visualizations based upon the various attributes of the raw data items. The enriched data may be presented to the analyst through a user interface, in the user-defined format or specification in order to allow the analyst to efficiently evaluate the data clusters in the context of, for example, a risky trading investigation.

    Systems and methods for accessing federated data

    公开(公告)号:US11681690B2

    公开(公告)日:2023-06-20

    申请号:US17693172

    申请日:2022-03-11

    CPC classification number: G06F16/2379 G06F16/252 G06F16/256

    Abstract: Systems and methods are provided that allow federated data from various source systems to be accessed and analyzed through a data analysis platform. The federated data may be stored in different formats. The data analysis platform can receive the federated data in whatever format it has been stored at its respective source system. A script can be used to generate temporary representations (or temporary objects) for the federated data by transforming the federated data. Moreover, the temporary representations or temporary objects can be further transformed into a data analysis platform-specific format. A user of data analysis platform may access and/or manipulate either the temporary representations or objects as well as the data analysis platform-specific objects. Temporary objects can be transformed automatically into corresponding platform-specific objects when necessary to provide an enhanced capability or operation on the objects.

Patent Agency Ranking