Sampling in sliding windows with tight optimality and time decayed design

    公开(公告)号:US11496374B2

    公开(公告)日:2022-11-08

    申请号:US16867436

    申请日:2020-05-05

    发明人: Daniel Ting

    IPC分类号: H04L43/02 H04L49/90

    摘要: A method samples a stream of data items. Each data item has an associated timestamp. The method assigns a priority value to each data item. Each data item is represented as a point on a two-dimensional graph whose axes are time and priority. A sliding window covers a predetermined length of time tspan and uses a backward probability decay curve to specify what priority values are included in the sliding window. This defines, for a current time tc, a current data sample consisting of data items whose timestamps t fall within the time span tc−tspan≤t≤tc and have priority values below the decay curve. The data sample is stored in a buffer. The process iteratively moves the sliding window forward by a time increment, creating a provisional data sample. When the size of the provisional data sample is too large or too small, the process scales the decay curve.

    Elimination of query fragment duplication in complex database queries

    公开(公告)号:US11475005B2

    公开(公告)日:2022-10-18

    申请号:US17064490

    申请日:2020-10-06

    IPC分类号: G06F16/2453

    摘要: A database engine includes one or more computing devices, each having one or more processors and memory. The memory stores programs configured for execution by the processors. The database engine receives a database query from a client, and parses the database query to build a query operator tree. The query operator tree includes a plurality of query operators. The database engine performs one or more optimization passes on the query operator tree, including a deduplication optimization pass, to form an optimized execution plan. The deduplication optimization pass includes determining that a first query operator is equivalent to a second query operator during a traversal of the query operator tree, and replacing the second query operator with a link to reuse results from the first query operator. The database engine executes the optimized execution plan to retrieve a result set from the database and returns the result set to the client.

    Systems and Methods for Using Multiple Aggregation Levels in a Single Data Visualization

    公开(公告)号:US20220300525A1

    公开(公告)日:2022-09-22

    申请号:US17714070

    申请日:2022-04-05

    摘要: A computer system displays a graphical user interface that includes a data visualization region, a shelf region, and a schema information region. The computer system receives a first user input to place a first dimension data field from the schema information region into the shelf region. In accordance with the first user input, the computer system determines a visualization level of detail that is based on at least the first dimension data field. The computer system receives a second user input to specify a first aggregation. The first aggregation has a custom level of detail that is distinct from the visualization level of detail. The computer system builds a visual specification that includes the visualization and custom levels of detail. The computer system generates a data visualization corresponding to the visual specification and displays the data visualization in the data visualization region.

    INTERACTIVE DATA VISUALIZATION USER INTERFACE WITH HIERARCHICAL FILTERING BASED ON GESTURE LOCATION ON A CHART

    公开(公告)号:US20220004293A1

    公开(公告)日:2022-01-06

    申请号:US17478815

    申请日:2021-09-17

    发明人: Robin STEWART

    摘要: A device displays a chart representing data from a dataset. The chart has a first region displaying labels corresponding to data values of a first data field, a second region displaying labels corresponding to data values of a second data field, and a third region displaying visual marks representing aggregated data corresponding to pairs of data values from the first and second data fields. In response to a user input, the device removes visual marks from the chart. When the input location is the first region, the device removes visual marks in the first chart corresponding to a first data value of the first data field. When the input location is the second region, the device removes visual marks corresponding to a second data value of the second data field. Otherwise, the device removes only one visual mark. This updates the displayed chart.

    Highlighting Data Marks in Popup Secondary Data Visualizations According to Selected Data Values from Primary Data Visualizations

    公开(公告)号:US20210382604A1

    公开(公告)日:2021-12-09

    申请号:US17409759

    申请日:2021-08-23

    发明人: Marc RUETER Jun Kim

    摘要: A computing device displays a primary data visualization in a data visualization workbook that includes a second worksheet having a secondary data visualization. The primary data visualization includes a plurality of visual marks representing a first set of data fields from a dataset. A first user input in a tooltip definition refers to the secondary data visualization. A second user input selects a first visual mark. In response to the second user input, the device generates a highlighted version of the secondary data visualization, for a second set of data fields from the dataset, specified in the second worksheet, based on the selected first visual mark. The device displays the generated secondary data visualization in a tooltip. Sometimes the device receives a third user input selecting a second visual mark in the plurality of visual marks, and dynamically updates the displayed secondary data visualization based on the second visual mark.

    Systems and Methods for Visualizing Object Models of Database Tables

    公开(公告)号:US20210256039A1

    公开(公告)日:2021-08-19

    申请号:US17307427

    申请日:2021-05-04

    摘要: A method visualizes object models for data sources. The device displays, in an object model visualization region, a first visualization of a tree of data object icons, each data object icon representing a logical combination of one or more tables. The device also displays recommendations for one or more data sources to add objects to the tree. While displaying the first visualization in the object model visualization region, the device detects a first input on a first data object icon of the tree of data object icons. In response to detecting the first input on the first data object icon, the device displays a second visualization of the tree of the data object icons in a first portion of the object model visualization region and displays a third visualization of information related to the first data object icon in a second portion of the object model visualization region.

    Determining levels of detail for data visualizations using natural language constructs

    公开(公告)号:US11055489B2

    公开(公告)日:2021-07-06

    申请号:US16166125

    申请日:2018-10-21

    摘要: A computing device displays a data visualization interface and receives user selection of a data source and a natural language command directed to the data source. The device forms an intermediate expression according to a context-free grammar and a semantic model of data fields in the data source. It identifies in the natural language command (i) a first term that specifies an aggregation type in a first aggregation, (ii) a second term that specifies a data field, in the semantic model, to be aggregated for the first aggregation, and (iii) terms that specify data fields, in the semantic model, to determine grouping for the first aggregation. It translates the intermediate expression into database queries, executes the database queries to retrieve one or more data sets from the data source, aggregated according to the first aggregation, then generates and displays an updated data visualization of the retrieved data sets.

    Analyzing natural language expressions in a data visualization user interface

    公开(公告)号:US11048871B2

    公开(公告)日:2021-06-29

    申请号:US16134892

    申请日:2018-09-18

    摘要: A method is performed at a computing device coupled with a display. The method includes displaying a graphical user interface on the display and analyzing a natural language input, received from a user, to identify a portion of the natural language input corresponding to a first phrase that includes a first term. The method further includes receiving, from the user, a second input that modifies the first term in the first phrase. In response to receiving the second input, the computing device updates a second phrase based on the second input. In response to updating the second phrase based on the second input, the computing device displays, on the graphical user interface, an updated natural language expression that comprises the modified first phrase and the updated second phrase and displays an updated data visualization representing the updated natural language expression.