Interactive data visualization user interface with multiple interaction profiles

    Publication No.: US10896532B2

    Publication date: 2021-01-19

    Application No.: US16539812

    Filing date: 2019-08-13

    Inventor: Robin Stewart

    Abstract: A user selects a dataset and a first interaction profile. There are multiple interaction profiles, each specifying a corresponding user interface and a corresponding decision tree for forming data visualizations according to user-selected data fields. A first data visualization user interface is displayed corresponding to the first interaction profile. Using the first data visualization user interface, the user selects data fields from the dataset and uses them to specify data visualization characteristics. The method displays a first data visualization according to the user-selected data fields and the decision tree for the first interaction profile. When the user selects a second interaction profile, the method displays a second data visualization according to the second interaction profile. The second data visualization uses at least some of the user-selected data fields. The second data visualization is displayed according to the specified data visualization characteristics and the decision tree for the second interaction profile.

    Data preparation user interface for aggregate comparison of datasets at different nodes in a process flow

    Publication No.: US10891041B1

    Publication date: 2021-01-12

    Application No.: US16541049

    Filing date: 2019-08-14

    Inventor: Brice Johnson

    Abstract: A method compares data sets in a data preparation application. The method displays a user interface including a data flow pane and a profile pane. The data flow pane displays a flow diagram with nodes, each node corresponding to a data set. When a user selects a first node, the method displays distributions of data values for data fields from a first data set for the first node. When a user selects a second node, the method: (1) forms a composite data set comprising a union of (i) the first data set and (ii) a second data set for the second node; (2) groups data values for each data field to form sets of bins; and (3) displays data value distributions for each data field using the bins. Each bin depicts counts of data values in the bin originating from the first data set and the second data set.
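The union-then-bin comparison described in this abstract can be illustrated with a minimal Python sketch for one numeric data field. The function name `compare_field`, the equal-width binning, and the `num_bins` parameter are assumptions made for illustration; the abstract does not say how bins are formed.

```python
from collections import Counter

def compare_field(first_values, second_values, num_bins=4):
    """Bin the union of two value lists and count each bin's values by origin.

    Hypothetical helper: equal-width bins are an assumption, not the patent's rule.
    Both inputs together must contain at least one value.
    """
    combined = list(first_values) + list(second_values)  # composite (union) data set
    lo, hi = min(combined), max(combined)
    width = (hi - lo) / num_bins or 1  # fall back to 1 when all values are equal

    def bin_index(v):
        # Clamp so the maximum value lands in the last bin.
        return min(int((v - lo) / width), num_bins - 1)

    first_counts = Counter(bin_index(v) for v in first_values)
    second_counts = Counter(bin_index(v) for v in second_values)
    return [
        {"bin": i, "first": first_counts[i], "second": second_counts[i]}
        for i in range(num_bins)
    ]
```

Each returned entry carries two counts, so a profile pane could draw one bar per bin split by originating data set.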

    Computing domain cardinality estimates for optimizing database query execution

    Publication No.: US10824625B1

    Publication date: 2020-11-03

    Application No.: US16236183

    Filing date: 2018-12-28

    IPC classification: G06F16/2453 G06N3/08

    Abstract: A method implements optimization of database queries by computing domain cardinality estimates. A client sends a database query to a server. The method parses the query to identify data columns. For each of the data columns, the method computes a lower bound and an upper bound of distinct data values using a pre-computed table size. The method also computes a patch factor by applying a pre-computed function to a ratio between a number of distinct data values that appear exactly once in a data sample and a number of distinct data values in the sample. Based on the patch factor, the lower bound, and the upper bound, the method computes an estimate of distinct values for each of the data columns. The method subsequently generates an execution plan for the query according to the computed estimates, executes the execution plan, and returns a result set to the client.
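As a rough illustration of the estimate described above, the following Python sketch derives a per-column distinct-value estimate from a sample. Everything beyond the abstract's wording is an assumption: the particular bounds, the linear interpolation between them, and the placeholder `patch_fn` (the abstract only says the function is pre-computed and does not give it).

```python
from collections import Counter

def estimate_distinct(sample, table_size, patch_fn=lambda ratio: ratio ** 2):
    """Estimate the number of distinct values in a column from a non-empty sample.

    patch_fn stands in for the pre-computed function; squaring is a placeholder.
    """
    counts = Counter(sample)
    d = len(counts)                                   # distinct values in the sample
    f1 = sum(1 for c in counts.values() if c == 1)    # values seen exactly once

    # Assumed bounds: at least the distinct values already seen; at most one
    # new distinct value per unsampled row, capped by the table size.
    lower = d
    upper = min(table_size, d + (table_size - len(sample)))

    # Patch factor from the ratio f1 / d, clamped to [0, 1].
    patch = min(max(patch_fn(f1 / d), 0.0), 1.0)

    # Assumed combination: interpolate between the bounds by the patch factor.
    return lower + patch * (upper - lower)
```

A sample in which every value repeats gives a ratio of 0 and returns the lower bound; a sample of all-unique values gives a ratio of 1 and returns the upper bound.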

    INTERACTIVE LINEAGE ANALYZER FOR DATA ASSETS
    Patent application

    Publication No.: US20200334277A1

    Publication date: 2020-10-22

    Application No.: US16389389

    Filing date: 2019-04-19

    Abstract: Embodiments are directed to managing data. A data model that includes data objects associated with layers of the data model may be provided. A lineage model that includes one or more lineage objects associated with a layer may be provided. A lineage object associated with a layer may be activated. A first portion of data objects may be determined based on the activated lineage object. An anchor data object may be determined from the first portion of data objects. A search of the data model may be employed to provide dependency information based on a position of the anchor data object in the data model. A second portion of data objects may be determined based on the dependency information. A report that includes a list of the second portion of data objects, the dependency information, or the like, may be provided.
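The dependency search from an anchor data object could look something like the breadth-first traversal below. This is only a sketch: the `depends_on` mapping, the function name, and the choice of a downstream (dependents-of) search are assumptions, since the abstract does not specify the search direction or the data structures.

```python
from collections import deque

def downstream_dependents(depends_on, anchor):
    """Return objects that depend, directly or indirectly, on the anchor.

    depends_on maps each data object to the objects it reads from
    (a hypothetical representation of the data model).
    """
    # Invert the edges so each source lists the objects that depend on it.
    dependents = {}
    for obj, sources in depends_on.items():
        for src in sources:
            dependents.setdefault(src, []).append(obj)

    seen, queue, order = {anchor}, deque([anchor]), []
    while queue:
        current = queue.popleft()
        for nxt in dependents.get(current, []):
            if nxt not in seen:
                seen.add(nxt)
                order.append(nxt)
                queue.append(nxt)
    return order
```

The returned list plays the role of the "second portion of data objects" that a report would enumerate.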

    Methods and devices for adjusting chart magnification

    Publication No.: US10657685B2

    Publication date: 2020-05-19

    Application No.: US15859235

    Filing date: 2017-12-29

    Inventor: Robin Stewart

    Abstract: A method is performed at an electronic device. The process displays a line chart, which includes a first plurality of data marks and a first plurality of line segments connecting adjacent data marks. The process detects a touch input, and expands the line chart horizontally, including expanding a first line segment. The process adds a second plurality of data marks on the first line segment, thereby subdividing the first line segment into a second plurality of line segments, which are initially collinear. After expanding the chart and adding the second plurality of marks, the process determines an ordinate value for each of the second plurality of marks, and animatedly moves each of the second plurality of marks from the first line segment to a respective vertical location defined by the respective determined ordinate value. The process concurrently moves each of the second plurality of line segments according to its endpoints.
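The geometry of this abstract (new marks start collinear on the expanded segment, then move vertically to their real values) can be sketched in Python as follows. The function names, the even spacing of new marks, and the linear easing in `animate_step` are illustrative assumptions.

```python
def subdivide(p0, p1, n_new):
    """Place n_new marks evenly on the segment p0 -> p1 (initially collinear)."""
    (x0, y0), (x1, y1) = p0, p1
    marks = []
    for i in range(1, n_new + 1):
        t = i / (n_new + 1)  # fraction of the way along the segment
        marks.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
    return marks

def animate_step(start_marks, target_ys, progress):
    """Move each mark vertically toward its determined ordinate.

    progress runs from 0.0 (on the original segment) to 1.0 (final position).
    """
    return [
        (x, y + progress * (ty - y))
        for (x, y), ty in zip(start_marks, target_ys)
    ]
```

Because the new line segments are drawn between consecutive marks, redrawing them after each `animate_step` call moves the segments together with their endpoints.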

    Filtering popup secondary data visualizations according to selected data from primary data visualizations

    Publication No.: US10656779B1

    Publication date: 2020-05-19

    Application No.: US16127108

    Filing date: 2018-09-10

    Inventors: Marc Rueter, Jun Kim

    Abstract: A computing device displays a graphical user interface on a display. The graphical user interface has a data visualization region that displays a primary data visualization, which includes a plurality of visual marks. The primary data visualization represents a first set of data fields of a plurality of data fields from a dataset. The device receives user input in the graphical user interface to select one of the visual marks. In response to the user input, the device generates a secondary data visualization, filtered according to one or more data values. The data values correspond to the first set of data fields and are associated with the selected visual mark. The secondary data visualization represents a second set of data fields from the dataset. The device then displays the generated secondary data visualization in a tooltip for the selected data mark, overlaying a portion of the primary data visualization.
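The filtering step can be pictured with a small Python sketch: the selected mark supplies data values for the primary fields, and the rows that match are projected onto the secondary fields for display in the tooltip. The row-of-dicts representation and the name `secondary_rows` are assumptions for illustration only.

```python
def secondary_rows(rows, selected_mark, secondary_fields):
    """Filter rows by the selected mark's values, then keep the secondary fields.

    rows: list of dicts (the dataset); selected_mark: {primary_field: value}.
    """
    matching = [
        r for r in rows
        if all(r[field] == value for field, value in selected_mark.items())
    ]
    return [{f: r[f] for f in secondary_fields} for r in matching]

# Usage: the user selects the "West" bar in a sales-by-region chart.
data = [
    {"region": "West", "month": "Jan", "sales": 10},
    {"region": "West", "month": "Feb", "sales": 12},
    {"region": "East", "month": "Jan", "sales": 7},
]
tooltip_data = secondary_rows(data, {"region": "West"}, ["month", "sales"])
```

Here `tooltip_data` holds only the West rows, reduced to month and sales, which is what a secondary chart in the tooltip would plot.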

    Correlated Incremental Loading of Multiple Data Sets for an Interactive Data Prep Application

    Publication No.: US20200012656A1

    Publication date: 2020-01-09

    Application No.: US16155818

    Filing date: 2018-10-09

    Abstract: A process displays a user interface that includes a data flow pane and a profile pane. The data flow pane displays a flow diagram that identifies a data source. For each of multiple queries against the data source, the process issues the query against the data source asynchronously with an initial block size. Upon retrieval of the initial set of rows, the process repeats the query asynchronously with an updated block size until all of the rows have been retrieved. Periodically the process determines a high water mark for rows from the data source that have been retrieved for all of the queries. When the water mark changes, the process updates the profile pane to display data value histograms for multiple data fields in the data source. Each bar in each data value histogram counts the rows below the water mark that have a single specific data value or range of data values.
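The incremental loading described above can be sketched in Python under some simplifying assumptions: block sizes grow geometrically (the abstract only says the size is "updated"), the high water mark is the smallest row count reached across all queries, and each histogram is built from the rows below that mark. All function names are hypothetical, and the asynchronous issuing of queries is left out.

```python
from collections import Counter

def block_sizes(total_rows, initial=2, growth=2):
    """Successive block sizes used to fetch total_rows (growth rule is assumed)."""
    sizes, loaded, size = [], 0, initial
    while loaded < total_rows:
        take = min(size, total_rows - loaded)  # last block may be partial
        sizes.append(take)
        loaded += take
        size *= growth
    return sizes

def high_water_mark(rows_retrieved):
    """Row count that every query has reached; rows_retrieved maps query -> count."""
    return min(rows_retrieved.values(), default=0)

def histogram_below_mark(values, mark):
    """Count data values for one field, using only rows below the water mark."""
    return Counter(values[:mark])
```

Restricting the histograms to rows below the mark keeps all fields in the profile pane consistent with one another, even when some queries are further along than others.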

    User interface for graphically refactoring data flows

    Publication No.: US10528587B2

    Publication date: 2020-01-07

    Application No.: US15705174

    Filing date: 2017-09-14

    Abstract: An exemplary method includes displaying a user interface that includes multiple panes, and receiving a first user input to select an existing node from a flow diagram or a flow element template. The system displays a moveable icon representing a new node for placement in the flow diagram. The new node specifies an operation corresponding to the selected existing node or the selected flow element template. Drop targets are displayed in the flow diagram according to dependencies between the data flow operation of the new node and operations of existing nodes. A second user input places the moveable icon over a first drop target. In response to ceasing to detect the second user input, the system inserts the new node into the flow diagram at the first drop target. The new node performs the specified data flow operation.