Correlated incremental loading of multiple data sets for an interactive data prep application

    Publication No.: US10885057B2

    Publication date: 2021-01-05

    Application No.: US16155818

    Filing date: 2018-10-09

    Abstract: A user interface includes a data flow pane and a profile pane. The data flow pane displays a flow diagram that identifies a data source. For each of multiple queries against the data source, the process issues the query against the data source asynchronously with an initial block size. Upon retrieval of the initial set of rows, the process repeats the query asynchronously with an updated block size until all of the rows have been retrieved. Periodically, the process determines a high water mark for rows from the data source that have been retrieved for all of the queries. When the water mark changes, the process updates the profile pane to display data value histograms for multiple data fields in the data source. Each bar in each data value histogram counts the rows below the water mark that have a single specific data value or range of data values.
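
The loading scheme in this abstract can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: `load_query`, `high_water_mark`, `load_all`, the doubling rule for the updated block size, and the use of `asyncio.sleep(0)` as a stand-in for a real asynchronous query. The water mark is taken to be the smallest row count reached by any query, so every row below it has been retrieved for all of the queries.

```python
import asyncio


async def load_query(name, total_rows, initial_block, progress, growth=2):
    """Fetch one query's rows block by block, recording how many have arrived."""
    retrieved, block = 0, initial_block
    while retrieved < total_rows:
        await asyncio.sleep(0)  # stand-in for one asynchronous query round trip
        retrieved += min(block, total_rows - retrieved)
        progress[name] = retrieved
        block *= growth  # assumed update rule; the abstract only says "updated"
    return retrieved


def high_water_mark(progress):
    """Largest row count that every query has already reached."""
    return min(progress.values()) if progress else 0


async def load_all(total_rows, initial_blocks):
    """Run all queries concurrently and return each distinct water mark seen."""
    progress = {name: 0 for name in initial_blocks}
    tasks = [
        asyncio.create_task(load_query(name, total_rows, block, progress))
        for name, block in initial_blocks.items()
    ]
    marks = []
    while True:
        mark = high_water_mark(progress)
        if not marks or mark > marks[-1]:
            marks.append(mark)  # a profile pane refresh would be triggered here
        if all(task.done() for task in tasks):
            break
        await asyncio.sleep(0)  # "periodically": yield, then check again
    await asyncio.gather(*tasks)
    return marks
```

Calling `asyncio.run(load_all(1000, {"q_region": 100, "q_price": 250}))` returns the increasing sequence of water marks, starting at 0 and ending at 1000 once both hypothetical queries have retrieved every row.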

    Correlated Incremental Loading of Multiple Data Sets for an Interactive Data Prep Application

    Publication No.: US20200012656A1

    Publication date: 2020-01-09

    Application No.: US16155818

    Filing date: 2018-10-09

    Abstract: A user interface includes a data flow pane and a profile pane. The data flow pane displays a flow diagram that identifies a data source. For each of multiple queries against the data source, the process issues the query against the data source asynchronously with an initial block size. Upon retrieval of the initial set of rows, the process repeats the query asynchronously with an updated block size until all of the rows have been retrieved. Periodically, the process determines a high water mark for rows from the data source that have been retrieved for all of the queries. When the water mark changes, the process updates the profile pane to display data value histograms for multiple data fields in the data source. Each bar in each data value histogram counts the rows below the water mark that have a single specific data value or range of data values.

    Correlated Incremental Loading of Multiple Data Sets for an Interactive Data Prep Application

    Publication No.: US20210124756A1

    Publication date: 2021-04-29

    Application No.: US17142138

    Filing date: 2021-01-05

    Abstract: A user interface includes a data flow pane and a profile pane. The data flow pane displays a flow diagram that identifies a data source. For each of multiple queries against the data source, the process issues the query against the data source asynchronously with a block size. Upon retrieval of the initial set of rows, the process repeats the query asynchronously until all of the rows have been retrieved. Periodically, the process determines a high water mark for rows from the data source that have been retrieved for all of the queries. When the water mark changes, the process updates the profile pane to display statistical distributions of data values for multiple data fields in the data source. Each distribution element in each statistical distribution counts the rows below the water mark that have a single specific data value or range of data values.
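
The counting rule in the last sentence of this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the function `distributions_below_mark`, the `columns` layout (one list of retrieved values per data field) and the fixed-width binning via `bin_widths` are all hypothetical. Only rows below the water mark are counted, so every field's distribution covers the same rows.

```python
from collections import Counter


def distributions_below_mark(columns, mark, bin_widths=None):
    """Count values per field, using only the first `mark` rows of each field.

    Fields listed in `bin_widths` are grouped into ranges of that width,
    keyed by the lower bound of the range; other fields are counted by
    exact value.
    """
    bin_widths = bin_widths or {}
    result = {}
    for field, values in columns.items():
        visible = values[:mark]  # rows at or above the mark are ignored
        width = bin_widths.get(field)
        if width:
            keys = [(value // width) * width for value in visible]
        else:
            keys = visible
        result[field] = Counter(keys)
    return result
```

With `columns = {"region": ["E", "W", "E", "N"], "price": [5, 12, 18]}` the water mark is 3, because `price` has only three rows so far. The fourth `region` row is therefore left out of the counts until the slower query catches up.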

    Correlated incremental loading of multiple data sets for an interactive data prep application

    Publication No.: US11188556B2

    Publication date: 2021-11-30

    Application No.: US17142138

    Filing date: 2021-01-05

    Abstract: A user interface includes a data flow pane and a profile pane. The data flow pane displays a flow diagram that identifies a data source. For each of multiple queries against the data source, the process issues the query against the data source asynchronously with a block size. Upon retrieval of the initial set of rows, the process repeats the query asynchronously until all of the rows have been retrieved. Periodically, the process determines a high water mark for rows from the data source that have been retrieved for all of the queries. When the water mark changes, the process updates the profile pane to display statistical distributions of data values for multiple data fields in the data source. Each distribution element in each statistical distribution counts the rows below the water mark that have a single specific data value or range of data values.