PROVIDING DATA VIEWS FROM A TIME-SERIES DATA LAKE TO A DATA WAREHOUSING SYSTEM

    公开(公告)号:US20210271568A1

    公开(公告)日:2021-09-02

    申请号:US17187353

    申请日:2021-02-26

    Applicant: Clumio, Inc.

    Abstract: Techniques are disclosed relating to providing data views from a time-series data lake to a data warehousing system. In various embodiments, the disclosed techniques include providing, by a cloud-based service, a data lake service that maintains a time-series data lake storing a time-series representation of data from a plurality of data sources associated with a first organization. In some embodiments, the cloud-based service may receive additional backup data, including a first backup image of a first data source, associated with the first organization as part of a backup operation. The cloud-based service may then store a logical backup of the first data source in the data lake and, in response to a query from a data warehousing system, the cloud-based service may retrieve a particular view of the backup data from the data lake and provide it to the data warehousing system.

    RETRIEVAL OF DATA FROM A TIME-SERIES DATA LAKE

    公开(公告)号:US20210271684A1

    公开(公告)日:2021-09-02

    申请号:US17187300

    申请日:2021-02-26

    Applicant: Clumio, Inc.

    Abstract: Techniques are disclosed relating to the retrieval of data from a time-series data lake. For example, in various embodiments, the disclosed techniques include providing, by a cloud-based service, a data lake service that maintains data for a plurality of organizations and where, for a first organization, the data lake service maintains a time-series data lake that stores a time-series representation of backup data associated with the first organization. The cloud-based service may receive a request, including one or more search criteria, for data associated with the first organization and, based on the search criteria, retrieve a particular view of the backup data that is stored in the data lake. In various embodiments, the particular view may include backup data from various different data sources and from various different points in time. The cloud-based service may then provide the particular view of the backup data to the requesting entity.

    STORAGE OF BACKUP DATA USING A TIME-SERIES DATA LAKE

    公开(公告)号:US20210271567A1

    公开(公告)日:2021-09-02

    申请号:US17187286

    申请日:2021-02-26

    Applicant: Clumio, Inc.

    Abstract: Techniques are disclosed relating to the storage of backup data using a time-series data lake. For example, in various embodiments, the disclosed techniques include providing a cloud-based data lake service that maintains data for a plurality of organizations and where, for a first organization, the cloud-based data lake service maintains a time-series data lake that stores a time-series representation of data associated with the first organization. In various embodiments, the data lake service may receive backup data from a plurality of data sources associated with the first organization, generate metadata associated with the backup data, and store the backup data, along with the corresponding metadata, in the time-series data lake.

    DATA PRESERVATION USING A TIME-SERIES DATA LAKE

    公开(公告)号:US20210271685A1

    公开(公告)日:2021-09-02

    申请号:US17187359

    申请日:2021-02-26

    Applicant: Clumio, Inc.

    Abstract: Techniques are disclosed relating to data preservation using a time-series data lake. For example, in some embodiments, the disclosed techniques include maintaining, by a cloud-based service, a time-series data lake that includes, for an organization, a time-series representation of a plurality of data sources associated with the organization. In various embodiments, the time-series data lake retains data according to a first retention policy. In response to a request for a subset of data that is associated with the organization, the cloud-based service may retrieve the subset of data from the time-series data lake and then store the subset of data in a particular storage location that retains data according to a second, different retention policy.

    Modification of data in a time-series data lake

    公开(公告)号:US11455316B2

    公开(公告)日:2022-09-27

    申请号:US17187365

    申请日:2021-02-26

    Applicant: Clumio, Inc.

    Abstract: Techniques are disclosed relating to the modification of data in a time-series data lake. For example, in various embodiments, the disclosed techniques include a cloud-based service that maintains a time-series data lake that includes, for an organization, a time-series representation of data from one or more of the organization's data sources. The cloud-based service may receive a request to modify data associated with a particular user of the organization. As a non-limiting example, this request may correspond to a “Right to Be Forgotten” request from the particular user. This request may include one or more search parameters and an indication of one or more modifications to be performed. Based on the request, the cloud-based service may parse the time-series data lake to identify a subset of data that matches the one or more search parameters and perform the requested modifications on the subset of data in the time-series data lake.

    MODIFICATION OF DATA IN A TIME-SERIES DATA LAKE

    公开(公告)号:US20210271686A1

    公开(公告)日:2021-09-02

    申请号:US17187365

    申请日:2021-02-26

    Applicant: Clumio, Inc.

    Abstract: Techniques are disclosed relating to the modification of data in a time-series data lake. For example, in various embodiments, the disclosed techniques include a cloud-based service that maintains a time-series data lake that includes, for an organization, a time-series representation of data from one or more of the organization's data sources. The cloud-based service may receive a request to modify data associated with a particular user of the organization. As a non-limiting example, this request may correspond to a “Right to Be Forgotten” request from the particular user. This request may include one or more search parameters and an indication of one or more modifications to be performed. Based on the request, the cloud-based service may parse the time-series data lake to identify a subset of data that matches the one or more search parameters and perform the requested modifications on the subset of data in the time-series data lake.

Patent Agency Ranking