Programmatically choosing preferred storage parameters for files in large-scale distributed storage systems

    Publication number: US11880340B2

    Publication date: 2024-01-23

    Application number: US16926391

    Filing date: 2020-07-10

    Applicant: Google LLC

    Abstract: A method includes receiving trace data representing access information about files stored in a large-scale distributed storage system, identifying file access patterns based on the trace data, receiving metadata information associated with the files stored in the large-scale distributed storage system, and generating a preferred storage parameter for each file based on the received metadata information and the identified file access patterns. The method also includes receiving file reliability or accessibility information of a new file, determining whether the received file reliability or accessibility information of the new file matches information of a file group of the files in the large-scale distributed storage system, and when the file reliability or accessibility information of the new file matches the information of the file group, storing the new file in the large-scale distributed storage system using the preferred storage parameter associated with the file group.
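
    Illustrative sketch (not part of the patent text): the flow described in the abstract above might look roughly like the following Python. Everything concrete here is an assumption made for illustration: the `FileMeta` fields, the read-count "access pattern", the `hot_threshold` cutoff, and the parameter names `replication-3` and `erasure-6+3` do not come from the patent. The sketch also derives one parameter per file group rather than per file, to keep the matching step short.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class FileMeta:
    path: str
    reliability: str     # hypothetical label, e.g. "high" or "standard"
    accessibility: str   # hypothetical label, e.g. "hot" or "cold"


def count_reads(trace):
    """Identify a very simple access pattern: number of reads per file path."""
    reads = defaultdict(int)
    for path, op in trace:
        if op == "read":
            reads[path] += 1
    return reads


def preferred_parameters(files, trace, hot_threshold=3):
    """Map each (reliability, accessibility) group to a storage parameter."""
    reads = count_reads(trace)
    per_group = defaultdict(list)
    for f in files:
        per_group[(f.reliability, f.accessibility)].append(reads[f.path])
    params = {}
    for group, counts in per_group.items():
        mean_reads = sum(counts) / len(counts)
        # Hypothetical policy: frequently read groups get replication,
        # rarely read groups get a space-efficient erasure code.
        if mean_reads >= hot_threshold:
            params[group] = "replication-3"
        else:
            params[group] = "erasure-6+3"
    return params


def choose_for_new_file(new_file, params, default="replication-3"):
    """Use the group's preferred parameter when the new file matches a group."""
    return params.get((new_file.reliability, new_file.accessibility), default)


files = [
    FileMeta("/logs/a", "standard", "cold"),
    FileMeta("/logs/b", "standard", "cold"),
    FileMeta("/db/c", "high", "hot"),
]
trace = [("/db/c", "read")] * 5 + [("/logs/a", "read"), ("/logs/b", "write")]

params = preferred_parameters(files, trace)
new_param = choose_for_new_file(FileMeta("/logs/new", "standard", "cold"), params)
print(new_param)
```

    A new file whose reliability and accessibility labels match no existing group falls back to the `default` argument, which is likewise an assumption of this sketch.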

    Programmatically choosing preferred storage parameters for files in large-scale distributed storage systems

    Publication number: US12222901B2

    Publication date: 2025-02-11

    Application number: US17445339

    Filing date: 2021-08-18

    Applicant: Google LLC

    Abstract: A method includes receiving trace data representing access information about files stored in a large-scale distributed storage system, identifying file access patterns based on the trace data, receiving metadata information associated with the files stored in the large-scale distributed storage system, and generating a preferred storage parameter for each file based on the received metadata information and the identified file access patterns. The method also includes receiving file reliability or accessibility information of a new file, determining whether the received file reliability or accessibility information of the new file matches information of a file group of the files in the large-scale distributed storage system, and when the file reliability or accessibility information of the new file matches the information of the file group, storing the new file in the large-scale distributed storage system using the preferred storage parameter associated with the file group.

    Programmatically choosing preferred storage parameters for files in large-scale distributed storage systems

    Publication number: US11113242B2

    Publication date: 2021-09-07

    Application number: US16434376

    Filing date: 2019-06-07

    Applicant: Google LLC

    Abstract: A method includes receiving trace data representing access information about files stored in a large-scale distributed storage system, identifying file access patterns based on the trace data, receiving metadata information associated with the files stored in the large-scale distributed storage system, and generating a preferred storage parameter for each file based on the received metadata information and the identified file access patterns. The method also includes receiving file reliability or accessibility information of a new file, determining whether the received file reliability or accessibility information of the new file matches information of a file group of the files in the large-scale distributed storage system, and when the file reliability or accessibility information of the new file matches the information of the file group, storing the new file in the large-scale distributed storage system using the preferred storage parameter associated with the file group.

    Programmatically choosing preferred storage parameters for files in large-scale distributed storage systems

    Publication number: US11995037B2

    Publication date: 2024-05-28

    Application number: US16702535

    Filing date: 2019-12-03

    Applicant: Google LLC

    Abstract: A method includes receiving trace data representing access information about files stored in a large-scale distributed storage system, identifying file access patterns based on the trace data, receiving metadata information associated with the files stored in the large-scale distributed storage system, and generating a preferred storage parameter for each file based on the received metadata information and the identified file access patterns. The method also includes receiving file reliability or accessibility information of a new file, determining whether the received file reliability or accessibility information of the new file matches information of a file group of the files in the large-scale distributed storage system, and when the file reliability or accessibility information of the new file matches the information of the file group, storing the new file in the large-scale distributed storage system using the preferred storage parameter associated with the file group.

    PROGRAMMATICALLY CHOOSING PREFERRED STORAGE PARAMETERS FOR FILES IN LARGE-SCALE DISTRIBUTED STORAGE SYSTEMS

    Publication number: US20210382852A1

    Publication date: 2021-12-09

    Application number: US17445339

    Filing date: 2021-08-18

    Applicant: Google LLC

    Abstract: A method includes receiving trace data representing access information about files stored in a large-scale distributed storage system, identifying file access patterns based on the trace data, receiving metadata information associated with the files stored in the large-scale distributed storage system, and generating a preferred storage parameter for each file based on the received metadata information and the identified file access patterns. The method also includes receiving file reliability or accessibility information of a new file, determining whether the received file reliability or accessibility information of the new file matches information of a file group of the files in the large-scale distributed storage system, and when the file reliability or accessibility information of the new file matches the information of the file group, storing the new file in the large-scale distributed storage system using the preferred storage parameter associated with the file group.

    PROGRAMMATICALLY CHOOSING PREFERRED STORAGE PARAMETERS FOR FILES

    Publication number: US20250165435A1

    Publication date: 2025-05-22

    Application number: US19033464

    Filing date: 2025-01-21

    Applicant: Google LLC

    Abstract: A method includes receiving trace data representing access information about files stored in a large-scale distributed storage system, identifying file access patterns based on the trace data, receiving metadata information associated with the files stored in the large-scale distributed storage system, and generating a preferred storage parameter for each file based on the received metadata information and the identified file access patterns. The method also includes receiving file reliability or accessibility information of a new file, determining whether the received file reliability or accessibility information of the new file matches information of a file group of the files in the large-scale distributed storage system, and when the file reliability or accessibility information of the new file matches the information of the file group, storing the new file in the large-scale distributed storage system using the preferred storage parameter associated with the file group.

    Scaling high-level statistical languages to large, distributed datasets

    Publication number: US10203936B1

    Publication date: 2019-02-12

    Application number: US15367326

    Filing date: 2016-12-02

    Applicant: Google LLC

    Abstract: A system and method for performing large-scale data processing using a statistical programming language are disclosed. One or more high-level statistical operations may be received. The received high-level statistical operations may be dynamically translated into a graph of low-level data operations. Unnecessary operations may be removed, and operations may be fused or chained together. Operations may then be grouped into distributed data processing operations. The low-level operations may then be run.
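
    Illustrative sketch (not part of the patent text): the two graph-optimization steps named in the abstract above, removing unnecessary operations and fusing chained ones, might be approximated as follows. The `Op` record, the three operation kinds, and the rule "fuse a map into its producer map when it is the only consumer" are assumptions chosen for brevity. The grouping of operations into distributed jobs is not modeled.

```python
from dataclasses import dataclass, field


@dataclass
class Op:
    name: str
    kind: str                                   # "source", "map", or "reduce"
    inputs: list = field(default_factory=list)  # names of upstream ops


def prune(ops, outputs):
    """Drop operations that no requested output depends on."""
    by_name = {op.name: op for op in ops}
    live, stack = set(), list(outputs)
    while stack:
        name = stack.pop()
        if name not in live:
            live.add(name)
            stack.extend(by_name[name].inputs)
    return [op for op in ops if op.name in live]


def fuse_maps(ops):
    """Chain a map into its producer map when it is the only consumer.

    Assumes `ops` is listed in topological order.
    """
    consumers = {}
    for op in ops:
        for src in op.inputs:
            consumers.setdefault(src, []).append(op.name)

    result, alias = {}, {}  # alias: original op name -> fused op name
    for op in ops:
        inputs = [alias.get(i, i) for i in op.inputs]
        if op.kind == "map" and len(inputs) == 1:
            producer = result[inputs[0]]
            if producer.kind == "map" and len(consumers[op.inputs[0]]) == 1:
                fused_name = producer.name + "+" + op.name
                del result[producer.name]
                result[fused_name] = Op(fused_name, "map", producer.inputs)
                alias[op.name] = fused_name
                continue
        result[op.name] = Op(op.name, op.kind, inputs)
    return list(result.values())


ops = [
    Op("load", "source"),
    Op("scale", "map", ["load"]),
    Op("log", "map", ["scale"]),
    Op("debug_copy", "map", ["load"]),  # never used by the requested output
    Op("mean", "reduce", ["log"]),
]
plan = fuse_maps(prune(ops, outputs=["mean"]))
print([op.name for op in plan])
```

    In this toy graph `debug_copy` is pruned because `mean` does not depend on it, and `scale` and `log` collapse into a single `scale+log` map.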
