SYSTEMS AND METHODS FOR SHARDING BASED ON DISTRIBUTED INVERTED INDEXES

    公开(公告)号:US20230185765A1

    公开(公告)日:2023-06-15

    申请号:US18164105

    申请日:2023-02-03

    Applicant: Box, Inc.

    CPC classification number: G06F16/137 G06F16/152 G06F16/183

    Abstract: According to one embodiment, distributing data across a plurality of storage shards can comprise generating a file key for each file of a plurality of files stored in a plurality of physical shards, each physical shard maintained by a node of a plurality of nodes in one or more clusters. The file key can comprise a hash of an enterprise identifier for an entity to which the creator of the file is a member, a hash of a folder identifier for a location in which the file is stored, and a hash of a file identifier uniquely identifying the file. The generated file keys can be sorted into an ordered list and the ordered list can be logically partitioning into a plurality of logical shards. Each logical shard of the plurality of logical shards can then be mapped to one of the plurality of physical shards.

    Systems and methods for sharding based on distributed inverted indexes

    公开(公告)号:US11599500B2

    公开(公告)日:2023-03-07

    申请号:US16600106

    申请日:2019-10-11

    Applicant: Box, Inc.

    Abstract: According to one embodiment, distributing data across a plurality of storage shards can comprise generating a file key for each file of a plurality of files stored in a plurality of physical shards, each physical shard maintained by a node of a plurality of nodes in one or more clusters. The file key can comprise a hash of an enterprise identifier for an entity to which the creator of the file is a member, a hash of a folder identifier for a location in which the file is stored, and a hash of a file identifier uniquely identifying the file. The generated file keys can be sorted into an ordered list and the ordered list can be logically partitioning into a plurality of logical shards. Each logical shard of the plurality of logical shards can then be mapped to one of the plurality of physical shards.

Patent Agency Ranking