Systems and methods for sharding based on distributed inverted indexes

    Publication No.: US11599500B2

    Publication date: 2023-03-07

    Application No.: US16600106

    Filing date: 2019-10-11

    Applicant: Box, Inc.

    Abstract: According to one embodiment, distributing data across a plurality of storage shards can comprise generating a file key for each file of a plurality of files stored in a plurality of physical shards, each physical shard maintained by a node of a plurality of nodes in one or more clusters. The file key can comprise a hash of an enterprise identifier for an entity of which the creator of the file is a member, a hash of a folder identifier for a location in which the file is stored, and a hash of a file identifier uniquely identifying the file. The generated file keys can be sorted into an ordered list, and the ordered list can be logically partitioned into a plurality of logical shards. Each logical shard of the plurality of logical shards can then be mapped to one of the plurality of physical shards.
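
The steps in the abstract (build a composite file key, sort the keys, cut the ordered list into logical shards, map logical shards to physical shards) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the abstract does not name a hash function, a key layout, a partitioning rule, or a mapping policy, so the truncated SHA-256 hash, the plain concatenation of the three hashes, the near-equal contiguous split, and the round-robin mapping are all hypothetical choices, as are the names `FileRecord`, `file_key`, `partition`, and `map_logical_to_physical`.

```python
import hashlib
from typing import Dict, List, NamedTuple


class FileRecord(NamedTuple):
    """Hypothetical record holding the three identifiers named in the abstract."""
    enterprise_id: str
    folder_id: str
    file_id: str


def _hash8(value: str) -> str:
    # Assumption: SHA-256 truncated to 8 hex characters. The abstract only
    # says "a hash" and does not specify the function or its width.
    return hashlib.sha256(value.encode("utf-8")).hexdigest()[:8]


def file_key(rec: FileRecord) -> str:
    # Assumption: the key is the concatenation enterprise | folder | file,
    # so keys sharing an enterprise (and then a folder) share a prefix.
    return _hash8(rec.enterprise_id) + _hash8(rec.folder_id) + _hash8(rec.file_id)


def partition(sorted_keys: List[str], num_logical: int) -> List[List[str]]:
    # Cut the ordered list into contiguous ranges of near-equal size.
    # Assumption: equal-count splitting; other boundary rules are possible.
    n = len(sorted_keys)
    bounds = [(i * n) // num_logical for i in range(num_logical + 1)]
    return [sorted_keys[bounds[i]:bounds[i + 1]] for i in range(num_logical)]


def map_logical_to_physical(num_logical: int, physical: List[str]) -> Dict[int, str]:
    # Assumption: simple round-robin assignment of logical to physical shards.
    return {i: physical[i % len(physical)] for i in range(num_logical)}


if __name__ == "__main__":
    records = [
        FileRecord("ent1", "folderA", "file1"),
        FileRecord("ent1", "folderB", "file2"),
        FileRecord("ent2", "folderA", "file3"),
    ]
    keys = sorted(file_key(r) for r in records)
    logical = partition(keys, 2)
    mapping = map_logical_to_physical(len(logical), ["node-a", "node-b"])
    for idx, shard in enumerate(logical):
        print(idx, mapping[idx], shard)
```

Under the concatenation assumption, every file of one enterprise shares the same key prefix, so after sorting those files sit next to each other and tend to fall into the same or neighboring logical shards. Keeping the logical-to-physical mapping as a separate table means a logical shard could be reassigned to another node without recomputing any keys.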
