Real-time modification of application programming interface behavior

    公开(公告)号:US11983580B2

    公开(公告)日:2024-05-14

    申请号:US17452773

    申请日:2021-10-28

    Applicant: Box, Inc.

    CPC classification number: G06F9/541 G06F16/2379 G06F21/6245 G06F2221/2113

    Abstract: Methods, systems, and computer program products for application-specific, real-time modification of application programming interface behavior. Meaning is derived from analysis of human-readable intelligence found in a collaboration object of a content management system. The meaning is used to inform the behavior of an application programming interface that is exposed to applications that interface with the content management system. The content management system invokes a range of analysis modules that examine the human-intelligible contents of a requested collaboration object to determine meaning from the human-intelligible contents. Content-derived tags are emitted based on the analysis. When the application invokes an entry point of the API, a set of content-derived tags are associated with the application, and the occurrence and/or values of the content-derived tags are then used in rules. The results of evaluation of the rules determine how the API will respond to the application.

    Detection of personally identifiable information

    公开(公告)号:US11941147B2

    公开(公告)日:2024-03-26

    申请号:US17463372

    申请日:2021-08-31

    Applicant: Box, Inc.

    CPC classification number: G06F21/6245

    Abstract: Methods, systems, and computer program products for detection of personally identifiable information (PII). A first detector and a second detector are configured to interoperate. The first detector is different from the second detector and the second detector incurs a greater computational cost than the first detector when processing identical content. Content is presented to the first detector so as to implement a first type of PII detection that is based at least in part on regular expression analysis using regular expressions. The content is presented to the second detector. The second detector performs PII detection based on content analysis that is different from the first detector's regular expression analysis. The second detector causes generation of new regular expressions based on the content analysis and the first detector is updated with such new regular expressions. Performance of the first detector is continually improved as new regular expressions are generated.

    Efficient process for creating range-partitioned indexes ensuring uniform document distribution

    公开(公告)号:US11941028B2

    公开(公告)日:2024-03-26

    申请号:US16244289

    申请日:2019-01-10

    Applicant: Box, Inc.

    CPC classification number: G06F16/278 G06F16/2272 G06F16/2282

    Abstract: Embodiments are directed to distributing records among storage partitions by maintaining a table of records. The table of records can be indexed based on an original partitioning key in the table of records. A plurality of counters can be initialized with each counter associated with a sub-range in a total range of key values for a secondary index partitioning key. Each record of the table of records can be read and a count of records in the associated sub-range can be accumulated in each counter. The number of records per partition can be determined based on the total number of records in the total range of key values and the number of available partitions and the records can be distributed to the available partitions in the storage system based on the number of records in each sub-range.

    Prioritizing operations over content objects of a content management system

    公开(公告)号:US11936656B2

    公开(公告)日:2024-03-19

    申请号:US17163222

    申请日:2021-01-29

    Applicant: Box, Inc.

    Abstract: Content object operations over content objects of a content management system are prioritized to be performed immediately, or at a later time. The immediate scheduling of an operation is determined by policies, rules, and/or predictive model outcomes. The determination for later time scheduling is based on analysis of a history of events on content objects. If the content object operation is deemed to be at least potentially delayable to a later time, then a scheduling model is consulted to determine an urgency of performing the content object operation on the content object. The urgency value resulting from consulting the scheduling model is combined with then-current resource availability to determine a timeframe for performance of the content object operation on the content object relative to other entries in a continuously updated list of to-be-performed operations. The performance of the content object operation on the content object is initiated in due course.

    Form and template detection
    15.
    发明授权

    公开(公告)号:US11928425B2

    公开(公告)日:2024-03-12

    申请号:US16948831

    申请日:2020-10-01

    Applicant: Box, Inc.

    CPC classification number: G06F40/186 G06F16/14 G06F16/164

    Abstract: Methods, systems and computer program products for content management systems. A content management system is configured to manage a plurality of content objects. Unsupervised learning is performed over the plurality of content objects to identify document templates associated with content objects taken from the plurality of content objects. When a document template is identified, template metadata is associated with the document template. Additional content objects that are similar to the document template can take on the template metadata. In this way, many documents can be automatically populated with template metadata that corresponds to the identified document template. All or portions of the template metadata can be applied to policies, which policies serve to marshal ongoing document handling operations. During learning, document features are extracted and analyzed so as to define feature clusters, which feature clusters are in turn are used to form document template clusters.

    Workflow selection
    17.
    发明授权

    公开(公告)号:US11768707B2

    公开(公告)日:2023-09-26

    申请号:US16553161

    申请日:2019-08-27

    Applicant: Box, Inc.

    CPC classification number: G06F9/5027 G06F16/254 G06F16/288 H04L67/10 H04L67/53

    Abstract: Methods, systems and computer program products for content management systems. In a computing environment having multiple interconnected computer systems, one or more network communication links between a first computer system that hosts a first application and a second computer system that hosts one or more second applications are established. The first computer system hosts a plurality of shared content objects that can be operated over by any second applications that are interfaced with the first application. Particular interaction events over a shared content object take place at the second applications and/or at the interfaces between the first and second applications. Such particular interaction events are recorded as they occur, after which, based on event attributes associated with the particular interaction events, at least one workflow of the first application is selected. The selected workflow of the first application is executed to perform operations over the shared content object.

    SYNTHETIC TRAINING DATASETS FOR PERSONALLY IDENTIFIABLE INFORMATION CLASSIFIERS

    公开(公告)号:US20230244811A1

    公开(公告)日:2023-08-03

    申请号:US17589610

    申请日:2022-01-31

    Applicant: Box, Inc.

    CPC classification number: G06F21/6245 G06F21/6272 G06F16/93

    Abstract: Handling user-demanded privacy controls over data of an electronic document collaboration system. A storage facility is configured to store content objects and associated metadata that pertains to the content objects. A user raises a privacy action request that comprises a demand to change how certain content objects that contain personally identifiable information (PII) of the user are handled. A plurality of content objects are classified using a PII classifier that is trained using synthetically-generated training set entries where, rather than reading actual contents from electronic documents of the collaboration system to generate training set entries, instead, the training set entries are generated using words that are randomly selected from a repository of natural language words. When PII corresponding to the user who raised the privacy action request is discovered in content objects, then the content management system modifies those content objects and/or its metadata in accordance with the demand.

Patent Agency Ranking