-
31.
公开(公告)号:US20230385033A1
公开(公告)日:2023-11-30
申请号:US18327795
申请日:2023-06-01
Applicant: Palantir Technologies Inc.
Inventor: Mark Elliot , Punyashloka Biswal , Ankit Shankar , Omar Ali , John Chakerian , Ben Duffield , Adam Borochoff
CPC classification number: G06F8/34 , G06F8/33 , G06F8/41 , G06F8/36 , G06F11/3668 , G06T11/206 , G06F9/45512
Abstract: The programming notebook system, methods, and user interfaces described herein provide software developers with enhanced tools by which a programming notebook workflow and session history associated with code cells in a programming notebook may be tracked and maintained. As a developer progresses through a development workflow, the developer can select an option to save a program code card representing some or all of the program code cell inputs. A card editor user interface may present an aggregated listing of all program code the developer has provided across multiple code cells during the current session which the developer can edit, refine, and/or comment. The card editor may also allow the developer to add associated user interface code to display a UI component associated with the program code card, and allow the developer to add a description and tags for the card so that the card can be searched for and reused.
-
公开(公告)号:US11809216B2
公开(公告)日:2023-11-07
申请号:US17820062
申请日:2022-08-16
Applicant: Palantir Technologies Inc.
Inventor: Audrey Kuan , Andrew Kaier , Eric Lee , Jasjit Grewal , Mark Elliot , Nitish Kulkarni , Robert Fink , Samuel Rogerson , Thomas Pearson , Thomas Powell , Lawrence Manning , Corey Garvey
CPC classification number: G06F8/71 , G06F9/3885 , G06F9/4494 , G06F9/4881 , G06F16/2329 , G06F16/2358
Abstract: An apparatus, and a method, performed by one or more processors are disclosed. The method receives a build request associated with performing an external data processing task on a first data set, the first data set being stored in memory associated with a data processing platform to be performed at a system external to the data processing platform. The method generates a task identifier for the data processing task, and providing, in association with the task identifier, the first data set to an agent associated with the external system with an indication of the data processing task, the agent being arranged to cause performance of the task at the external system, to receive a second data set resulting from performance of the task, and to provide the second data set and associated metadata indicative of the transformation. The method receives the second data set and metadata from the agent associated with the external system and stores the second data set and associated metadata.
-
公开(公告)号:US11803566B2
公开(公告)日:2023-10-31
申请号:US17644426
申请日:2021-12-15
Applicant: Palantir Technologies Inc.
Inventor: Robert Fink , James Baker , Mark Elliot
IPC: G06F16/25 , G06F16/23 , G06F16/2457
CPC classification number: G06F16/254 , G06F16/2358 , G06F16/2457
Abstract: Disclosed herein is a data structure which includes a sequence of events, each event associated with a sequence number indicating a temporal position of an event within the sequence of events; one or more read-offsets, each read-offset associated with a consumer, wherein each read-offset indicates a sequence number up to which a consumer has read events within the sequence of events; and at least one snapshot which represents events with sequence numbers smaller than the smallest read-offset in a compacted form. Disclosed herein is also a computer-implemented method of maintaining the data structure. Disclosed herein is a computer-implemented method, wherein the method is performed on a sequence of events accessible by a plurality of consumers, each event associated with a sequence number indicating a temporal position of an event within the sequence of events, each consumer associated with a read-offset indicating the sequence number up to which the consumer has read events within the sequence of events, the method includes determining a smallest read-offset of all read-offsets; compacting events with sequence numbers smaller than the smallest read-offset into a snapshot; and replacing the events with sequence numbers smaller than the smallest read-offset with the snapshot. Disclosed herein are corresponding computer-readable media and computing systems.
-
公开(公告)号:US11625369B2
公开(公告)日:2023-04-11
申请号:US17583751
申请日:2022-01-25
Applicant: Palantir Technologies Inc.
Inventor: Joseph Ellis , Mark Elliot , Robert Fink , Sandor Van Wassenhove
IPC: G06F16/00 , G06F16/21 , G06F16/245 , G06F16/23
Abstract: Systems and methods are provided for data migration. The system may comprise one or more processors and a memory storing instructions that, when executed by the one or more processors, cause the system to migrate at least one first table of a first database schema to at least one second table of a second database schema, determine a query for modifying the first table during the migration, modify the second table based at least in part on the query, and update a mutation table to describe the modification, wherein the mutation table at least describes the modification.
-
公开(公告)号:US11620280B2
公开(公告)日:2023-04-04
申请号:US17444715
申请日:2021-08-09
Applicant: Palantir Technologies Inc.
Inventor: Benjamin Duffield , Joshua Casale , Mark Elliot , Matthew Sills , Robert Kruszewski , Rahij Ramsharan
IPC: G06F16/23 , G06F16/27 , G06F16/2455 , G06F16/248
Abstract: A database system comprised of a decoupled compute layer and storage layer is implemented to store, build, and maintain a canonical dataset, a temporary buffer, and projection datasets. The canonical dataset is a set of batch updated data. The data is appended in chunks to the canonical dataset such that the canonical dataset becomes a historical dataset over time. The buffer is a write ahead log that contains the most recent chunks of data and provides atomicity and durability for the database system. The projection datasets are indexes of the canonical dataset and/or the buffer that may have single or multiple column sort-orders and/or particular data formats. The writes to the canonical dataset, projection datasets, and buffer may be asynchronous and therefore the database system is advantageously less resource constrained.
-
公开(公告)号:US11580206B2
公开(公告)日:2023-02-14
申请号:US16784225
申请日:2020-02-06
Applicant: Palantir Technologies Inc.
Inventor: Hannah Korus , Brian Schimpf , Lam Tran , Mark Elliot , Robert Kruszewski
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for data security protection are provided. One of the methods includes: receiving a job associated with a project, wherein the project is associated with one or more data sources; identifying a plurality of inputs and a plurality of outputs associated with the job; determining a plurality of required permissions associated with the job, wherein each of the required permissions comprises an operation on a required data source, the operation corresponding to at least one of the inputs or the outputs; verifying that the one or more data sources associated with the project comprise the required data source associated with each of the required permissions; and generating a token associated with the job, the token encoding the required permissions associated with the job, wherein the token is required for execution of the job.
-
公开(公告)号:US11573776B1
公开(公告)日:2023-02-07
申请号:US17091912
申请日:2020-11-06
Applicant: Palantir Technologies Inc.
Inventor: Robert Fink , Matthew Cheah , Mingyu Kim , Lynn Cuthriell , Divyanshu Arora , Justin Uang , Jared Newman , Jakob Juelich , Kevin Chen , Mark Elliot , Michael Nazario
Abstract: Data transformation in a distributed system of applications and data repositories is described. The subsystems for the overall framework are distributed, thereby allowing for customization to require only isolated changes to one or more subsystems. In one embodiment, a source code repository is used to receive and store source code. A build subsystem can retrieve source code from the source code repository and build it, using one or more criteria. By building the source code, the build subsystem can generate an artifact, which is executable code, such as a JAR or SQL file. Likewise, by building the source code, the build subsystem can generate one or more job specifications for executing the executable code. In one embodiment, the artifact and job specification may be used to launch an application server in a cluster. The application server can then receive data transformation instructions and execute the data transformation instructions.
-
公开(公告)号:US11556535B2
公开(公告)日:2023-01-17
申请号:US17443219
申请日:2021-07-22
Applicant: Palantir Technologies Inc.
Inventor: Mark Elliot , Joseph Ellis , Rahij Ramsharan , Matthew Sills , Lawrence Manning
IPC: G06F16/2453 , G06F16/248
Abstract: A database system comprised of a decoupled compute layer and storage layer is implemented to store, build, and maintain a canonical dataset, a temporary buffer, and an edits dataset. The canonical dataset is a set of batch updated data. The data is appended in chunks to the canonical dataset such that the canonical dataset becomes a historical dataset over time. The buffer is a write ahead log that contains the most recent chunks of data and provides atomicity and durability for the database system. The edits dataset is the set of data that contains edits such as cell mutations, row appends and/or row deletions. The database system enables users to make cell or row-level edits to tables and observe those edits in analytical systems or downstream builds with minimal latency.
-
公开(公告)号:US20220229900A1
公开(公告)日:2022-07-21
申请号:US17713868
申请日:2022-04-05
Applicant: Palantir Technologies Inc.
Inventor: Thomas Powell , Kyle Patron , Mark Elliot
Abstract: A method performed by one or more processors, and an apparatus is disclosed. The method may comprise identifying a request from a custom computer program within a sandbox to perform an operation not permitted within the sandbox, and receiving a first indication of security privileges associated with a provider of the custom computer program. The method may also comprise selectively causing the operation to be performed based on the first indication of security privileges.
-
公开(公告)号:US20220121767A1
公开(公告)日:2022-04-21
申请号:US17646341
申请日:2021-12-29
Applicant: Palantir Technologies Inc.
Inventor: Brandon Krieger , Mark Elliot , Matthew Lynch
Abstract: Example embodiments relate to a policy service employed to perform operations to: generate and maintain a data-set that comprises at least a column and a row that intersect at a cell; assign an access policy to a row or column of the data-set, wherein the access policy is defined by one or more access credentials required to receive access the cell that intersect with the row or column; receive a request to read the data-set from a user account, wherein the user account has an associated credential; filter the cell that intersects with the row or column of the data-set based on the access policy and the credential of the user account, in response to receiving the request from the user account; and provide the user account with access to the filtered data-set.
-
-
-
-
-
-
-
-
-