-
公开(公告)号:US12130810B2
公开(公告)日:2024-10-29
申请号:US18083157
申请日:2022-12-16
Applicant: Palantir Technologies Inc.
Inventor: Mark Elliot , Joseph Ellis , Rahij Ramsharan , Matthew Sills , Lawrence Manning
IPC: G06F16/00 , G06F16/18 , G06F16/215 , G06F16/2453 , G06F16/2455 , G06F16/248
CPC classification number: G06F16/24537 , G06F16/1805 , G06F16/215 , G06F16/2456 , G06F16/248 , G06F16/24564
Abstract: A database system comprised of a decoupled compute layer and storage layer is implemented to store, build, and maintain a canonical dataset, a temporary buffer, and an edits dataset. The canonical dataset is a set of batch updated data. The data is appended in chunks to the canonical dataset such that the canonical dataset becomes a historical dataset over time. The buffer is a write ahead log that contains the most recent chunks of data and provides atomicity and durability for the database system. The edits dataset is the set of data that contains edits such as cell mutations, row appends and/or row deletions. The database system enables users to make cell or row-level edits to tables and observe those edits in analytical systems or downstream builds with minimal latency.
-
公开(公告)号:US10127289B2
公开(公告)日:2018-11-13
申请号:US15233149
申请日:2016-08-10
Applicant: Palantir Technologies Inc.
Inventor: Lawrence Manning , Rahul Mehta , Daniel Erenrich , Guillem Palou Visa , Roger Hu , Xavier Falco , Rowan Gilmore , Eli Bingham , Jason Prestinario , Yifei Huang , Daniel Fernandez , Jeremy Elser , Clayton Sader , Rahul Agarwal , Matthew Elkherj , Nicholas Latourette , Aleksandr Zamoshchin
IPC: G06F17/30
Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.
-
3.
公开(公告)号:US20170300197A1
公开(公告)日:2017-10-19
申请号:US15636234
申请日:2017-06-28
Applicant: Palantir Technologies Inc.
Inventor: John Chakerian , Carl Freeland , Jack Grossman , Lawrence Manning , Adam Torres , Michael Yang
IPC: G06F3/0482 , G06F17/30
Abstract: Computer-implemented systems and methods are disclosed to interface with one or more storage devices storing a plurality of documents, wherein each of the plurality of documents is associated with one or more tags of one or more predefined hierarchies of tags, wherein the one or more hierarchies of tags include multiple dimensions. In accordance with some embodiments, a method is provided to identify one or more documents from the data storage devices. The method comprises acquiring, via an interface, a selection of one or more tags of the one or more predefined hierarchies of tags. The method further comprises identifying one or more documents from the data storage devices in response to the selection, the identified one or more documents having tags that have a relationship with the selected tags, and providing data corresponding to the identified documents for displaying in the interface.
-
公开(公告)号:US11995064B2
公开(公告)日:2024-05-28
申请号:US17469767
申请日:2021-09-08
Applicant: Palantir Technologies Inc.
Inventor: Jonathan Lafleche , Justin Uang , Onur Satici , Yifei Huang , Ovidiu-Dan Sanduleac , Lawrence Manning
CPC classification number: G06F16/2365 , G06F9/485 , G06F16/219
Abstract: Systems, methods, and non-transitory computer readable media are provided for managing expiration of modules. An expiry dataset may be maintained. The expiry dataset may include a set of identifiers corresponding to a set of modules, a set of expiry values for the set of modules, and a set of termination tasks for the set of modules. A request to refresh a module may be received from a client. Responsive to the reception of the request, an expiry value and a termination task for the module within the expiry dataset may be updated. The expiry value may be independent of a timestamp associated with the request.
-
公开(公告)号:US20220374454A1
公开(公告)日:2022-11-24
申请号:US17812984
申请日:2022-07-15
Applicant: Palantir Technologies Inc.
Inventor: Lawrence Manning , Rahul Mehta , Daniel Erenrich , Guillem Palou Visa , Roger Hu , Xavier Falco , Rowan Gilmore , Eli Bingham , Jason Prestinario , Yifei Huang , Daniel Fernandez , Jeremy Elser , Clayton Sader , Rahul Agarwal , Matthew Elkherj , Nicholas Latourette , Aleksandr Zamoshchin
Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.
-
公开(公告)号:US11449333B2
公开(公告)日:2022-09-20
申请号:US16900071
申请日:2020-06-12
Applicant: Palantir Technologies Inc.
Inventor: Audrey Kuan , Andrew Kaier , Eric Lee , Jasjit Grewal , Mark Elliot , Nitish Kulkarni , Robert Fink , Samuel Rogerson , Thomas Pearson , Thomas Powell , Lawrence Manning , Corey Garvey
Abstract: An apparatus, and a method, performed by one or more processors are disclosed. The method may comprise receiving a build request associated with performing an external data processing task on a first data set, the first data set being stored in memory associated with a data processing platform to be performed at a system external to the data processing platform. The method may also comprise generating a task identifier for the data processing task, and providing, in association with the task identifier, the first data set to an agent associated with the external system with an indication of the data processing task, the agent being arranged to cause performance of the task at the external system, to receive a second data set resulting from performance of the task, and to provide the second data set and associated metadata indicative of the transformation. The method may also comprise receiving the second data set and metadata from the agent associated with the external system and storing the second data set and associated metadata.
-
公开(公告)号:US10719313B2
公开(公告)日:2020-07-21
申请号:US16251578
申请日:2019-01-18
Applicant: Palantir Technologies Inc.
Inventor: Audrey Kuan , Andrew Kaier , Eric Lee , Jasjit Grewal , Mark Elliot , Nitish Kulkarni , Robert Fink , Samuel Rogerson , Thomas Pearson , Thomas Powell , Lawrence Manning , Corey Garvey
Abstract: An apparatus, and a method, performed by one or more processors are disclosed. The method may comprise receiving a build request associated with performing an external data processing task on a first data set, the first data set being stored in memory associated with a data processing platform to be performed at a system external to the data processing platform. The method may also comprise generating a task identifier for the data processing task, and providing, in association with the task identifier, the first data set to an agent associated with the external system with an indication of the data processing task, the agent being arranged to cause performance of the task at the external system, to receive a second data set resulting from performance of the task, and to provide the second data set and associated metadata indicative of the transformation. The method may also comprise receiving the second data set and metadata from the agent associated with the external system and storing the second data set and associated metadata.
-
8.
公开(公告)号:US20190079937A1
公开(公告)日:2019-03-14
申请号:US16189040
申请日:2018-11-13
Applicant: Palantir Technologies Inc.
Inventor: Lawrence Manning , Rahul Mehta , Daniel Erenrich , Guillem Palou Visa , Roger Hu , Xavier Falco , Rowan Gilmore , Eli Bingham , Jason Prestinario , Yifei Huang , Daniel Fernandez , Jeremy Elser , Clayton Sader , Rahul Agarwal , Matthew Elkherj , Nicholas Latourette , Aleksandr Zamoshchin
IPC: G06F17/30
Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.
-
公开(公告)号:US11704325B2
公开(公告)日:2023-07-18
申请号:US17812984
申请日:2022-07-15
Applicant: Palantir Technologies Inc.
Inventor: Lawrence Manning , Rahul Mehta , Daniel Erenrich , Guillem Palou Visa , Roger Hu , Xavier Falco , Rowan Gilmore , Eli Bingham , Jason Prestinario , Yifei Huang , Daniel Fernandez , Jeremy Elser , Clayton Sader , Rahul Agarwal , Matthew Elkherj , Nicholas Latourette , Aleksandr Zamoshchin
IPC: G06F16/00 , G06F16/2457 , G06F16/35 , G06F16/9535 , G06F16/28 , G06F18/23
CPC classification number: G06F16/24578 , G06F16/285 , G06F16/35 , G06F16/9535 , G06F18/23
Abstract: Computer implemented systems and methods are disclosed for automatically clustering and canonically identifying related data in various data structures. Data structures may include a plurality of records, wherein each record is associated with a respective entity. In accordance with some embodiments, the systems and methods further comprise identifying clusters of records associated with a respective entity by grouping the records into pairs, analyzing the respective pairs to determine a probability that both members of the pair relate to a common entity, and identifying a cluster of overlapping pairs to generate a collection of records relating to a common entity. Clusters may further be analyzed to determine canonical names or other properties for the respective entities by analyzing record fields and identifying similarities.
-
10.
公开(公告)号:US11526255B2
公开(公告)日:2022-12-13
申请号:US17241470
申请日:2021-04-27
Applicant: Palantir Technologies Inc.
Inventor: John Chakerian , Carl Freeland , Jack Grossman , Lawrence Manning , Adam Torres , Michael Yang
IPC: G06F16/30 , G06F3/0482 , G06F16/81 , G06F16/84 , G06F16/93 , G06F16/25 , G06F16/28 , G06F16/955
Abstract: Computer-implemented systems and methods are disclosed to interface with one or more storage devices storing a plurality of documents, wherein each of the plurality of documents is associated with one or more tags of one or more predefined hierarchies of tags, wherein the one or more hierarchies of tags include multiple dimensions. In accordance with some embodiments, a method is provided to identify one or more documents from the data storage devices. The method comprises acquiring, via an interface, a selection of one or more tags of the one or more predefined hierarchies of tags. The method further comprises identifying one or more documents from the data storage devices in response to the selection, the identified one or more documents having tags that have a relationship with the selected tags, and providing data corresponding to the identified documents for displaying in the interface.
-
-
-
-
-
-
-
-
-