Techniques for data extraction
    22.
    发明授权

    公开(公告)号:US10133782B2

    公开(公告)日:2018-11-20

    申请号:US15225437

    申请日:2016-08-01

    Abstract: Computer-implemented techniques for data extraction are described. The techniques include a method and system for retrieving an extraction job specification, wherein the extraction job specification comprises a source repository identifier that identifies a source repository comprising a plurality of data records; a data recipient identifier that identifies a data recipient; and a schedule that indicates a timing of when to retrieve the plurality of data records. The method and system further include retrieving the plurality of data records from the source repository based on the schedule, creating an extraction transaction from the plurality of data records, wherein the extraction transaction comprises a subset of the plurality of data records and metadata, and sending the extraction transaction to the data recipient.

    Interactive user interface for dynamic data analysis exploration and query processing

    公开(公告)号:US09870389B2

    公开(公告)日:2018-01-16

    申请号:US15398113

    申请日:2017-01-04

    Abstract: The systems and methods described herein provide highly dynamic and interactive data analysis user interfaces which enable data analysts to quickly and efficiently explore large volume data sources. In particular, a data analysis system, such as described herein, may provide features to enable the data analyst to investigate large volumes of data over many different paths of analysis while maintaining detailed and retraceable steps taken by the data analyst over the course of an investigation, as captured via the data analyst's queries and user interaction with the user interfaces provided by the data analysis system. Data analysis paths may involve exploration of high volume data sets, such as Internet proxy data, which may include trillions of rows of data. The data analyst may pursue a data analysis path that involves, among other things, applying filters, joining to other tables in a database, viewing interactive data visualizations, and so on.

    Method and system for generating a parser and parsing complex data
    25.
    发明授权
    Method and system for generating a parser and parsing complex data 有权
    用于生成解析器和解析复杂数据的方法和系统

    公开(公告)号:US09495353B2

    公开(公告)日:2016-11-15

    申请号:US14526066

    申请日:2014-10-28

    Inventor: Mark Elliot

    CPC classification number: G06F17/2705 G06F8/427 G06F17/30595 G06F17/30985

    Abstract: Computer-implemented systems and methods are disclosed for constructing a parser that parses complex data. In some embodiments, a method is provided for receiving a parser definition as an input to a parser generator and generating a parser at least in part from the parser definition. In some embodiments, the generated parser comprises two or more handlers forming a processing pipeline. In some embodiments, the parser receives as input a first string into the processing pipeline. In some embodiments, the parser generates a second string by a first handler and inputs the second string regeneratively into the parsing pipeline, if the first string matches an expression specified for the first handler in the parser definition.

    Abstract translation: 公开了用于构建解析复杂数据的解析器的计算机实现的系统和方法。 在一些实施例中,提供了一种用于接收解析器定义作为解析器生成器的输入并且至少部分地从解析器定义生成解析器的方法。 在一些实施例中,所生成的解析器包括形成处理流水线的两个或多个处理器。 在一些实施例中,解析器将作为输入的第一串接收到处理流水线中。 在一些实施例中,如果第一个字符串匹配在解析器定义中为第一个处理程序指定的表达式,则解析器由第一处理程序生成第二个字符串并将其再次输入到解析管道中。

    Interactive user interface for dynamic data analysis exploration and query processing
    26.
    发明授权
    Interactive user interface for dynamic data analysis exploration and query processing 有权
    交互式用户界面,用于动态数据分析探索和查询处理

    公开(公告)号:US09335911B1

    公开(公告)日:2016-05-10

    申请号:US14858647

    申请日:2015-09-18

    Abstract: The systems and methods described herein provide highly dynamic and interactive data analysis user interfaces which enable data analysts to quickly and efficiently explore large volume data sources. In particular, a data analysis system, such as described herein, may provide features to enable the data analyst to investigate large volumes of data over many different paths of analysis while maintaining detailed and retraceable steps taken by the data analyst over the course of an investigation, as captured via the data analyst's queries and user interaction with the user interfaces provided by the data analysis system. Data analysis paths may involve exploration of high volume data sets, such as Internet proxy data, which may include trillions of rows of data. The data analyst may pursue a data analysis path that involves, among other things, applying filters, joining to other tables in a database, viewing interactive data visualizations, and so on.

    Abstract translation: 本文所述的系统和方法提供高度动态和交互的数据分析用户界面,使数据分析人员能够快速有效地探索大量数据源。 特别地,诸如本文所描述的数据分析系统可以提供特征,使得数据分析者可以在许多不同的分析途径中调查大量的数据,同时保持数据分析者在调查过程中采取的详细和可追溯的步骤 ,通过数据分析师的查询和用户与数据分析系统提供的用户界面进行交互捕获。 数据分析路径可能涉及大量数据集的探索,例如互联网代理数据,其可以包括数万亿行数据。 数据分析师可以追踪数据分析路径,其中包括应用过滤器,加入数据库中的其他表,查看交互式数据可视化等。

    LOW-LATENCY DATABASE SYSTEM
    27.
    发明申请

    公开(公告)号:US20250013640A1

    公开(公告)日:2025-01-09

    申请号:US18891642

    申请日:2024-09-20

    Abstract: A computer system can receive one or more edits to be made to a canonical dataset and can temporarily store the one or more edits in a buffer. In response to receipt of a query of the canonical dataset, the computer system can rewrite the query to read from the canonical dataset and the buffer; combine the one or more edits from the buffer with the canonical dataset to form a combined dataset based on resolution policies to avoid conflicts between data; rewrite the query to execute on the combined dataset in lieu of the canonical dataset to optimize query performance; and execute the query on the combined dataset.

    PROVIDING EXTERNAL ACCESS TO A PROCESSING PLATFORM

    公开(公告)号:US20240012642A1

    公开(公告)日:2024-01-11

    申请号:US18473520

    申请日:2023-09-25

    CPC classification number: H04L63/083 G06F16/2379

    Abstract: An apparatus, and a method, performed by one or more processors are disclosed. The method receives a build request associated with performing an external data processing task on a first data set, the first data set being stored in memory associated with a data processing platform to be performed at a system external to the data processing platform. The method generates a task identifier for the data processing task, and provides, in association with the task identifier, the first data set to an agent associated with the external system with an indication of the data processing task, the agent being arranged to cause performance of the task at the external system, to receive a second data set resulting from performance of the task, and to provide the second data set and associated metadata indicative of the transformation. The method receives the second data set and metadata from the agent associated with the external system and stores the second data set and associated metadata.

Patent Agency Ranking