System and method for generating size-based splits in a massively parallel or distributed database environment

    公开(公告)号:US11544268B2

    公开(公告)日:2023-01-03

    申请号:US16502720

    申请日:2019-07-03

    Abstract: A system and method is described for database split generation in a massively parallel or distributed database environment including a plurality of databases and a data warehouse layer providing data summarization and querying functionality. A database table accessor of the system obtains, from an associated client application, a query for data in a table of the data warehouse layer, wherein the query includes a user preference. The system obtains table data representative of properties of the table, and determines a splits generator in accordance with one or more of the user preference or the properties of the table. The system generates, by the selected splits generator, table splits dividing the user query into a plurality of query splits, and outputs the plurality of query splits to an associated plurality of mappers for execution by the associated plurality of mappers of each of the plurality of query splits against the table.

    SYSTEM AND METHOD FOR TRANSPARENT MULTI KEY-VALUE WEIGHTED ATTRIBUTED CONNECTION USING UNI-TAG CONNECTION POOLS
    4.
    发明申请
    SYSTEM AND METHOD FOR TRANSPARENT MULTI KEY-VALUE WEIGHTED ATTRIBUTED CONNECTION USING UNI-TAG CONNECTION POOLS 有权
    使用非标签连接池进行透明多重键加权归属连接的系统和方法

    公开(公告)号:US20140379756A1

    公开(公告)日:2014-12-25

    申请号:US14202348

    申请日:2014-03-10

    CPC classification number: G06F17/3056

    Abstract: A system and method for transparent multi key-value weighted attributed connection using uni-tag connection pools. In accordance with an embodiment, a connection pool enables labeling of connections that software applications can use to access a database. A connection pool associated with a database enables tagging of connection pools at the database and allows applications to selectively obtain connections based on tags. A request is received from an application to query data from the database using a labeled connection or low-cost alternative. If a low-cost connection is found, but requires configuration, the system returns unmatched labels for use by the application in configuring its environment to use the connection. The system can also generate a tag for the connection. Upon subsequent release of the database session, the tag can be made available for subsequent use of the tag, or a tagged connection, by the same or by other applications.

    Abstract translation: 一种使用单标签连接池进行透明多键值加权归属连接的系统和方法。 根据一个实施例,连接池可以标记软件应用程序可以用来访问数据库的连接。 与数据库关联的连接池可以标记数据库中的连接池,并允许应用程序根据标签选择性地获取连接。 从应用程序接收到使用标记的连接或低成本替代方案从数据库查询数据的请求。 如果找到低成本连接但需要配置,则系统返回不匹配的标签,供应用程序在配置其环境以使用连接时使用。 系统还可以生成连接的标签。 在随后的数据库会话释放之后,可以使标签可用于随后使用标签或标记的连接,由相同或其他应用程序使用。

    SYSTEM AND METHOD FOR DYNAMIC DATABASE SPLIT GENERATION IN A MASSIVELY PARALLEL OR DISTRIBUTED DATABASE ENVIRONMENT

    公开(公告)号:US20240134861A1

    公开(公告)日:2024-04-25

    申请号:US18400568

    申请日:2023-12-29

    CPC classification number: G06F16/24554 G06F16/278

    Abstract: A system and method is described for database split generation in a massively parallel or other distributed database environment including a plurality of databases and a data warehouse layer providing data summarization and querying functionality. A database table accessor of the system obtains, from an associated client application, a query for data in a table of the data warehouse layer, wherein the query includes a user preference. The system obtains table data representative of properties of the table, and determines a splits generator in accordance with one or more of the user preference or the properties of the table. The system generates, by the selected splits generator, table splits dividing the user query into a plurality of query splits, and outputs the plurality of query splits to an associated plurality of mappers for execution by the associated plurality of mappers of each of the plurality of query splits against the table.

    System and method for dynamic database split generation in a massively parallel or distributed database environment

    公开(公告)号:US11899666B2

    公开(公告)日:2024-02-13

    申请号:US16502727

    申请日:2019-07-03

    CPC classification number: G06F16/24554 G06F16/278

    Abstract: A system and method is described for database split generation in a massively parallel or other distributed database environment including a plurality of databases and a data warehouse layer providing data summarization and querying functionality. A database table accessor of the system obtains, from an associated client application, a query for data in a table of the data warehouse layer, wherein the query includes a user preference. The system obtains table data representative of properties of the table, and determines a splits generator in accordance with one or more of the user preference or the properties of the table. The system generates, by the selected splits generator, table splits dividing the user query into a plurality of query splits, and outputs the plurality of query splits to an associated plurality of mappers for execution by the associated plurality of mappers of each of the plurality of query splits against the table.

    System and method for generating rowid range-based splits in a massively parallel or distributed database environment

    公开(公告)号:US10380114B2

    公开(公告)日:2019-08-13

    申请号:US14864773

    申请日:2015-09-24

    Abstract: A system and method is described for database split generation in a massively parallel or distributed database environment including a plurality of databases and a data warehouse layer providing data summarization and querying functionality. A database table accessor of the system obtains, from an associated client application, a query for data in a table of the data warehouse layer, wherein the query includes a user preference. The system obtains table data representative of properties of the table, and determines a splits generator in accordance with one or more of the user preference or the properties of the table. The system generates, by the selected splits generator, table splits dividing the user query into a plurality of query splits, and outputs the plurality of query splits to an associated plurality of mappers for execution by the associated plurality of mappers of each of the plurality of query splits against the table.

    System and method for query processing with table-level predicate pushdown in a massively parallel or distributed database environment

    公开(公告)号:US10078684B2

    公开(公告)日:2018-09-18

    申请号:US14864785

    申请日:2015-09-24

    CPC classification number: G06F16/283 G06F16/245 G06F16/86 G06Q10/00

    Abstract: A system and method is described for database split generation in a massively parallel or distributed database environment including a plurality of databases and a data warehouse layer providing data summarization and querying functionality. A database table accessor of the system obtains, from an associated client application, a query for data in a table of the data warehouse layer, wherein the query includes a user preference. The system obtains table data representative of properties of the table, and determines a splits generator in accordance with one or more of the user preference or the properties of the table. The system generates, by the selected splits generator, table splits dividing the user query into a plurality of query splits, and outputs the plurality of query splits to an associated plurality of mappers for execution by the associated plurality of mappers of each of the plurality of query splits against the table.

    SYSTEM AND METHOD FOR MARSHALING MASSIVE DATABASE DATA FROM NATIVE LAYER TO JAVA USING LINEAR ARRAY
    9.
    发明申请
    SYSTEM AND METHOD FOR MARSHALING MASSIVE DATABASE DATA FROM NATIVE LAYER TO JAVA USING LINEAR ARRAY 有权
    使用线性阵列将大量数据库数据从内层划分到JAVA的系统和方法

    公开(公告)号:US20140379739A1

    公开(公告)日:2014-12-25

    申请号:US14251410

    申请日:2014-04-11

    CPC classification number: G06F17/3056

    Abstract: A system and method for marshaling database data from a native interface layer, to a Java layer, using a linear array. In accordance with an embodiment, a request is received from a software application to query or access data stored at the database. At a database driver native interface layer, the system obtains cell data from the database, determines cell coordinates and a cell metadata, and linearizes the cell data if required. The linearized data is then flushed to a linear byte array in the database driver presentation layer, and the cell coordinates and cell metadata are provided for use by a compact data handler and the application in accessing the data.

    Abstract translation: 一种用于使用线性阵列将数据库数据从本机接口层传送到Java层的系统和方法。 根据实施例,从软件应用程序接收到查询或访问存储在数据库中的数据的请求。 在数据库驱动器本机接口层,系统从数据库获取单元数据,确定单元坐标和单元格元数据,并根据需要对单元数据进行线性化。 然后将线性化数据刷新到数据库驱动程序表示层中的线性字节数组,并且提供单元坐标和单元元数据供小型数据处理程序和应用程序访问数据。

Patent Agency Ranking