-
1.
公开(公告)号:US20220222231A1
公开(公告)日:2022-07-14
申请号:US17147498
申请日:2021-01-13
Applicant: Coupang Corp.
IPC: G06F16/22 , G06F16/2457 , G06F16/2453 , G06K9/62
Abstract: Systems and method are provided for AI-based database parameter optimization. One method includes receiving, from a user device, a query; preprocessing the query; predicting a plurality of optimal parameters for executing the query, by: calculating a predicted change in database metrics based on the preprocessed query; sending the predicted change in database metrics to a tuner; calculating database performance metrics based on the predicted change in database metrics and current database performance metrics; and calculating a vector of optimal parameters based on the database performance metrics; and executing the received query based on the predicted plurality of optimal parameters.
-
公开(公告)号:US20220156237A1
公开(公告)日:2022-05-19
申请号:US17181272
申请日:2021-02-22
Applicant: Coupang Corp.
Inventor: Bin Dong , Zhan Chen , Zhiyong Huang , Chuanchuan Han , Junzhao Zhang
IPC: G06F16/21 , G06F16/22 , G06F16/28 , G06F16/2455
Abstract: Methods and systems for database query efficiency improvement are disclosed. In one embodiment, a method includes mirroring a primary database to a secondary database; creating a testing database comprising the schema; receiving a query; running the query on the testing database; and evaluating the query by: identifying predicates in the query; determining most common values for each column name by querying the secondary database; creating, for each column name, a list comprising at least one of the most common values; creating a test predicate comprising one of the column names and an entry for the list corresponding to the column name; creating a test query comprising one or more test predicates; determining a resource utilization of the query by running each of the test queries on the secondary database; and providing, to a user interface for display, an efficiency improvement recommendation when the resource utilization exceeds a threshold.
-
公开(公告)号:US10963438B1
公开(公告)日:2021-03-30
申请号:US16950342
申请日:2020-11-17
Applicant: Coupang Corp.
Inventor: Bin Dong , Zhan Chen , Zhiyong Huang , Chuanchuan Han , Junzhao Zhang
IPC: G06F16/00 , G06F16/21 , G06F16/2455 , G06F16/28 , G06F16/22
Abstract: Methods and systems for database query efficiency improvement are disclosed. In one embodiment, a method includes mirroring a primary database to a secondary database; creating a testing database comprising the schema; receiving a query; running the query on the testing database; and evaluating the query by: identifying predicates in the query; determining most common values for each column name by querying the secondary database; creating, for each column name, a list comprising at least one of the most common values; creating a test predicate comprising one of the column names and an entry for the list corresponding to the column name; creating a test query comprising one or more test predicates; determining a resource utilization of the query by running each of the test queries on the secondary database; and providing, to a user interface for display, an efficiency improvement recommendation when the resource utilization exceeds a threshold.
-
公开(公告)号:US11709830B2
公开(公告)日:2023-07-25
申请号:US17240219
申请日:2021-04-26
Applicant: Coupang Corp.
Inventor: Zhan Chen , Bin Dong , Zhiyong Huang , Chuanchuan Han , Junzhao Zhang
IPC: G06F16/245 , G06F16/25 , G06F16/28 , G06F16/27 , G06F16/2452 , G06F16/2455 , G06F16/2453 , G06F11/34 , G06F11/20
CPC classification number: G06F16/24528 , G06F11/2025 , G06F11/3428 , G06F16/24549 , G06F16/24564 , G06F16/258 , G06F16/27 , G06F16/285
Abstract: Systems and methods for managing a highly available distributed hybrid database comprising: a memory storing instructions; and one or more processors configured to execute the instructions to: receive a query from a user device to retrieve data from a distributed database comprising a source node, a first plurality of replica nodes, and a second plurality of replica nodes, wherein the source node and the first plurality of replica nodes form a transactional cluster, and wherein the second plurality of replica nodes forms an analytical cluster; determine whether to process the query using the transactional cluster or the analytical cluster based on one or more rules; translate the query into a first protocol that the determined cluster comprehends; select a replica node corresponding to the determined cluster; process the query using the selected replica node; and send data associated with results from processing the query to the user device.
-
公开(公告)号:US20220284010A1
公开(公告)日:2022-09-08
申请号:US17581741
申请日:2022-01-21
Applicant: COUPANG CORP.
Inventor: Bin Dong
IPC: G06F16/23 , G06F9/54 , G06F16/2455 , G06F16/245 , G06F16/22 , G06F16/25 , G06F7/14
Abstract: a computer-implemented system for generating a data transaction log, the system comprising: at least one first storage storing at least one first data table, the at least one first data table including at least one transaction log and being associated with first nodes; at least one second storage for storing at least one second data table; at least one processor configured to execute the steps of: generating a registry; maintaining at least one data source agent, each data source agent being configured to: query the registry to receive an address of the one or more first data table; receiving, in response to the query, the at least one transaction logs from at least one first storage, wherein each transaction log is associated with a transaction time; compiling one or more modified transaction logs; and maintaining at least one streaming process, each streaming process being configured to: generate the at least one second data table based on the modified transaction logs for storage in the at least one second storage; and generate display based on the at least one second data table.
-
公开(公告)号:US11216441B1
公开(公告)日:2022-01-04
申请号:US17105127
申请日:2020-11-25
Applicant: Coupang Corp.
Inventor: Zhan Chen , Zhiyong Huang , Bin Dong , Chuanchuan Han , Junzhao Zhang
Abstract: Systems and methods for managing a highly available distributed database comprising: a memory storing instructions; and one or more processors configured to execute the instructions to: determine that a source node, in a distributed database comprising the source node and one or more replica nodes, is not available; select a most-updated replica node from the one or more replica nodes; switch a role of the most-updated replica node to source; update a data store to label the source node as unavailable and the selected replica node as being a promoted source node; send a notification to a user device to update a database topology based on the updated data store; determine whether the user device has updated the database topology; and upon determining the user device has not updated the database topology, continue to send the notification to the user device until the user device has updated the database topology.
-
公开(公告)号:US11263206B1
公开(公告)日:2022-03-01
申请号:US17189816
申请日:2021-03-02
Applicant: COUPANG CORP.
Inventor: Bin Dong
IPC: G06F16/23 , G06F9/54 , G06F16/25 , G06F16/22 , G06F7/14 , G06F16/245 , G06F16/2455 , G06Q30/06 , G06Q10/10 , G06Q50/28
Abstract: A computer-implemented system for generating a data transaction log, the system comprising: at least one first storage storing at least one first data table, the at least one first data table including at least one transaction log and being associated with first nodes; at least one second storage for storing at least one second data table; at least one processor configured to execute the steps of: generating a registry; maintaining at least one data source agent, each data source agent being configured to: query the registry to receive an address of the one or more first data table; receiving, in response to the query, the at least one transaction logs from at least one first storage, wherein each transaction log is associated with a transaction time; compiling one or more modified transaction logs; and maintaining at least one streaming process, each streaming process being configured to: generate the at least one second data table based on the modified transaction logs for storage in the at least one second storage; and generate display based on the at least one second data table.
-
公开(公告)号:US11016969B1
公开(公告)日:2021-05-25
申请号:US17105040
申请日:2020-11-25
Applicant: Coupang Corp.
Inventor: Zhan Chen , Bin Dong , Zhiyong Huang , Chuanchuan Han , Junzhao Zhang
IPC: G06F16/2452 , G06F16/2455 , G06F16/2453 , G06F16/27 , G06F11/34 , G06F16/25 , G06F16/28 , G06F11/20
Abstract: Systems and methods for managing a highly available distributed hybrid database comprising: a memory storing instructions; and one or more processors configured to execute the instructions to: receive a query from a user device to retrieve data from a distributed database comprising a source node, a first plurality of replica nodes, and a second plurality of replica nodes, wherein the source node and the first plurality of replica nodes form a transactional cluster, and wherein the second plurality of replica nodes forms an analytical cluster; determine whether to process the query using the transactional cluster or the analytical cluster based on one or more rules; translate the query into a first protocol that the determined cluster comprehends; select a replica node corresponding to the determined cluster; process the query using the selected replica node; and send data associated with results from processing the query to the user device.
-
公开(公告)号:US12130802B2
公开(公告)日:2024-10-29
申请号:US17581741
申请日:2022-01-21
Applicant: COUPANG CORP.
Inventor: Bin Dong
IPC: G06F16/23 , G06F7/14 , G06F9/54 , G06F16/22 , G06F16/245 , G06F16/2455 , G06F16/25 , G06Q10/08 , G06Q10/10 , G06Q30/0601
CPC classification number: G06F16/2379 , G06F7/14 , G06F9/541 , G06F16/2282 , G06F16/2358 , G06F16/245 , G06F16/24554 , G06F16/254 , G06F16/258 , G06Q10/08 , G06Q10/10 , G06Q30/0601
Abstract: A computer-implemented system and method for generating a data transaction log is disclosed. The method has the steps of: generating a registry; maintaining at least one data source agent, each data source agent being configured to: query the registry to receive an address of the one or more first data table; receiving, in response to the query, the at least one transaction logs from at least one first storage, wherein each transaction log is associated with a transaction time; compiling one or more modified transaction logs; and maintaining at least one streaming process, each streaming process being configured to: generate the at least one second data table based on the modified transaction logs for storage in the at least one second storage; and generate display based on the at least one second data table.
-
公开(公告)号:US12013826B2
公开(公告)日:2024-06-18
申请号:US17181272
申请日:2021-02-22
Applicant: Coupang Corp.
Inventor: Bin Dong , Zhan Chen , Zhiyong Huang , Chuanchuan Han , Junzhao Zhang
IPC: G06F16/21 , G06F16/22 , G06F16/2455 , G06F16/28
CPC classification number: G06F16/217 , G06F16/211 , G06F16/214 , G06F16/221 , G06F16/24557 , G06F16/287
Abstract: Methods and systems for database query efficiency improvement are disclosed. In one embodiment, a method includes mirroring a primary database to a secondary database; creating a testing database comprising the schema; receiving a query; running the query on the testing database; and evaluating the query by: identifying predicates in the query; determining most common values for each column name by querying the secondary database; creating, for each column name, a list comprising at least one of the most common values; creating a test predicate comprising one of the column names and an entry for the list corresponding to the column name; creating a test query comprising one or more test predicates; determining a resource utilization of the query by running each of the test queries on the secondary database; and providing, to a user interface for display, an efficiency improvement recommendation when the resource utilization exceeds a threshold.
-
-
-
-
-
-
-
-
-