Systems and methods for managing a highly available distributed hybrid transactional and analytical database

    Publication No.: US11016969B1

    Publication date: 2021-05-25

    Application No.: US17105040

    Application date: 2020-11-25

    Applicant: Coupang Corp.

    Abstract: Systems and methods for managing a highly available distributed hybrid database comprising: a memory storing instructions; and one or more processors configured to execute the instructions to: receive a query from a user device to retrieve data from a distributed database comprising a source node, a first plurality of replica nodes, and a second plurality of replica nodes, wherein the source node and the first plurality of replica nodes form a transactional cluster, and wherein the second plurality of replica nodes forms an analytical cluster; determine whether to process the query using the transactional cluster or the analytical cluster based on one or more rules; translate the query into a first protocol that the determined cluster comprehends; select a replica node corresponding to the determined cluster; process the query using the selected replica node; and send data associated with results from processing the query to the user device.
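
The routing flow in this abstract (pick a cluster by rule, translate the query, pick a replica, run it) can be sketched roughly as follows. This is only an illustration under stated assumptions: the `Cluster` type, the keyword rule, the protocol labels, and the placeholder `translate` step are all hypothetical and are not taken from the patent.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Cluster:
    name: str        # "transactional" or "analytical"
    protocol: str    # hypothetical label for the wire protocol the cluster speaks
    replicas: tuple  # names of the replica nodes in this cluster


# Hypothetical rule set: aggregate-style queries are treated as analytical.
ANALYTICAL_KEYWORDS = ("GROUP BY", "SUM(", "AVG(", "COUNT(")


def choose_cluster(query, transactional, analytical):
    """Apply the (assumed) rules to decide which cluster handles the query."""
    upper = query.upper()
    if any(keyword in upper for keyword in ANALYTICAL_KEYWORDS):
        return analytical
    return transactional


def translate(query, protocol):
    """Placeholder translation: tag the statement with the target protocol."""
    return {"protocol": protocol, "statement": query.strip()}


def route(query, transactional, analytical, rng=random):
    """Return (cluster name, selected replica, translated request)."""
    cluster = choose_cluster(query, transactional, analytical)
    request = translate(query, cluster.protocol)
    replica = rng.choice(cluster.replicas)  # assumed policy: random replica
    return cluster.name, replica, request
```

A real router would replace the keyword check with whatever rules the deployment defines, and the random pick with a load-aware or lag-aware selection.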

    Systems and methods for managing a highly available and scalable distributed database in a cloud computing environment

    Publication No.: US11216441B1

    Publication date: 2022-01-04

    Application No.: US17105127

    Application date: 2020-11-25

    Applicant: Coupang Corp.

    Abstract: Systems and methods for managing a highly available distributed database comprising: a memory storing instructions; and one or more processors configured to execute the instructions to: determine that a source node, in a distributed database comprising the source node and one or more replica nodes, is not available; select a most-updated replica node from the one or more replica nodes; switch a role of the most-updated replica node to source; update a data store to label the source node as unavailable and the selected replica node as being a promoted source node; send a notification to a user device to update a database topology based on the updated data store; determine whether the user device has updated the database topology; and upon determining the user device has not updated the database topology, continue to send the notification to the user device until the user device has updated the database topology.
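
A minimal sketch of the failover steps described in this abstract, assuming each replica exposes a numeric replication position (`applied_position`, a hypothetical field) and that the topology data store is a plain dictionary. The abstract describes notifying until the user device has updated its topology; the `max_attempts` bound here is an added safeguard for the example, not part of the source.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    role: str                # "source" or "replica"
    applied_position: int    # hypothetical measure of how up to date the node is
    status: str = "available"


def promote_most_updated(source, replicas, topology):
    """Label the failed source unavailable and promote the freshest replica."""
    candidate = max(replicas, key=lambda node: node.applied_position)
    candidate.role = "source"
    source.status = "unavailable"
    # The dictionary stands in for the data store named in the abstract.
    topology[source.name] = "unavailable"
    topology[candidate.name] = "promoted_source"
    return candidate


def notify_until_updated(send, has_updated, max_attempts=5):
    """Resend the notification until the client confirms the new topology.

    Returns the number of notifications sent.
    """
    attempts = 0
    while attempts < max_attempts:
        send()
        attempts += 1
        if has_updated():
            break
    return attempts
```

Here `send` and `has_updated` are caller-supplied callables, so the same loop works with any transport for the notification.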

    COMPUTERIZED SYSTEMS AND METHODS FOR USING ARTIFICIAL INTELLIGENCE TO OPTIMIZE DATABASE PARAMETERS

    Publication No.: US20220222231A1

    Publication date: 2022-07-14

    Application No.: US17147498

    Application date: 2021-01-13

    Applicant: Coupang Corp.

    Inventor: Bin Dong Zhan Chen

    Abstract: Systems and methods are provided for AI-based database parameter optimization. One method includes receiving, from a user device, a query; preprocessing the query; predicting a plurality of optimal parameters for executing the query, by: calculating a predicted change in database metrics based on the preprocessed query; sending the predicted change in database metrics to a tuner; calculating database performance metrics based on the predicted change in database metrics and current database performance metrics; and calculating a vector of optimal parameters based on the database performance metrics; and executing the received query based on the predicted plurality of optimal parameters.

    SYSTEMS AND METHODS FOR DATABASE QUERY EFFICIENCY IMPROVEMENT

    Publication No.: US20220156237A1

    Publication date: 2022-05-19

    Application No.: US17181272

    Application date: 2021-02-22

    Applicant: Coupang Corp.

    Abstract: Methods and systems for database query efficiency improvement are disclosed. In one embodiment, a method includes mirroring a primary database to a secondary database; creating a testing database comprising the schema; receiving a query; running the query on the testing database; and evaluating the query by: identifying predicates in the query; determining most common values for each column name by querying the secondary database; creating, for each column name, a list comprising at least one of the most common values; creating a test predicate comprising one of the column names and an entry for the list corresponding to the column name; creating a test query comprising one or more test predicates; determining a resource utilization of the query by running each of the test queries on the secondary database; and providing, to a user interface for display, an efficiency improvement recommendation when the resource utilization exceeds a threshold.
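
The test-query generation step in this abstract can be illustrated with a small sketch. It assumes an in-memory SQLite database stands in for the secondary database, that predicates are simple equality checks, and that table and column names come from a trusted source (they are interpolated into the SQL). The helper names `most_common_values` and `build_test_queries` are hypothetical, and the resource-utilization measurement and recommendation steps are not covered.

```python
import sqlite3
from itertools import product


def most_common_values(conn, table, column, k=2):
    """Return the k most frequent values of a column, ties broken by value."""
    rows = conn.execute(
        f"SELECT {column} FROM {table} GROUP BY {column} "
        f"ORDER BY COUNT(*) DESC, {column} LIMIT ?",
        (k,),
    ).fetchall()
    return [row[0] for row in rows]


def build_test_queries(conn, table, columns, k=2):
    """Build one parameterized test query per combination of common values."""
    value_lists = [most_common_values(conn, table, col, k) for col in columns]
    where = " AND ".join(f"{col} = ?" for col in columns)
    sql = f"SELECT * FROM {table} WHERE {where}"
    return [(sql, combo) for combo in product(*value_lists)]
```

Each returned pair can be passed to `conn.execute(sql, params)`, which is where a timing or resource measurement would be taken and compared against a threshold.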

    Systems and methods for database query efficiency improvement

    Publication No.: US10963438B1

    Publication date: 2021-03-30

    Application No.: US16950342

    Application date: 2020-11-17

    Applicant: Coupang Corp.

    Abstract: Methods and systems for database query efficiency improvement are disclosed. In one embodiment, a method includes mirroring a primary database to a secondary database; creating a testing database comprising the schema; receiving a query; running the query on the testing database; and evaluating the query by: identifying predicates in the query; determining most common values for each column name by querying the secondary database; creating, for each column name, a list comprising at least one of the most common values; creating a test predicate comprising one of the column names and an entry for the list corresponding to the column name; creating a test query comprising one or more test predicates; determining a resource utilization of the query by running each of the test queries on the secondary database; and providing, to a user interface for display, an efficiency improvement recommendation when the resource utilization exceeds a threshold.

    Systems and methods for database query efficiency improvement

    Publication No.: US12013826B2

    Publication date: 2024-06-18

    Application No.: US17181272

    Application date: 2021-02-22

    Applicant: Coupang Corp.

    Abstract: Methods and systems for database query efficiency improvement are disclosed. In one embodiment, a method includes mirroring a primary database to a secondary database; creating a testing database comprising the schema; receiving a query; running the query on the testing database; and evaluating the query by: identifying predicates in the query; determining most common values for each column name by querying the secondary database; creating, for each column name, a list comprising at least one of the most common values; creating a test predicate comprising one of the column names and an entry for the list corresponding to the column name; creating a test query comprising one or more test predicates; determining a resource utilization of the query by running each of the test queries on the secondary database; and providing, to a user interface for display, an efficiency improvement recommendation when the resource utilization exceeds a threshold.

    Systems and methods for reducing disk usage and network latency

    Publication No.: US11150806B1

    Publication date: 2021-10-19

    Application No.: US17237926

    Application date: 2021-04-22

    Applicant: Coupang Corp.

    Abstract: Disclosed embodiments provide systems and methods for reducing disk storage and network latency. A method for reducing disk storage and network latency comprises receiving customer data of a customer to store in a database, conditioning the customer data, and formatting the conditioned customer data into first and second data strings respectively having a first data type and a second data type. The method further comprises flipping a sign bit of the first data string, encoding the sign-bit-flipped first data string and second data string into serialized data by representing every two digits of the first string with one byte, and flipping all bits of the serialized data if the received customer data is represented by a negative value. The method further comprises storing the serialized data in the database if negative, receiving a request for the customer data, deserializing serialized data to be retrieved from the database, and retrieving the deserialized data from the database.
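
The encoding idea in this abstract (two decimal digits per byte, a flipped sign bit, and all bits inverted for negative values) resembles a common order-preserving integer encoding. The sketch below is one possible reading, not the patented layout: it handles a single integer rather than two data strings, uses a fixed digit width, and puts the sign bit in a dedicated leading byte. All of these are assumptions made for illustration.

```python
def encode(value: int, width: int = 10) -> bytes:
    """Pack an integer so that byte-wise order matches numeric order.

    width is the (even) number of decimal digits kept; an assumption here.
    """
    digits = str(abs(value)).zfill(width)
    if width % 2 or len(digits) > width:
        raise ValueError("value does not fit in the configured digit width")
    # Every two decimal digits become one byte holding a value from 0 to 99.
    packed = bytes(int(digits[i:i + 2]) for i in range(0, width, 2))
    # Leading byte carries the sign bit, set for non-negative values.
    body = bytes([0x80]) + packed
    if value < 0:
        # Invert every bit so larger magnitudes sort first among negatives.
        body = bytes(b ^ 0xFF for b in body)
    return body


def decode(data: bytes) -> int:
    """Reverse encode(): undo the inversion, then unpack the digit pairs."""
    negative = (data[0] & 0x80) == 0
    if negative:
        data = bytes(b ^ 0xFF for b in data)
    magnitude = int("".join(f"{b:02d}" for b in data[1:]))
    return -magnitude if negative else magnitude
```

With the default width, a ten-digit value occupies six bytes instead of up to eleven characters of text, and the encoded bytes can be compared directly to order the values.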
