Systems and methods for word segmentation based on a competing neural character language model

    Publication No.: US11113468B1

    Publication Date: 2021-09-07

    Application No.: US17028023

    Application Date: 2020-09-22

    Applicant: Coupang Corp.

    Inventors: Shusi Yu, Jing Li

    Abstract: Systems and methods are provided for detecting inaccuracy in a product title, comprising identifying, by running a string algorithm on a title associated with a product, at least one product type associated with the product, predicting, using a machine learning algorithm, at least one product type associated with the product based on the title, detecting an inaccuracy in the title, based on at least one of the identification or the prediction, and outputting, to a remote device, a message indicating that the title comprises the inaccuracy. Running the string algorithm may comprise receiving a set of strings, generating a tree based on the received set of strings, receiving the title, and traversing the generated tree using the title to find a match. Using the machine learning algorithm may comprise identifying words in the title, learning a vector representation for each character n-gram of each word, and summing each character n-gram.
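The string-algorithm step in the abstract (build a tree from a set of strings, then traverse it with the title to find a match) can be illustrated with a token-level trie. This is a minimal sketch under stated assumptions, not the patented implementation: the function names `build_trie` and `find_product_types`, the whitespace tokenization, and the lowercase normalization are all hypothetical choices made for illustration.

```python
from typing import Dict, List

# Sentinel key marking the end of a stored product type.
# Tokens are always str, so None can never collide with a token.
_END = None


def build_trie(product_types: List[str]) -> Dict:
    """Build a token-level trie from a set of product-type strings."""
    root: Dict = {}
    for ptype in product_types:
        node = root
        for token in ptype.lower().split():
            node = node.setdefault(token, {})
        node[_END] = ptype  # remember the original string at the leaf
    return root


def find_product_types(trie: Dict, title: str) -> List[str]:
    """Walk the trie from every token position in the title and
    collect each product type whose tokens appear consecutively."""
    tokens = title.lower().split()
    matches: List[str] = []
    for start in range(len(tokens)):
        node = trie
        for token in tokens[start:]:
            if token not in node:
                break
            node = node[token]
            if _END in node:
                matches.append(node[_END])
    return matches


if __name__ == "__main__":
    trie = build_trie(["phone case", "phone", "laptop sleeve"])
    print(find_product_types(trie, "Slim Phone Case for Travel"))
```

A title that yields no match, or a match that disagrees with the product's listed type, is the kind of signal the abstract describes as an inaccuracy; how the two signals are combined is not specified in the abstract and is left out here.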

    Systems and methods for word segmentation based on a competing neural character language model

    Publication No.: US10817665B1

    Publication Date: 2020-10-27

    Application No.: US16869741

    Application Date: 2020-05-08

    Applicant: COUPANG CORP.

    Inventors: Shusi Yu, Jing Li

    Abstract: Systems and methods are provided for detecting inaccuracy in a product title, comprising identifying, by running a string algorithm on a title associated with a product, at least one product type associated with the product, predicting, using a machine learning algorithm, at least one product type associated with the product based on the title, detecting an inaccuracy in the title, based on at least one of the identification or the prediction, and outputting, to a remote device, a message indicating that the title comprises the inaccuracy. Running the string algorithm may comprise receiving a set of strings, generating a tree based on the received set of strings, receiving the title, and traversing the generated tree using the title to find a match. Using the machine learning algorithm may comprise identifying words in the title, learning a vector representation for each character n-gram of each word, and summing each character n-gram.
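The machine-learning step in the abstract represents each word by the sum of the vectors of its character n-grams. The sketch below shows only that composition step, assuming the n-gram vectors have already been learned and are supplied in a lookup table. The names `char_ngrams` and `word_vector`, the `<` and `>` boundary markers, and the default n-gram length of 3 are illustrative assumptions, not details taken from the patent.

```python
from typing import Dict, List


def char_ngrams(word: str, n: int = 3) -> List[str]:
    """Return the character n-grams of a word, with boundary markers
    so that prefixes and suffixes get distinct n-grams."""
    padded = f"<{word}>"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def word_vector(word: str, table: Dict[str, List[float]], dim: int,
                n: int = 3) -> List[float]:
    """Sum the vectors of every known n-gram in the word.
    N-grams missing from the table contribute nothing."""
    total = [0.0] * dim
    for gram in char_ngrams(word, n):
        vec = table.get(gram)
        if vec is None:
            continue
        for i, value in enumerate(vec):
            total[i] += value
    return total


if __name__ == "__main__":
    # Toy table standing in for learned n-gram embeddings.
    table = {"<cu": [1.0, 0.0], "cup": [0.0, 2.0], "up>": [1.0, 1.0]}
    print(char_ngrams("cup"))
    print(word_vector("cup", table, dim=2))
```

Because a word's vector is built from subword pieces, misspelled or unseen words in a product title still receive a representation from whichever n-grams are known.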

    SYSTEMS AND METHODS FOR GENERATING A PERSONALIZED ADVERTISEMENT

    Publication No.: US20230139513A1

    Publication Date: 2023-05-04

    Application No.: US17516487

    Application Date: 2021-11-01

    Applicant: COUPANG CORP.

    Abstract: Systems and methods of generating, for display on a graphical user interface (GUI), a personalized advertisement. The systems and methods can include receiving data indicative of initiating a browsing session, aggregating user activity data associated with a user of the application, the user activity data including current session data and/or past session data, applying machine learning on the user activity data to generate one or more excluded products, applying the one or more excluded products to a product database to generate a list of relevant products, ranking, using one or more ranking rules, the list of relevant products to generate a ranked list of relevant products, and sending, to the mobile device, the personalized advertisement including one or more relevant products from the ranked list of relevant products.
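The last three steps in the abstract (apply the excluded products to a product database, rank the remaining products, and pick products for the advertisement) can be sketched as a filter followed by a sort. This is a hedged illustration only: the function `build_ad`, the dictionary fields `id` and `score`, and the use of a single sort key as the ranking rule are assumptions. The machine-learning step that produces the excluded products is not modeled and is represented by a plain set of IDs.

```python
from typing import Callable, Dict, List, Set


def build_ad(products: List[Dict], excluded_ids: Set[str],
             rank_key: Callable[[Dict], float], top_k: int = 3) -> List[Dict]:
    """Drop excluded products, rank the rest, and keep the top results."""
    # Apply the exclusion list to the product database.
    relevant = [p for p in products if p["id"] not in excluded_ids]
    # Apply a ranking rule (here: a single descending sort key).
    ranked = sorted(relevant, key=rank_key, reverse=True)
    return ranked[:top_k]


if __name__ == "__main__":
    products = [
        {"id": "a", "score": 0.2},
        {"id": "b", "score": 0.9},
        {"id": "c", "score": 0.5},
        {"id": "d", "score": 0.7},
    ]
    # Hypothetical output of the exclusion model, e.g. items already bought.
    excluded = {"b"}
    ad = build_ad(products, excluded, rank_key=lambda p: p["score"], top_k=2)
    print([p["id"] for p in ad])
```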
