MISMATCH DETECTION MODEL
    2.
    发明公开

    公开(公告)号:US20240070438A1

    公开(公告)日:2024-02-29

    申请号:US18387159

    申请日:2023-11-06

    摘要: A system comprising one or more processors and one or more non-transitory computer-readable media storing computing instructions that, when executed on the one or more processors, cause the one or more processors to perform operations comprising: obtaining a set of items that have been grouped together as matching items in a group; generating, using an ensemble learning model, a predictive indication of a mismatched item grouped together in error as part of the set of items, wherein the ensemble learning model comprises at least two detection models that are performed simultaneously with each other to output predictive indications comprising the predictive indication; and determining a final mismatch decision for an item of the set of items, wherein the final mismatch decision is based on the predictive indication, and wherein the item comprises the mismatched item. Other embodiments are disclosed.

    SYSTEMS AND METHODS FOR CLASSIFICATION USING STRUCTURED AND UNSTRUCTURED ATTRIBUTES

    公开(公告)号:US20200242465A1

    公开(公告)日:2020-07-30

    申请号:US16262620

    申请日:2019-01-30

    IPC分类号: G06N3/08 G06N20/00

    摘要: Systems and methods including one or more processing modules and one or more non-transitory storage modules storing computing instructions configured to run on the one or more processing modules and perform acts of receiving attribute data comprising a set of unstructured attribute data and a set of structured attribute data, analyzing the set of unstructured attribute data by processing through a first set of one or more Long Short Term Memory (LSTM) layers, to obtain an unstructured semantic signature, analyzing the set of the structured attribute data by processing through a first set of one or more Convolutional Neural Network (CNN) layers, to obtain a structured semantic signature, analyzing the unstructured semantic signature and the structured semantic signature, and classifying the item in one or more item categories. Other embodiments are disclosed herein.

    Method and system for attribute extraction from product titles using sequence labeling algorithms

    公开(公告)号:US10664888B2

    公开(公告)日:2020-05-26

    申请号:US16174011

    申请日:2018-10-29

    摘要: Some embodiments can comprise a system comprising one or more computer processing modules and one or more non-transitory storage modules storing computing instructions configured to run on the one or more computer processing modules a perform acts of: receiving, at the one or more computer processing modules and from a third-party electronic device, a title for a product; dividing, at the one or more computer processing modules, the title into a sequence of tokens; storing, by the one or more computer processing modules onto the one or more non-transitory storage modules, the sequence of tokens; determining, at the one or more computer processing modules and using a sequence labeling model, a type of each token of the sequence of tokens; storing, by the one or more computer processing modules onto the one or more non-transitory storage modules, the type of each token of the sequence of tokens; encoding, at the one or more computer processing modules, each token of the sequence of tokens to indicate the type of each token of the sequence of tokens, wherein the type of each token of the sequence of tokens can comprise a BIO encoding scheme, wherein: a label B of the BIO encoding scheme can indicate a first token of a brand name; a label I of the BIO encoding scheme can indicate a subsequent token of the brand name; and a label O of the BIO encoding scheme can indicate a token that is not part of the brand name; determining, at the one or more computer processing modules, a brand name present in the title using each token of the sequence of tokens, as encoded; storing, by the one or more computer processing modules onto the one or more non-transitory storage modules, the brand name present in the title; normalizing, at the one or more computer processing modules, the brand name present in the title to create a standardized representation of the brand name; writing, by the one or more computer processing modules onto the one or more non-transitory storage modules, the standardized representation of the brand name present in the title to an empty database entry associated with the product; and in response to a search request from a user, transmitting instructions to a user display to display a representation of the standardized representation of the brand name for each token of the sequence of tokens. Other embodiments are also disclosed herein.

    Systems and methods for classification using structured and unstructured attributes

    公开(公告)号:US11636330B2

    公开(公告)日:2023-04-25

    申请号:US16262620

    申请日:2019-01-30

    IPC分类号: G06N3/08 G06N20/00

    摘要: Systems and methods including one or more processing modules and one or more non-transitory storage modules storing computing instructions configured to run on the one or more processing modules and perform acts of receiving attribute data comprising a set of unstructured attribute data and a set of structured attribute data, analyzing the set of unstructured attribute data by processing through a first set of one or more Long Short Term Memory (LSTM) layers, to obtain an unstructured semantic signature, analyzing the set of the structured attribute data by processing through a first set of one or more Convolutional Neural Network (CNN) layers, to obtain a structured semantic signature, analyzing the unstructured semantic signature and the structured semantic signature, and classifying the item in one or more item categories. Other embodiments are disclosed herein.

    MISMATCH DETECTION MODEL
    6.
    发明申请

    公开(公告)号:US20210241076A1

    公开(公告)日:2021-08-05

    申请号:US16779510

    申请日:2020-01-31

    IPC分类号: G06N3/04 G06Q30/06

    摘要: A system including one or more processors and one or more non-transitory computer-readable media storing computing instructions configured to run on the one or more processors and perform obtaining a set of items that have been grouped together as matching items in a group; performing an ensemble mismatch detection; performing multiple detection models on the set of items to generate respective outputs regarding mismatches; combining the respective outputs to determine whether a quantity of detected mismatches is at least a predetermined threshold; when the quantity of detected mismatches is at least the predetermined threshold, the acts also can include separating at least one of the set of items from the group; and when the quantity of detected mismatches is not at least the predetermined threshold, the acts additionally can include maintaining each item of the set of items in the group. Other embodiments are disclosed.

    Automatically determining items to include in a variant group

    公开(公告)号:US11977561B2

    公开(公告)日:2024-05-07

    申请号:US16779473

    申请日:2020-01-31

    摘要: A method including obtaining image data and attribute information of a first item in an item catalog. The method also can include generating candidate variant items from the item catalog for the first item using a combination of (a) a k-nearest neighbors approach to search for first candidate variant items based on text embeddings for the attribute information of the first item, and (b) an elastic search approach to search for second candidate variant items based on image embeddings for the image data of the first item. The method additionally can include performing respective classifications based on respective pairs comprising the first item and each of the candidate variant items to filter the candidate variant items. The method further can include determining a respective distance between the first item and each of the candidate variant items, as filtered. The method additionally can include determining one or more items in the candidate variant items, as filtered, to include in a variant group for the first item, based on a decision function using a predetermined threshold and the respective distance for the each of the candidate variant items, as filtered. Other embodiments are described.

    Mismatch detection model
    10.
    发明授权

    公开(公告)号:US11809979B2

    公开(公告)日:2023-11-07

    申请号:US16779510

    申请日:2020-01-31

    摘要: A system including one or more processors and one or more non-transitory computer-readable media storing computing instructions configured to run on the one or more processors and perform obtaining a set of items that have been grouped together as matching items in a group; performing an ensemble mismatch detection; performing multiple detection models on the set of items to generate respective outputs regarding mismatches; combining the respective outputs to determine whether a quantity of detected mismatches is at least a predetermined threshold; when the quantity of detected mismatches is at least the predetermined threshold, the acts also can include separating at least one of the set of items from the group; and when the quantity of detected mismatches is not at least the predetermined threshold, the acts additionally can include maintaining each item of the set of items in the group. Other embodiments are disclosed.