Categorization for a global taxonomy

    公开(公告)号:US11574240B2

    公开(公告)日:2023-02-07

    申请号:US16358474

    申请日:2019-03-19

    摘要: Methods and systems are provided for generating training data for training a classifier to assign nodes of a taxonomy graph to items based on item descriptions. Each node has a label. For each item, the system identifies for that item one or more candidate paths within the taxonomy graph that are relevant to that item. The system identifies the candidate paths based on content of the item description of that item matching labels of nodes. A candidate path is a sequence of nodes starting a root node of the taxonomy graph. For each identified candidate path, the system labels the item description with the candidate path equivalently with leaf node or label of the leaf node. The labeled item descriptions compose the training data for training the classifier.

    CATEGORIZATION FOR A GLOBAL TAXONOMY
    2.
    发明申请

    公开(公告)号:US20190287018A1

    公开(公告)日:2019-09-19

    申请号:US16358474

    申请日:2019-03-19

    IPC分类号: G06N20/00 G06F16/35 G06F16/33

    摘要: Methods and systems are provided for generating training data for training a classifier to assign nodes of a taxonomy graph to items based on item descriptions. Each node has a label. For each item, the system identifies for that item one or more candidate paths within the taxonomy graph that are relevant to that item. The system identifies the candidate paths based on content of the item description of that item matching labels of nodes. A candidate path is a sequence of nodes starting a root node of the taxonomy graph. For each identified candidate path, the system labels the item description with the candidate path equivalently with leaf node or label of the leaf node. The labeled item descriptions compose the training data for training the classifier.