Scalable hierarchical clustering
    1.
    发明授权

    公开(公告)号:US11675766B1

    公开(公告)日:2023-06-13

    申请号:US16808162

    申请日:2020-03-03

    CPC classification number: G06F16/2246 G06F16/285 G06F16/9024

    Abstract: A hierarchical representation of an input data set comprising similarity scores for respective entity pairs is generated iteratively. In a particular iteration, clusters are obtained from a subset of the iteration's input entity pairs which satisfy a similarity criterion, and then spanning trees are generated for at least some of the clusters. An indication of at least a representative pair of one or more of the clusters is added to the hierarchical representation in the iteration. The hierarchical representation is used to respond to clustering requests.

    Machine learned system for predicting item package quantity relationship between item descriptions

    公开(公告)号:US11461829B1

    公开(公告)日:2022-10-04

    申请号:US16455601

    申请日:2019-06-27

    Abstract: Systems and methods are disclosed to implement a machine learned system to determine the comparative relationship between item package quantity (IPQ) information indicated in two item descriptions. In embodiments, the system employs a neural network that includes a token encoding layer, an attribute summarizing layer, and a comparison layer. The token encoding layer accepts an item description as a token sequence and encodes the tokens with token attributes that are relevant to IPQ extraction. The attribute summarizing layer uses a convolutional neural network to generate a set of fixed-size feature vectors for each encoded token sequence. All feature vectors for both item descriptions are then provided to the comparison layer to generate the IPQ comparison result. Advantageously, the disclosed neural network model can be trained to make accurate predictions about the IPQ relationship of the two item descriptions using a small set of token-level attributes as input signals.

Patent Agency Ranking