Publication No.: US20240070436A1
Publication date: 2024-02-29
Application No.: US17900592
Application date: 2022-08-31
Applicant: Huawei Technologies Co., Ltd.
Inventor: Hang XU, Lu HOU, Guansong LU, Minzhe NIU, Zhenguo LI, Runhui HUANG, Lewei YAO, Chunjing XU, Xiaodan LIANG
IPC: G06N3/04, G06F40/284
CPC classification number: G06N3/0454, G06F40/284
Abstract: A method is provided for data processing performed by a processing system. The method comprises: determining a set of first tokens for first data and a set of second tokens for second data, each token comprising information associated with a segment of the respective data; determining pair-wise similarities between the set of first tokens and the set of second tokens, each pair comprising a first token in the set of first tokens and a second token in the set of second tokens; determining, for each first token in the set of first tokens, a maximum similarity based on the determined pair-wise similarities between the respective first token and the second tokens in the set of second tokens; and determining a first similarity between the first data and the second data by aggregating the maximum similarities corresponding to the first tokens in the set of first tokens.
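
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not say how tokens are represented, which similarity measure is used, or how the maxima are aggregated. The sketch assumes tokens are non-zero embedding vectors given as lists of floats, uses cosine similarity for each pair, and aggregates by taking the mean. The names `cosine` and `first_similarity` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length, non-zero vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def first_similarity(first_tokens, second_tokens):
    """Similarity of first data to second data from token-wise maxima."""
    # Pair-wise similarities: one row per first token, one column per second token.
    pairwise = [[cosine(f, s) for s in second_tokens] for f in first_tokens]
    # For each first token, keep its maximum similarity over all second tokens.
    max_per_first = [max(row) for row in pairwise]
    # Aggregate the maxima; the mean is an assumption, not stated in the abstract.
    return sum(max_per_first) / len(max_per_first)


# Toy embeddings, made up for illustration only.
first_tokens = [[1.0, 0.0], [0.0, 1.0]]
second_tokens = [[1.0, 0.0], [1.0, 1.0]]
score = first_similarity(first_tokens, second_tokens)
```

Because the maximum is taken per first token, the score is in general not symmetric: swapping the two token sets can give a different value.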