-
公开(公告)号:US20240394574A1
公开(公告)日:2024-11-28
申请号:US18790920
申请日:2024-07-31
Applicant: Snowflake Inc.
Inventor: Anupam Datta , Shayak Sen , Apoorv Gupta , David Sandai Kurokawa
Abstract: A computing machine receives a representation of a machine learning model, a representation of a first data segment, and a representation of a second data segment. The computing machine computes an output difference between an output of the machine learning model applied to the first data segment and an output of the machine learning model applied to the second data segment. The computing machine determines a set of reasons for the computed output difference based on a set of metrics defining distance between feature importance distributions, the set of reasons identifying a set of features from a feature vector of the machine learning model along with a relative contribution of each feature to the computed output difference. The computing machine provides an output representing the set of reasons.