-
公开(公告)号:US11232085B2
公开(公告)日:2022-01-25
申请号:US14990175
申请日:2016-01-07
Applicant: Amazon Technologies, Inc.
Inventor: Nina Mishra , Daniel Blick , Sudipto Guha , Okke Joost Schrijvers
IPC: G06F16/215 , G06N5/00 , G06N20/00 , G06F16/2458
Abstract: Random cut trees are generated with respective to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.