- 专利标题: Consistent filtering of machine learning data
-
申请号: US14460314申请日: 2014-08-14
-
公开(公告)号: US10540606B2公开(公告)日: 2020-01-21
- 发明人: Leo Parker Dirac , Jin Li , Tianming Zheng , Donghui Zhuo
- 申请人: Amazon Technologies, Inc.
- 申请人地址: US NV Reno
- 专利权人: Amazon Technologies, Inc.
- 当前专利权人: Amazon Technologies, Inc.
- 当前专利权人地址: US NV Reno
- 代理机构: Meyertons, Hood, Kivlin, Kowert & Goetzel, P.C.
- 代理商 Robert C. Kowert
- 主分类号: G06N20/00
- IPC分类号: G06N20/00
摘要:
Consistency metadata, including a parameter for a pseudo-random number source, are determined for training-and-evaluation iterations of a machine learning model. Using the metadata, a first training set comprising records of at least a first chunk is identified from a plurality of chunks of a data set. The first training set is used to train a machine learning model during a first training-and-evaluation iteration. A first test set comprising records of at least a second chunk is identified using the metadata, and is used to evaluate the model during the first training-and-evaluation iteration.
公开/授权文献
- US20150379425A1 CONSISTENT FILTERING OF MACHINE LEARNING DATA 公开/授权日:2015-12-31
信息查询