- 专利标题: Counter data generation for data profiling using only true samples
-
申请号: US18136830申请日: 2023-04-19
-
公开(公告)号: US12112268B2公开(公告)日: 2024-10-08
- 发明人: Fardin Abdi Taghi Abad , Reza Farivar , Vincent Pham , Kenneth Taylor , Mark Watson , Jeremy Goodsitt , Austin Walters , Anh Truong
- 申请人: Capital One Services, LLC
- 申请人地址: US VA McLean
- 专利权人: CAPITAL ONE SERVICES, LLC
- 当前专利权人: CAPITAL ONE SERVICES, LLC
- 当前专利权人地址: US VA Mclean
- 代理机构: HUNTON ANDREWS KURTH LLP
- 主分类号: G06F40/10
- IPC分类号: G06F40/10 ; G06N3/045 ; G06N3/08
摘要:
A method for generating a dual-class dataset is disclosed. A single-class dataset and a context dataset are obtained. The context dataset can be labeled. A model can be trained using the combination of the single-class dataset and the labeled context dataset. The model can be run on the context dataset. The data points that are classified the same as the data points included in the single-class dataset, can be removed from the labeled context dataset and added to the single-class dataset. These steps can be repeated until no data points are classified by the model.
公开/授权文献
信息查询