Machine learning based duplicate invoice detection
摘要:
Embodiments detect duplicate invoices, each invoice including a plurality of fields. Embodiments generate synthetic training data using a plurality of training invoices and generating one or more modified fields for each of the plurality of training invoices. Embodiments train a machine learning model using the synthetic training data and generate a plurality of candidate invoice pairs. Embodiments input the plurality of candidate invoice pairs to the trained machine learning model and generate, by the trained machine learning model, a prediction of whether each of the candidate invoices pairs is a duplicate invoice pair.
公开/授权文献
信息查询
0/0