-
公开(公告)号:US20180181609A1
公开(公告)日:2018-06-28
申请号:US15391912
申请日:2016-12-28
Applicant: Google Inc.
Inventor: Chao Chen , Alonso Alphonsovich Rukubayihunga , Christian Posse , Xuejun Tao , Ye Tian , Geordon Kitchen
CPC classification number: G06F16/2365 , G06F16/2255 , G06Q10/1053
Abstract: Systems and methods for de-duplicating electronic job postings are provided. In one embodiment, a method includes obtaining a first set of data indicative of a job posting. The first set of data includes one or more characteristics associated with the job posting. The method includes accessing a second set of data indicative of a job posting cluster. The job posting cluster includes one or more previous job postings. One of the previous job postings is a master job posting that is representative of the previous job postings. The method includes determining whether the job posting is duplicative of the previous job postings based at least in part on the characteristics associated with the job posting and the master job posting. The method includes providing for storage a third set of data indicative of the job posting associated with the job posting cluster or associated with a new job posting cluster.