System for De-Duplicating Job Postings
    Invention Application

    Publication No.: US20180181609A1

    Publication Date: 2018-06-28

    Application No.: US15391912

    Filing Date: 2016-12-28

    Applicant: Google Inc.

    IPC Classification: G06F17/30 G06Q10/10

    Abstract: Systems and methods for de-duplicating electronic job postings are provided. In one embodiment, a method includes obtaining a first set of data indicative of a job posting. The first set of data includes one or more characteristics associated with the job posting. The method includes accessing a second set of data indicative of a job posting cluster. The job posting cluster includes one or more previous job postings. One of the previous job postings is a master job posting that is representative of the previous job postings. The method includes determining whether the job posting is duplicative of the previous job postings based at least in part on the characteristics associated with the job posting and the master job posting. The method includes providing for storage a third set of data indicative of the job posting associated with the job posting cluster or associated with a new job posting cluster.
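
The flow described in the abstract (compare an incoming posting against each cluster's master posting, then store it with that cluster or with a new one) can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the application: the names `JobPosting`, `Cluster`, `similarity`, and `assign_to_cluster`, the choice of title, company, location, and description as the "characteristics", the Jaccard token overlap used as the comparison, and the 0.8 threshold.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class JobPosting:
    # Hypothetical "characteristics"; the abstract does not enumerate them.
    title: str
    company: str
    location: str
    description: str


@dataclass
class Cluster:
    master: JobPosting  # posting that represents the whole cluster
    members: List[JobPosting] = field(default_factory=list)


def _tokens(text: str) -> set:
    """Lowercased whitespace tokens of a text field."""
    return set(text.lower().split())


def similarity(a: JobPosting, b: JobPosting) -> float:
    """Assumed measure: 0.0 if company or location differ, otherwise the
    Jaccard overlap of the title and description tokens."""
    if a.company.lower() != b.company.lower():
        return 0.0
    if a.location.lower() != b.location.lower():
        return 0.0
    ta = _tokens(a.title + " " + a.description)
    tb = _tokens(b.title + " " + b.description)
    union = ta | tb
    if not union:
        return 0.0
    return len(ta & tb) / len(union)


def assign_to_cluster(posting: JobPosting, clusters: List[Cluster],
                      threshold: float = 0.8) -> Cluster:
    """Compare the posting only against each cluster's master posting.
    Attach it to the first cluster that meets the (assumed) threshold,
    otherwise start a new cluster with the posting as its master."""
    for cluster in clusters:
        if similarity(posting, cluster.master) >= threshold:
            cluster.members.append(posting)
            return cluster
    new_cluster = Cluster(master=posting, members=[posting])
    clusters.append(new_cluster)
    return new_cluster


if __name__ == "__main__":
    clusters: List[Cluster] = []
    a = JobPosting("Software Engineer", "Acme", "Austin, TX",
                   "Build and maintain backend services in Python")
    b = JobPosting("Software Engineer", "ACME", "Austin, TX",
                   "Build and maintain backend services in Python")
    c = JobPosting("Nurse", "Mercy Hospital", "Reno, NV",
                   "Provide patient care")
    for p in (a, b, c):
        assign_to_cluster(p, clusters)
    print(len(clusters))
```

Comparing against the master posting alone, rather than against every member of a cluster, keeps the number of comparisons proportional to the number of clusters. How the master is chosen or updated is not stated in the abstract; this sketch simply uses the first posting that opened the cluster.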