SEARCH ENGINE
    1.
    发明申请
    SEARCH ENGINE 审中-公开

    公开(公告)号:US20180107983A1

    公开(公告)日:2018-04-19

    申请号:US15296230

    申请日:2016-10-18

    申请人: Google Inc.

    IPC分类号: G06Q10/10 G06F17/30 G06N99/00

    摘要: Methods, systems, and apparatus, including computer programs encoded on storage devices, for performing a job opportunity search. In one aspect, a system includes a data processing apparatus, and a computer-readable storage device having stored thereon instructions that, when executed by the data processing apparatus, cause the data processing apparatus to perform operations. The operations include defining a vector vocabulary, defining an occupation taxonomy that includes multiple different occupations, obtaining multiple labeled training data items, wherein each labeled training data item is associated with at least (i) a job title, and (ii) an occupation, generating, for each of the respective labeled training data items, an occupation vector that includes a feature weight for each respective term in the vector vocabulary, and associating each respective occupation vector with an occupation in the occupation taxonomy based on the occupation of the labeled training data item used to generate the occupation vector.

    System for De-Duplicating Job Postings
    2.
    发明申请

    公开(公告)号:US20180181609A1

    公开(公告)日:2018-06-28

    申请号:US15391912

    申请日:2016-12-28

    申请人: Google Inc.

    IPC分类号: G06F17/30 G06Q10/10

    摘要: Systems and methods for de-duplicating electronic job postings are provided. In one embodiment, a method includes obtaining a first set of data indicative of a job posting. The first set of data includes one or more characteristics associated with the job posting. The method includes accessing a second set of data indicative of a job posting cluster. The job posting cluster includes one or more previous job postings. One of the previous job postings is a master job posting that is representative of the previous job postings. The method includes determining whether the job posting is duplicative of the previous job postings based at least in part on the characteristics associated with the job posting and the master job posting. The method includes providing for storage a third set of data indicative of the job posting associated with the job posting cluster or associated with a new job posting cluster.