TRAINING IMAGE AND TEXT EMBEDDING MODELS
    Invention application

    Publication No.: US20200250538A1

    Publication date: 2020-08-06

    Application No.: US16265811

    Filing date: 2019-02-01

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for jointly training an image embedding model and a text embedding model. In one aspect, a method comprises: processing data from a historical query log of a search system to generate a candidate set of training examples, wherein each training example comprises: (i) a search query comprising a sequence of one or more words, (ii) an image, and (iii) selection data characterizing how often users selected the image in response to the image being identified by a search result for the search query; selecting a plurality of training examples from the candidate set of training examples; and using the selected training examples to jointly train the image embedding model and the text embedding model.
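    The first two steps of the abstract (building candidate training examples from a query log, then selecting a subset) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the application: the log row layout `(query, image_id, clicked)`, the names `TrainingExample`, `build_candidates`, and `select_examples`, the use of click rate as the selection data, and the threshold values. The joint training step itself is left out.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingExample:
    """Hypothetical container for one (query, image, selection data) triple."""
    query: str        # search query: a sequence of one or more words
    image_id: str     # identifier standing in for the image itself
    impressions: int  # times the image appeared in results for the query
    clicks: int       # times users selected the image for the query

    @property
    def click_rate(self) -> float:
        # One possible form of "selection data": fraction of impressions clicked.
        return self.clicks / self.impressions if self.impressions else 0.0


def build_candidates(log_rows):
    """Aggregate assumed (query, image_id, clicked) rows into candidate examples."""
    counts = defaultdict(lambda: [0, 0])  # (query, image_id) -> [impressions, clicks]
    for query, image_id, clicked in log_rows:
        entry = counts[(query, image_id)]
        entry[0] += 1
        entry[1] += int(clicked)
    return [
        TrainingExample(query, image_id, impressions, clicks)
        for (query, image_id), (impressions, clicks) in counts.items()
    ]


def select_examples(candidates, min_impressions=2, min_click_rate=0.5):
    """Keep candidates with enough evidence and a high click rate.

    Both thresholds are illustrative assumptions, not values from the source.
    """
    return [
        c for c in candidates
        if c.impressions >= min_impressions and c.click_rate >= min_click_rate
    ]


# Toy query log (fabricated purely to exercise the functions).
log_rows = [
    ("red sports car", "img_1", True),
    ("red sports car", "img_1", True),
    ("red sports car", "img_1", False),
    ("red sports car", "img_2", False),
    ("red sports car", "img_2", False),
    ("golden retriever puppy", "img_3", True),
]

candidates = build_candidates(log_rows)
selected = select_examples(candidates)
```

    With this toy log, only the pair ("red sports car", "img_1") passes both thresholds. In a fuller pipeline, the selected examples would then be passed to whatever routine jointly trains the image and text embedding models.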
