Publication number: US11720942B1
Publication date: 2023-08-08
Application number: US16915361
Filing date: 2020-06-29
Applicant: Amazon Technologies, Inc.
Inventor: Loris Bazzani, Yanbei Chen
IPC: G06Q30/06, G06N3/02, G06F16/535, G06Q30/0601, G06T7/00
CPC classification number: G06Q30/0613, G06F16/535, G06N3/02, G06T7/00
Abstract: Techniques are generally described for interactive image retrieval using visual semantic matching. Image data and text data are encoded into a single shared visual semantic embedding space. A prediction model is trained using reference inputs, target outputs, and modification text describing changes to the reference inputs to obtain the target outputs. The prediction model can be used to perform image-to-text, text-to-image, and interactive retrieval.
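The retrieval idea in the abstract can be illustrated with a minimal sketch, assuming images and text have already been encoded into one shared embedding space. The abstract does not say how the reference input and the modification text are combined, so the additive `compose_query` below is a hypothetical stand-in, not the patented prediction model. All function names, the toy three-dimensional vectors, and the gallery labels are likewise assumptions made for illustration.

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)

def cosine(a, b):
    """Cosine similarity between two vectors in the shared embedding space."""
    a, b = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a, b))

def compose_query(reference_emb, modification_emb):
    """Hypothetical composition: add the two embeddings, then renormalize.
    This is an assumption for illustration only; the source does not
    describe the actual composition."""
    return l2_normalize([r + m for r, m in zip(reference_emb, modification_emb)])

def retrieve(query_emb, gallery, top_k=1):
    """Return the names of the top_k gallery items most similar to the query."""
    ranked = sorted(gallery.items(),
                    key=lambda item: cosine(query_emb, item[1]),
                    reverse=True)
    return [name for name, _ in ranked[:top_k]]

# Toy gallery: hand-written vectors standing in for encoded images.
gallery = {
    "red_dress": [1.0, 0.0, 0.0],
    "blue_dress": [0.0, 1.0, 0.0],
    "blue_dress_long_sleeves": [0.0, 1.0, 1.0],
}

reference = gallery["blue_dress"]   # reference image embedding
modification = [0.0, 0.0, 1.0]      # stand-in for encoded text such as "add long sleeves"

# Interactive retrieval: reference image plus modification text.
query = compose_query(reference, modification)
result = retrieve(query, gallery, top_k=1)
print(result)
```

Because every item lives in the same space, the same `retrieve` function also covers text-to-image retrieval (pass an encoded text vector as the query) and image-to-text retrieval (use a gallery of encoded captions).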