Invention Application
- Patent Title: GENERATING EMBEDDINGS IN A MULTIMODAL EMBEDDING SPACE FOR CROSS-LINGUAL DIGITAL IMAGE RETRIEVAL
- Application No.: US17075450
- Application Date: 2020-10-20
- Publication No.: US20220121702A1
- Publication Date: 2022-04-21
- Inventor: Ajinkya Kale, Zhe Lin, Pranav Aggarwal
- Applicant: Adobe Inc.
- Applicant Address: US CA San Jose
- Assignee: Adobe Inc.
- Current Assignee: Adobe Inc.
- Current Assignee Address: US CA San Jose
- Main IPC: G06F16/535
- IPC: G06F16/535; G06K9/62; G06F40/279; G06F16/242; G06F16/538; G06N3/04; G06N3/08

Abstract:
The present disclosure relates to methods, systems, and non-transitory computer-readable media for retrieving digital images in response to queries. For example, in one or more embodiments, the disclosed systems receive a query comprising text and generate a cross-lingual-multimodal embedding for the text within a multimodal embedding space. The disclosed systems further identify an image embedding for a digital image that corresponds to (e.g., is relevant to) the text from the query based on an embedding distance between the image embedding and the cross-lingual-multimodal embedding for the text within the multimodal embedding space. Accordingly, the disclosed systems retrieve the digital image associated with the image embedding for display on a client device, such as the client device that submitted the query.
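The retrieval step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not name a distance metric or an encoder, so cosine distance is assumed here, and the function names (`cosine_distance`, `retrieve_images`), the file names, and the hand-written three-dimensional vectors are hypothetical stand-ins for embeddings that trained text and image encoders would produce.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity; smaller means closer in the embedding space."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def retrieve_images(query_embedding, image_embeddings, top_k=1):
    """Rank images by embedding distance to the query and return the closest ids."""
    ranked = sorted(
        image_embeddings.items(),
        key=lambda item: cosine_distance(query_embedding, item[1]),
    )
    return [image_id for image_id, _ in ranked[:top_k]]


# Toy image embeddings (assumed values, not output of a real image encoder).
image_embeddings = {
    "dog.jpg": [0.9, 0.1, 0.0],
    "car.jpg": [0.0, 0.2, 0.9],
    "beach.jpg": [0.1, 0.9, 0.2],
}

# Stand-in for the cross-lingual-multimodal embedding of a text query
# (for example, the German word "Hund"); the values are assumed.
query_embedding = [0.8, 0.2, 0.1]

results = retrieve_images(query_embedding, image_embeddings, top_k=2)
print(results)
```

Because the query text in any supported language is mapped into the same space as the images, the ranking step itself needs no language-specific logic; only the text encoder does.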
Public/Granted literature
- US11734339B2 Generating embeddings in a multimodal embedding space for cross-lingual digital image retrieval. Public/Granted day: 2023-08-22