Invention Grant
- Patent Title: Self-supervised document representation learning
-
Application No.: US17333892Application Date: 2021-05-28
-
Publication No.: US11886815B2Publication Date: 2024-01-30
- Inventor: Jiuxiang Gu , Vlad Morariu , Varun Manjunatha , Tong Sun , Rajiv Jain , Peizhao Li , Jason Kuen , Handong Zhao
- Applicant: Adobe Inc.
- Applicant Address: US CA San Jose
- Assignee: ADOBE INC.
- Current Assignee: ADOBE INC.
- Current Assignee Address: US CA San Jose
- Agency: Kilpatrick Townsend & Stockton LLP
- Main IPC: G06F40/279
- IPC: G06F40/279 ; G06F40/205 ; G06F16/93 ; G06F40/30 ; G06N3/088 ; G06N3/045

Abstract:
One example method involves operations for a processing device that include receiving, by a machine learning model trained to generate a search result, a search query for a text input. The machine learning model is trained by receiving pre-training data that includes multiple documents. Pre-training the machine learning model by generating, using an encoder, feature embeddings for each of the documents included in the pre-training data. The feature embeddings are generated by applying a masking function to visual and textual features in the documents. Training the machine learning model also includes generating, using the feature embeddings, output features for the documents by concatenating the feature embeddings and applying a non-linear mapping to the feature embeddings. Training the machine learning model further includes applying a linear classifier to the output features. Additionally, operations include generating, for display, a search result using the machine learning model based on the input.
Public/Granted literature
- US20220382975A1 SELF-SUPERVISED DOCUMENT REPRESENTATION LEARNING Public/Granted day:2022-12-01
Information query