Invention Grant
- Patent Title: Systems and methods for analyzing text extracted from images and performing appropriate transformations on the extracted text
-
Application No.: US18463951Application Date: 2023-09-08
-
Publication No.: US12033620B1Publication Date: 2024-07-09
- Inventor: Harshit Kharbanda , Jessica Lee , Christopher James Kelley , Fabian Roth , Dounia Berrada , Samer Hassan Hassan , Afroz Mohiuddin , Mikhail Khalman , Ali Essam Ali Elqursh , Belinda Luna Zeng
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: GOOGLE LLC
- Current Assignee: GOOGLE LLC
- Current Assignee Address: US CA Mountain View
- Agency: DORITY & MANNING P.A.
- Main IPC: G06F3/0483
- IPC: G06F3/0483 ; G06F16/30 ; G06F16/33 ; G06F16/583 ; G06V10/778 ; G06V30/14 ; G06V30/148 ; G10L15/183 ; G10L15/22 ; G10L15/30

Abstract:
The present disclosure provides computer-implemented methods, systems, and devices for responding to requests associated with an image. A computing system obtains, wherein the image depicts a first set of textual content. The computing system determines one or more characteristics of the first set of textual content. The computing system determines a response type from a plurality of response types based on the one or more characteristics. The computing system generates a model input, wherein the model input comprises data descriptive of the first set of textual content and a prompt associated with the response type. The computing system provides providing the model input as an input to a machine-learned language model. The computing system receives a second set of text as an output of the machine-learned language model as a result of the machine-learned language model processing the model input. The computing system provides the second set of text for display to a user, wherein the second set of textual content is associated with the response type.
Information query
IPC分类: