Invention Grant
- Patent Title: Generating natural language descriptions of images
-
Application No.: US17092837Application Date: 2020-11-09
-
Publication No.: US12014259B2Publication Date: 2024-06-18
- Inventor: Samy Bengio , Oriol Vinyals , Alexander Toshkov Toshev , Dumitru Erhan
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06N3/00
- IPC: G06N3/00 ; G06F40/40 ; G06N3/045 ; G06N3/047

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.
Public/Granted literature
- US20210125038A1 Generating Natural Language Descriptions of Images Public/Granted day:2021-04-29
Information query
IPC分类:
G | 物理 |
G06 | 计算;推算或计数 |
G06N | 基于特定计算模型的计算机系统 |
G06N3/00 | 基于生物学模型的计算机系统 |