-
公开(公告)号:US20250131215A1
公开(公告)日:2025-04-24
申请号:US19000935
申请日:2024-12-24
Applicant: Google LLC
Inventor: Puneet Jain , Orhan Firat , Sihang Liang
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, that translate text depicted in images from a source language into a target language. Methods can include obtaining a first image that depicts first text written in a source language. The first image is input into an image translation model, which includes a feature extractor and a decoder. The feature extractor accepts the first image as input and in response, generates a first set of image features that are a description of a portion of the first image in which the text is depicted is obtained. The first set of image features are input into a decoder. In response to the input first set of image features, the decoder outputs a second text that is a predicted translation of text in the source language that is represented by the first set of image features.
-
公开(公告)号:US20230124572A1
公开(公告)日:2023-04-20
申请号:US17791409
申请日:2020-01-08
Applicant: GOOGLE LLC
Inventor: Puneet Jain , Orhan Firat , Sihang Liang
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, that translate text depicted in images from a source language into a target language. Methods can include obtaining a first image that depicts first text written in a source language. The first image is input into an image translation model, which includes a feature extractor and a decoder. The feature extractor accepts the first image as input and in response, generates a first set of image features that are a description of a portion of the first image in which the text is depicted is obtained. The first set of image features are input into a decoder. In response to the input first set of image features, the decoder outputs a second text that is a predicted translation of text in the source language that is represented by the first set of image features.
-
公开(公告)号:US12217017B2
公开(公告)日:2025-02-04
申请号:US17791409
申请日:2020-01-08
Applicant: GOOGLE LLC
Inventor: Puneet Jain , Orhan Firat , Sihang Liang
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, that translate text depicted in images from a source language into a target language. Methods can include obtaining a first image that depicts first text written in a source language. The first image is input into an image translation model, which includes a feature extractor and a decoder. The feature extractor accepts the first image as input and in response, generates a first set of image features that are a description of a portion of the first image in which the text is depicted is obtained. The first set of image features are input into a decoder. In response to the input first set of image features, the decoder outputs a second text that is a predicted translation of text in the source language that is represented by the first set of image features.
-
-