TRANSLATION OF TEXT DEPICTED IN IMAGES

    公开(公告)号:US20230124572A1

    公开(公告)日:2023-04-20

    申请号:US17791409

    申请日:2020-01-08

    Applicant: GOOGLE LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, that translate text depicted in images from a source language into a target language. Methods can include obtaining a first image that depicts first text written in a source language. The first image is input into an image translation model, which includes a feature extractor and a decoder. The feature extractor accepts the first image as input and in response, generates a first set of image features that are a description of a portion of the first image in which the text is depicted is obtained. The first set of image features are input into a decoder. In response to the input first set of image features, the decoder outputs a second text that is a predicted translation of text in the source language that is represented by the first set of image features.

    TRANSLATION OF TEXT DEPICTED IN IMAGES

    公开(公告)号:US20250131215A1

    公开(公告)日:2025-04-24

    申请号:US19000935

    申请日:2024-12-24

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, that translate text depicted in images from a source language into a target language. Methods can include obtaining a first image that depicts first text written in a source language. The first image is input into an image translation model, which includes a feature extractor and a decoder. The feature extractor accepts the first image as input and in response, generates a first set of image features that are a description of a portion of the first image in which the text is depicted is obtained. The first set of image features are input into a decoder. In response to the input first set of image features, the decoder outputs a second text that is a predicted translation of text in the source language that is represented by the first set of image features.

    Techniques and Models for Multilingual Text Rewriting

    公开(公告)号:US20230274100A1

    公开(公告)日:2023-08-31

    申请号:US17682282

    申请日:2022-02-28

    Applicant: Google LLC

    CPC classification number: G06F40/58 G06F40/197 G06F40/166 G06F40/253 G06N3/08

    Abstract: The technology provides a model-based approach for multilingual text rewriting that is applicable across many languages and across different styles including formality levels or other textual attributes. The model is configured to manipulate both language and textual attributes jointly. This approach supports zero-shot formality-sensitive translation, with no labeled data in the target language. An encoder-decoder architectural approach with attribute extraction is used to train rewriter models that can thus be used in “universal” textual rewriting across many different languages. A cross-lingual learning signal can be incorporated into the training approach. Certain training processes do not employ any exemplars. This approach enables not just straight translation, but also the ability to create new sentences with different attributes.

    GENERATING LABELED TRAINING DATA USING A PRE-TRAINED LANGUAGE MODEL NEURAL NETWORK

    公开(公告)号:US20230196105A1

    公开(公告)日:2023-06-22

    申请号:US18082934

    申请日:2022-12-16

    Applicant: Google LLC

    CPC classification number: G06N3/08

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating labeled training data using a pre-trained language model neural network. In particular, the language model neural network can generate the text input in a new labeled training example from an input sequence that includes (i) one or more context inputs and (ii) a text label that identifies the ground truth category for the new labeled training example.

    CROSS-LINGUAL CLASSIFICATION USING MULTILINGUAL NEURAL MACHINE TRANSLATION

    公开(公告)号:US20200342182A1

    公开(公告)日:2020-10-29

    申请号:US16610233

    申请日:2019-08-26

    Applicant: GOOGLE LLC

    Abstract: Training and/or using a multilingual classification neural network model to perform a natural language processing classification task, where the model reuses an encoder portion of a multilingual neural machine translation model. In a variety of implementations, a client device can generate a natural language data stream from a spoken input from a user. The natural language data stream can be applied as input to an encoder portion of the multilingual classification model. The output generated by the encoder portion can be applied as input to a classifier portion of the multilingual classification model. The classifier portion can generate a predicted classification label of the natural language data stream. In many implementations, an output can be generated based on the predicted classification label, and a client device can present the output.

    Techniques and Models for Multilingual Text Rewriting

    公开(公告)号:US20250148224A1

    公开(公告)日:2025-05-08

    申请号:US19015153

    申请日:2025-01-09

    Applicant: Google LLC

    Abstract: The technology provides a model-based approach for multilingual text rewriting that is applicable across many languages and across different styles including formality levels or other textual attributes. The model is configured to manipulate both language and textual attributes jointly. This approach supports zero-shot formality-sensitive translation, with no labeled data in the target language. An encoder-decoder architectural approach with attribute extraction is used to train rewriter models that can thus be used in “universal” textual rewriting across many different languages. A cross-lingual learning signal can be incorporated into the training approach. Certain training processes do not employ any exemplars. This approach enables not just straight translation, but also the ability to create new sentences with different attributes.

Patent Agency Ranking