-
公开(公告)号:US20230274100A1
公开(公告)日:2023-08-31
申请号:US17682282
申请日:2022-02-28
Applicant: Google LLC
Inventor: Xavier Eduardo Garcia , Orhan Firat , Noah Constant , Xiaoyue Guo
IPC: G06F40/58 , G06F40/197 , G06F40/166 , G06F40/253 , G06N3/08
CPC classification number: G06F40/58 , G06F40/197 , G06F40/166 , G06F40/253 , G06N3/08
Abstract: The technology provides a model-based approach for multilingual text rewriting that is applicable across many languages and across different styles including formality levels or other textual attributes. The model is configured to manipulate both language and textual attributes jointly. This approach supports zero-shot formality-sensitive translation, with no labeled data in the target language. An encoder-decoder architectural approach with attribute extraction is used to train rewriter models that can thus be used in “universal” textual rewriting across many different languages. A cross-lingual learning signal can be incorporated into the training approach. Certain training processes do not employ any exemplars. This approach enables not just straight translation, but also the ability to create new sentences with different attributes.
-
公开(公告)号:US20230325725A1
公开(公告)日:2023-10-12
申请号:US17718738
申请日:2022-04-12
Applicant: Google LLC
Inventor: Brian David Lester , Rami Al-Rfou , Noah Constant
IPC: G06N20/20 , G06V10/764 , G06V10/774
CPC classification number: G06N20/20 , G06V10/764 , G06V10/7747
Abstract: Systems and methods for natural language processing can leverage trained prompts to condition a large pre-trained machine-learned model to generate an output for a specific task. For example, a subset of parameters may be trained for the particular task to then be input with a set of input data into the pre-trained machine-learned model to generate the task-specific output. During the training of the prompt, the parameters of the pre-trained machine-learned model can be frozen, which can reduce the computational resources used during training while still leveraging the previously learned data from the pre-trained machine-learned model.
-
公开(公告)号:US12210848B2
公开(公告)日:2025-01-28
申请号:US17682282
申请日:2022-02-28
Applicant: Google LLC
Inventor: Xavier Eduardo Garcia , Orhan Firat , Noah Constant , Xiaoyue Guo , Parker Riley
IPC: G06F40/58 , G06F40/166 , G06F40/197 , G06F40/253 , G06F40/56 , G06N3/045 , G06N3/047 , G06N3/08 , G06N3/084
Abstract: The technology provides a model-based approach for multilingual text rewriting that is applicable across many languages and across different styles including formality levels or other textual attributes. The model is configured to manipulate both language and textual attributes jointly. This approach supports zero-shot formality-sensitive translation, with no labeled data in the target language. An encoder-decoder architectural approach with attribute extraction is used to train rewriter models that can thus be used in “universal” textual rewriting across many different languages. A cross-lingual learning signal can be incorporated into the training approach. Certain training processes do not employ any exemplars. This approach enables not just straight translation, but also the ability to create new sentences with different attributes.
-
公开(公告)号:US20240020546A1
公开(公告)日:2024-01-18
申请号:US17863840
申请日:2022-07-13
Applicant: Google LLC
Inventor: Tu Thanh Vu , Daniel Matthew Cer , Noah Constant , Brian David Lester , Rami Al-Rfou
IPC: G06N5/02
CPC classification number: G06N5/022
Abstract: Systems and methods for prompt tuning can utilize previously-learned prompts for the initialization of tuning for prompts on different tasks that may differ from the task associated with the previously-learned prompt. The prompt being utilized for initialization can be a generic prompt and/or may be a prompt selected based on a determined similarity between two or more task embeddings.
-
公开(公告)号:US11238211B2
公开(公告)日:2022-02-01
申请号:US16978658
申请日:2019-03-14
Applicant: Google LLC
Inventor: Jan van de Kerkhof , Balint Miklos , Amr Abdelfattah , Tobias Kaufmann , László Lukacs , Bjarke Ebert , Victor Anchidin , Brian Strope , Heeyoung Lee , Yun-hsuan Sung , Noah Constant , Neil Smith
IPC: G06F17/00 , G06F40/134 , G06F40/166 , G06F40/30
Abstract: A system may use a machine-learned model to determine whether to classify a sequence of one or more words within a first document that is being edited as a candidate hyperlink based at least in part on context associated with the first document. In response to classifying the sequence of one or more words as the candidate hyperlink, the system may use the machine-learned model and based at least in part on the sequence of one or more words and the context to determine one or more candidate document to be hyperlinked from the sequence of one or more words. In response to receiving an indication of a second document being selected out of the one or more candidate documents, the system may modify the first document to associate the sequence of one or more words with a hyperlink to the second document.
-
-
-
-