-
公开(公告)号:US20220215183A1
公开(公告)日:2022-07-07
申请号:US17700123
申请日:2022-03-21
Applicant: GOOGLE LLC
Inventor: Markus Freitag , Isaac Caswell , Howard Scott Roy
Abstract: Techniques are disclosed for training and/or utilizing an automatic post-editing model in correcting translation error(s) introduced by a neural machine translation model. The automatic post-editing model can be trained using automatically generated training instances. A training instance is automatically generated by processing text in a first language using a neural machine translation model to generate text in a second language. The text in the second language is processed using a neural machine translation model to generate training text in the first language. A training instance can include the text in the first language as well as the training text in the first language.
-
公开(公告)号:US20240370666A1
公开(公告)日:2024-11-07
申请号:US18773129
申请日:2024-07-15
Applicant: GOOGLE LLC
Inventor: Markus Freitag , Isaac Caswell , Howard Scott Roy
IPC: G06F40/51 , G06F40/166 , G06F40/253 , G06F40/58 , G10L13/00
Abstract: Techniques are disclosed for training and/or utilizing an automatic post-editing model in correcting translation error(s) introduced by a neural machine translation model. The automatic post-editing model can be trained using automatically generated training instances. A training instance is automatically generated by processing text in a first language using a neural machine translation model to generate text in a second language. The text in the second language is processed using a neural machine translation model to generate training text in the first language. A training instance can include the text in the first language as well as the training text in the first language.
-
公开(公告)号:US20220215184A1
公开(公告)日:2022-07-07
申请号:US17608616
申请日:2019-08-22
Applicant: GOOGLE LLC
Inventor: Markus Freitag , Howard Scott Roy
IPC: G06F40/56 , G06F40/226 , G06F40/30
Abstract: Techniques are disclosed for training and/or utilizing an alignments and language model (“ALM”) in automatically determining an ALM score corresponding with natural language text generated using a natural language generation model. The natural language text generated using the natural language generation model can be based on a set of structured data. Additionally or alternatively, the ALM can include a fluency model portion and a semantics model portion. The fluency model portion can be used in determining the fluency and/or grammar of the text. The semantics model portion be used in evaluating the content of the natural language text with respect to the content of the structured data.
-
公开(公告)号:US12039286B2
公开(公告)日:2024-07-16
申请号:US17700123
申请日:2022-03-21
Applicant: GOOGLE LLC
Inventor: Markus Freitag , Isaac Caswell , Howard Scott Roy
IPC: G06F40/51 , G06F40/166 , G06F40/253 , G06F40/58 , G10L13/00
CPC classification number: G06F40/51 , G06F40/166 , G06F40/253 , G06F40/58 , G10L13/00
Abstract: Techniques are disclosed for training and/or utilizing an automatic post-editing model in correcting translation error(s) introduced by a neural machine translation model. The automatic post-editing model can be trained using automatically generated training instances. A training instance is automatically generated by processing text in a first language using a neural machine translation model to generate text in a second language. The text in the second language is processed using a neural machine translation model to generate training text in the first language. A training instance can include the text in the first language as well as the training text in the first language.
-
公开(公告)号:US12073187B2
公开(公告)日:2024-08-27
申请号:US17608616
申请日:2019-08-22
Applicant: GOOGLE LLC
Inventor: Markus Freitag , Howard Scott Roy
IPC: G06F40/56 , G06F40/226 , G06F40/30
CPC classification number: G06F40/56 , G06F40/226 , G06F40/30
Abstract: Techniques are disclosed for training and/or utilizing an alignments and language model (“ALM”) in automatically determining an ALM score corresponding with natural language text generated using a natural language generation model. The natural language text generated using the natural language generation model can be based on a set of structured data. Additionally or alternatively, the ALM can include a fluency model portion and a semantics model portion. The fluency model portion can be used in determining the fluency and/or grammar of the text. The semantics model portion be used in evaluating the content of the natural language text with respect to the content of the structured data.
-
公开(公告)号:US11295092B2
公开(公告)日:2022-04-05
申请号:US16511806
申请日:2019-07-15
Applicant: Google LLC
Inventor: Markus Freitag , Isaac Caswell , Howard Scott Roy
Abstract: Techniques are disclosed for training and/or utilizing an automatic post-editing model in correcting translation error(s) introduced by a neural machine translation model. The automatic post-editing model can be trained using automatically generated training instances. A training instance is automatically generated by processing text in a first language using a neural machine translation model to generate text in a second language. The text in the second language is processed using a neural machine translation model to generate training text in the first language. A training instance can include the text in the first language as well as the training text in the first language.
-
公开(公告)号:US20250077850A1
公开(公告)日:2025-03-06
申请号:US18823288
申请日:2024-09-03
Applicant: GOOGLE LLC
Inventor: Mara Finkelstein , Qijun Tan , Markus Freitag , Apurva Pradip Shah
IPC: G06N3/0475
Abstract: Implementations disclose utilizing a less computationally efficient decoding method in automatically generating corresponding single generative content predictions for training instances and fine-tuning a student generative model based on those automatically generated training instances. Those implementations are further directed to then utilizing, in an inference time environment, the fine-tuned student generative model and a more computationally efficient decoding method in generating generative predictions—and without any utilization of the less computationally efficient decoding method in generating the generative predictions.
-
公开(公告)号:US20230259759A1
公开(公告)日:2023-08-17
申请号:US17673714
申请日:2022-02-16
Applicant: Google LLC
Inventor: Qijun Tan , Markus Freitag , David Grangier
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: Provided are systems and methods for sequence-to-sequence modeling with neural quality metrics. More particularly, example aspects of the present disclosure relate to minimum bayes risk (MBR) decoding with neural metrics for machine translation. According to example aspects of the present disclosure, a set of candidate outputs can be sampled from a machine translation model given a source sequence. Given the set of candidate outputs, systems and methods according to example aspects of the present disclosure can select a hypothesis with high expected utility with respect to the distribution over a set of pseudo-references from the machine translation model.
-
公开(公告)号:US20210019373A1
公开(公告)日:2021-01-21
申请号:US16511806
申请日:2019-07-15
Applicant: Google LLC
Inventor: Markus Freitag , Isaac Caswell , Howard Scott Roy
Abstract: Techniques are disclosed for training and/or utilizing an automatic post-editing model in correcting translation error(s) introduced by a neural machine translation model. The automatic post-editing model can be trained using automatically generated training instances. A training instance is automatically generated by processing text in a first language using a neural machine translation model to generate text in a second language. The text in the second language is processed using a neural machine translation model to generate training text in the first language. A training instance can include the text in the first language as well as the training text in the first language.
-
-
-
-
-
-
-
-