AUTOMATIC POST-EDITING MODEL FOR NEURAL MACHINE TRANSLATION

    公开(公告)号:US20220215183A1

    公开(公告)日:2022-07-07

    申请号:US17700123

    申请日:2022-03-21

    Applicant: GOOGLE LLC

    Abstract: Techniques are disclosed for training and/or utilizing an automatic post-editing model in correcting translation error(s) introduced by a neural machine translation model. The automatic post-editing model can be trained using automatically generated training instances. A training instance is automatically generated by processing text in a first language using a neural machine translation model to generate text in a second language. The text in the second language is processed using a neural machine translation model to generate training text in the first language. A training instance can include the text in the first language as well as the training text in the first language.

    AUTOMATIC POST-EDITING MODEL FOR GENERATED NATURAL LANGUAGE TEXT

    公开(公告)号:US20240370666A1

    公开(公告)日:2024-11-07

    申请号:US18773129

    申请日:2024-07-15

    Applicant: GOOGLE LLC

    Abstract: Techniques are disclosed for training and/or utilizing an automatic post-editing model in correcting translation error(s) introduced by a neural machine translation model. The automatic post-editing model can be trained using automatically generated training instances. A training instance is automatically generated by processing text in a first language using a neural machine translation model to generate text in a second language. The text in the second language is processed using a neural machine translation model to generate training text in the first language. A training instance can include the text in the first language as well as the training text in the first language.

    AUTOMATIC EVALUATION OF NATURAL LANGUAGE TEXT GENERATED BASED ON STRUCTURED DATA

    公开(公告)号:US20220215184A1

    公开(公告)日:2022-07-07

    申请号:US17608616

    申请日:2019-08-22

    Applicant: GOOGLE LLC

    Abstract: Techniques are disclosed for training and/or utilizing an alignments and language model (“ALM”) in automatically determining an ALM score corresponding with natural language text generated using a natural language generation model. The natural language text generated using the natural language generation model can be based on a set of structured data. Additionally or alternatively, the ALM can include a fluency model portion and a semantics model portion. The fluency model portion can be used in determining the fluency and/or grammar of the text. The semantics model portion be used in evaluating the content of the natural language text with respect to the content of the structured data.

    Automatic evaluation of natural language text generated based on structured data

    公开(公告)号:US12073187B2

    公开(公告)日:2024-08-27

    申请号:US17608616

    申请日:2019-08-22

    Applicant: GOOGLE LLC

    CPC classification number: G06F40/56 G06F40/226 G06F40/30

    Abstract: Techniques are disclosed for training and/or utilizing an alignments and language model (“ALM”) in automatically determining an ALM score corresponding with natural language text generated using a natural language generation model. The natural language text generated using the natural language generation model can be based on a set of structured data. Additionally or alternatively, the ALM can include a fluency model portion and a semantics model portion. The fluency model portion can be used in determining the fluency and/or grammar of the text. The semantics model portion be used in evaluating the content of the natural language text with respect to the content of the structured data.

    Automatic post-editing model for neural machine translation

    公开(公告)号:US11295092B2

    公开(公告)日:2022-04-05

    申请号:US16511806

    申请日:2019-07-15

    Applicant: Google LLC

    Abstract: Techniques are disclosed for training and/or utilizing an automatic post-editing model in correcting translation error(s) introduced by a neural machine translation model. The automatic post-editing model can be trained using automatically generated training instances. A training instance is automatically generated by processing text in a first language using a neural machine translation model to generate text in a second language. The text in the second language is processed using a neural machine translation model to generate training text in the first language. A training instance can include the text in the first language as well as the training text in the first language.

    Minimum Bayes Risk Decoding with Neural Quality Metrics

    公开(公告)号:US20230259759A1

    公开(公告)日:2023-08-17

    申请号:US17673714

    申请日:2022-02-16

    Applicant: Google LLC

    CPC classification number: G06N3/08

    Abstract: Provided are systems and methods for sequence-to-sequence modeling with neural quality metrics. More particularly, example aspects of the present disclosure relate to minimum bayes risk (MBR) decoding with neural metrics for machine translation. According to example aspects of the present disclosure, a set of candidate outputs can be sampled from a machine translation model given a source sequence. Given the set of candidate outputs, systems and methods according to example aspects of the present disclosure can select a hypothesis with high expected utility with respect to the distribution over a set of pseudo-references from the machine translation model.

    AUTOMATIC POST-EDITING MODEL FOR NEURAL MACHINE TRANSLATION

    公开(公告)号:US20210019373A1

    公开(公告)日:2021-01-21

    申请号:US16511806

    申请日:2019-07-15

    Applicant: Google LLC

    Abstract: Techniques are disclosed for training and/or utilizing an automatic post-editing model in correcting translation error(s) introduced by a neural machine translation model. The automatic post-editing model can be trained using automatically generated training instances. A training instance is automatically generated by processing text in a first language using a neural machine translation model to generate text in a second language. The text in the second language is processed using a neural machine translation model to generate training text in the first language. A training instance can include the text in the first language as well as the training text in the first language.

Patent Agency Ranking