Dataset Refining with Machine Translation Quality Prediction

    公开(公告)号:US20230025739A1

    公开(公告)日:2023-01-26

    申请号:US17852863

    申请日:2022-06-29

    Applicant: Google LLC

    Abstract: Aspects of the technology employ a machine translation quality prediction (MTQP) model to refine datasets that are used in training machine translation systems. This includes receiving, by a machine translation quality prediction model, a sentence pair of a source sentence and a translated output (802). Then performing feature extraction on the sentence pair using a set of two or more feature extractors, where each feature extractor generates a corresponding feature vector (804). The corresponding feature vectors from the set of feature extractors are concatenated together (806). And the concatenated feature vectors are applied to a feedforward neural network, in which the feedforward neural network generates a machine translation quality prediction score for the translated output (808).

Patent Agency Ranking