Method and apparatus for translating speech

    公开(公告)号:US11132518B2

    公开(公告)日:2021-09-28

    申请号:US16691111

    申请日:2019-11-21

    摘要: A method and apparatus for translating speech are provided. The method may include: recognizing received to-be-recognized speech of a source language to obtain a recognized text; concatenating the obtained recognized text after a to-be-translated text, to form a concatenated to-be-translated text; inputting the concatenated to-be-translated text into a pre-trained discriminant model to obtain a discrimination result for characterizing whether the concatenated to-be-translated text is to be translated, where the discriminant model is used to characterize a corresponding relationship between a text and a discrimination result corresponding to the text; in response to the positive discrimination result being obtained, translating the concatenated to-be-translated text to obtain a translation result of a target language, and outputting the translation result.

    Method and apparatus for determining a topic

    公开(公告)号:US11366973B2

    公开(公告)日:2022-06-21

    申请号:US16691104

    申请日:2019-11-21

    摘要: Embodiments of the present disclosure disclose a method and apparatus for determining a topic. A specific embodiment of the method comprises: determining a to-be-recognized sentence sequence; calculating similarities between the to-be-recognized sentence sequence and each of topic templates in a topic template set in a target area, the each of the topic templates in the topic template set corresponding to a topic in at least one topic in the target area, the topic template including a topic section sequence, and a topic section including a topic sentence sequence; and determining a topic of the to-be-recognized sentence sequence according to an associated parameter, the associated parameter including the similarities between the to-be-recognized sentence sequence and the each of the topic templates in the topic template set. This embodiment reduces labor costs during a topic segmentation.

    METHOD AND APPARATUS FOR TRAINING MODELS IN MACHINE TRANSLATION, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20210390266A1

    公开(公告)日:2021-12-16

    申请号:US17200551

    申请日:2021-03-12

    摘要: A method and apparatus for training models in machine translation, an electronic device and a storage medium are disclosed, which relates to the field of natural language processing technologies and the field of deep learning technologies. An implementation includes mining similar target sentences of a group of samples based on a parallel corpus using a machine translation model and a semantic similarity model, and creating a first training sample set; training the machine translation model with the first training sample set; mining a negative sample of each sample in the group of samples based on the parallel corpus using the machine translation model and the semantic similarity model, and creating a second training sample set; and training the semantic similarity model with the second sample training set. With the above-mentioned technical solution of the present application, by training the two models jointly, while the semantic similarity model is trained, the machine translation model may be optimized and nurtures the semantic similarity model, thus further improving the accuracy of the semantic similarity model.

    METHOD AND APPARATUS FOR DETERMINING A TOPIC

    公开(公告)号:US20200210522A1

    公开(公告)日:2020-07-02

    申请号:US16691104

    申请日:2019-11-21

    IPC分类号: G06F17/27 G06F17/24

    摘要: Embodiments of the present disclosure disclose a method and apparatus for determining a topic. A specific embodiment of the method comprises: determining a to-be-recognized sentence sequence; calculating similarities between the to-be-recognized sentence sequence and each of topic templates in a topic template set in a target area, the each of the topic templates in the topic template set corresponding to a topic in at least one topic in the target area, the topic template including a topic section sequence, and a topic section including a topic sentence sequence; and determining a topic of the to-be-recognized sentence sequence according to an associated parameter, the associated parameter including the similarities between the to-be-recognized sentence sequence and the each of the topic templates in the topic template set. This embodiment reduces labor costs during a topic segmentation.

    Text translation method, device, and storage medium

    公开(公告)号:US11314946B2

    公开(公告)日:2022-04-26

    申请号:US16701382

    申请日:2019-12-03

    摘要: Embodiments of the present disclosure disclose a text translation method, a text translation apparatus, a device and a storage medium. The method includes: obtaining a source language text; and translating the source language text with a modified translation model to obtain a target language text corresponding to the source language text, the modified translation model being obtained by modifying an original translation model based on a text evaluation result of one or more translated texts for training, the translated text for training being an output result after translating through the original translation model, and the text evaluation result for evaluating a contextual semantic relation in the translated text for training.