Method and system for generation of candidate translations

    Publication No.: US10860808B2

    Publication date: 2020-12-08

    Application No.: US16267207

    Filing date: 2019-02-04

    Abstract: Implementations herein relate to methods and devices for generating candidate translations and for quantizing text and words. A method may include generating, by a computing device, pending candidate translations of text to be translated based on predetermined translation rules. The computing device may generate translation probabilities from the text to be translated to the pending candidate translations, based on features that affect those translation probabilities and on a predetermined translation probability prediction model. The computing device may then select, as the candidate translations of the text to be translated, a predetermined number of pending candidate translations whose translation probabilities are higher than those of the other pending candidate translations. In implementations, the features that affect the translation probabilities may include degrees of semantic similarity between the text to be translated and the candidate translations.
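The selection step described in this abstract can be sketched as follows. This is only an illustrative sketch, not the patented method: the names `PendingCandidate`, `cosine_similarity`, `predict_probability`, and `select_candidates` are hypothetical, the prediction model is assumed to be a simple logistic model, and semantic similarity is assumed to be the cosine of two quantized (vector) representations.

```python
from dataclasses import dataclass
from math import exp, sqrt


@dataclass
class PendingCandidate:
    text: str
    vector: list[float]  # assumed quantized (vector) form of the candidate


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def predict_probability(features, weights, bias=0.0):
    """Assumed prediction model: logistic function of a weighted feature sum."""
    z = bias + sum(weights.get(name, 0.0) * value for name, value in features.items())
    return 1.0 / (1.0 + exp(-z))


def select_candidates(source_vector, pending, weights, k):
    """Return the texts of the k pending candidates with the highest probability."""
    scored = []
    for cand in pending:
        features = {
            "semantic_similarity": cosine_similarity(source_vector, cand.vector),
        }
        scored.append((predict_probability(features, weights), cand.text))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:k]]
```

A real system would combine semantic similarity with further features (for example phrase translation probabilities); only one feature is used here to keep the sketch short.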

    ENABLING RHETORICAL ANALYSIS VIA THE USE OF COMMUNICATIVE DISCOURSE TREES

    Publication No.: US20200380214A1

    Publication date: 2020-12-03

    Application No.: US16998915

    Filing date: 2020-08-20

    Inventor: Boris Galitsky

    Abstract: Systems, devices, and methods of the present invention calculate a rhetorical relationship between one or more sentences. In an example, a computer-implemented method accesses a sentence comprising a plurality of fragments. At least one fragment includes a verb and words. Each word includes a role of the word within the fragment. Each fragment is an elementary discourse unit. The method generates a discourse tree that represents rhetorical relationships between the sentence fragments. The discourse tree includes nonterminal and terminal nodes; each nonterminal node represents a rhetorical relationship between two of the sentence fragments, and each terminal node is associated with one of the sentence fragments. The method matches each fragment that has a verb to a verb signature, thereby creating a communicative discourse tree.
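The tree structure described in this abstract can be pictured with a small data-structure sketch. The class names `Terminal` and `Nonterminal`, the function `attach_signatures`, and the format of the signature strings are all assumptions made for illustration; the abstract does not specify how verb signatures are stored or matched.

```python
from dataclasses import dataclass
from typing import Optional, Union


@dataclass
class Terminal:
    """Leaf node: one elementary discourse unit (sentence fragment)."""
    fragment: str
    verb: Optional[str] = None       # main verb of the fragment, if any
    signature: Optional[str] = None  # filled in when a verb signature matches


@dataclass
class Nonterminal:
    """Inner node: a rhetorical relationship between two subtrees."""
    relation: str
    left: "Node"
    right: "Node"


Node = Union[Terminal, Nonterminal]


def attach_signatures(node, signatures):
    """Label each terminal whose verb has a known signature; return the match count."""
    if isinstance(node, Terminal):
        if node.verb in signatures:
            node.signature = signatures[node.verb]
            return 1
        return 0
    return attach_signatures(node.left, signatures) + attach_signatures(node.right, signatures)
```

Once the matching terminals carry signatures, the plain discourse tree has become what the abstract calls a communicative discourse tree.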

    Training statistical speech translation systems from speech

    Publication No.: US10755054B1

    Publication date: 2020-08-25

    Application No.: US16010168

    Filing date: 2018-06-15

    Applicant: Facebook, Inc.

    Abstract: An iterative language translation system includes multiple communicatively connected statistical speech translation systems. The system includes an automatic speech recognition component adapted to recognize spoken language in a source language and to create a source language hypothesis. A machine translation component is adapted to translate the source language hypothesis into a target language. The system also includes a second automatic speech recognition component and a second machine translation component. The translation results are used to adapt the automatic speech recognition components, and the language hypotheses are used to adapt the machine translation components.

    SENTENCE GENERATING METHOD AND APPARATUS
    Patent application

    Publication No.: US20200175233A1

    Publication date: 2020-06-04

    Application No.: US16745947

    Filing date: 2020-01-17

    Inventors: Jihyun LEE, Hoshik LEE

    Abstract: A sentence generating method includes: generating a corresponding word set of a source word set generated based on a source sentence; generating words by performing decoding based on feature vectors generated through encoding of the source sentence; adjusting a probability of at least one of the generated words based on either one or both of the source word set and the corresponding word set; and selecting character strings from different character strings including each of the generated words based on the adjusted probability and the probability as unadjusted.
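One possible reading of the adjustment and selection steps in this abstract is sketched below. The function names, the multiplicative `boost` factor, and the cap at 1.0 are assumptions; the abstract does not state how the probability is adjusted. In this reading, words found in the word set receive an adjusted probability, all other words keep their unadjusted probability, and candidate strings are ranked on the resulting mix.

```python
def adjust_probabilities(word_probs, favoured_words, boost=1.5):
    """Raise the probability of generated words that appear in the word set.

    Assumption: the adjustment is a multiplicative boost capped at 1.0.
    Words outside the set keep their unadjusted probability.
    """
    return {
        word: min(1.0, prob * boost) if word in favoured_words else prob
        for word, prob in word_probs.items()
    }


def select_strings(prefix, word_probs, favoured_words, k, boost=1.5):
    """Extend the prefix with each generated word and keep the k best strings."""
    scores = adjust_probabilities(word_probs, favoured_words, boost)
    ranked = sorted(scores, key=scores.get, reverse=True)
    return [f"{prefix} {word}" for word in ranked[:k]]
```

For example, with `word_probs = {"cat": 0.5, "dog": 0.3, "pet": 0.2}`, a word set containing only `"pet"`, and `boost=2.0`, the word `"pet"` overtakes `"dog"` in the ranking while `"cat"` stays first.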