Techniques for distributed optical character recognition and distributed machine language translation
    11.
    发明授权
    Techniques for distributed optical character recognition and distributed machine language translation 有权
    分布式光学字符识别和分布式机器语言翻译技术

    公开(公告)号:US09514376B2

    公开(公告)日:2016-12-06

    申请号:US14264296

    申请日:2014-04-29

    Applicant: Google Inc.

    CPC classification number: G06K9/00979 G06F17/289 G06K2209/01

    Abstract: A technique for selectively distributing OCR and/or machine language translation tasks between a mobile computing device and server(s) includes receiving, at the mobile computing device, an image of an object comprising a text. The mobile computing device can determine a degree of optical character recognition (OCR) complexity for obtaining the text from the image. Based on this degree of OCR complexity, the mobile computing device and/or the server(s) can perform OCR to obtain an OCR text. The mobile computing device can then determine a degree of translation complexity for translating the OCR text from its source language to a target language. Based on this degree of translation complexity, the mobile computing device and/or the server(s) can perform machine language translation of the OCR text from the source language to a target language to obtain a translated OCR text. The mobile computing device can then output the translated OCR text.

    Abstract translation: 用于在移动计算设备和服务器之间选择性地分发OCR和/或机器语言翻译任务的技术包括在移动计算设备处接收包括文本的对象的图像。 移动计算设备可以确定从图像中获得文本的光学字符识别(OCR)复杂程度。 基于这种程度的OCR复杂度,移动计算设备和/或服务器可以执行OCR以获得OCR文本。 然后,移动计算设备可以确定将OCR文本从其源语言翻译成目标语言的翻译复杂程度。 基于这种翻译复杂度,移动计算设备和/或服务器可以执行OCR文本从源语言到目标语言的机器语言翻译,以获得翻译的OCR文本。 然后,移动计算设备可以输出翻译的OCR文本。

    LARGE LANGUAGE MODELS IN MACHINE TRANSLATION
    12.
    发明申请
    LARGE LANGUAGE MODELS IN MACHINE TRANSLATION 有权
    机器翻译中的大量语言模型

    公开(公告)号:US20130346059A1

    公开(公告)日:2013-12-26

    申请号:US13709125

    申请日:2012-12-10

    Applicant: GOOGLE INC.

    CPC classification number: G06F17/2818 G06F17/2827 G06F17/2845

    Abstract: Systems, methods, and computer program products for machine translation are provided. In some implementations a system is provided. The system includes a language model including a collection of n-grams from a corpus, each n-gram having a corresponding relative frequency in the corpus and an order n corresponding to a number of tokens in the n-gram, each n-gram corresponding to a backoff n-gram having an order of n−1 and a collection of backoff scores, each backoff score associated with an n-gram, the backoff score determined as a function of a backoff factor and a relative frequency of a corresponding backoff n-gram in the corpus.

    Abstract translation: 提供了用于机器翻译的系统,方法和计算机程序产品。 在一些实现中,提供了一种系统。 该系统包括语言模型,其包括来自语料库的n-gram的集合,每个n-gram在语料库中具有对应的相对频率,并且n阶对应于n-gram中的令牌数量,每个n-gram对应 到具有n-1级的退避n-gram和回退分数的集合,与n-gram相关联的每个回退分数,作为退避因子的函数确定的退避分数和相应退避n的相对频率 -gram在语料库中。

Patent Agency Ranking