MULTIMODAL TRANSLATION METHOD, APPARATUS, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM

    Publication number: US20220092276A1

    Publication date: 2022-03-24

    Application number: US17479195

    Application date: 2021-09-20

    Abstract: A method for providing multimodal translation of content in a source language is provided. The method includes: receiving a user input with respect to a translation request for text included in the content; in response to receiving the user input, acquiring a multimodal input from the content, the multimodal input including location information related to the content and other multimodal inputs; generating scene information representing the multimodal input related to the content by using a fusion layer based on the location information and the other multimodal inputs; identifying a candidate word set in a target language; determining at least one candidate word from the candidate word set based on the scene information; and translating the text included in the content into the target language using a translation model based on the determined at least one candidate word.
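
The scene-based candidate selection in this abstract can be illustrated with a minimal sketch. The abstract does not disclose the internals of the fusion layer, so the weighted average in `fuse`, the cosine ranking in `select_candidates`, and every vector and name below are hypothetical stand-ins rather than the claimed method.

```python
import math

def fuse(location_vec, other_vecs, location_weight=0.5):
    """Toy stand-in for a fusion layer (assumption): a weighted average of
    the location features and the mean of the other modality features."""
    n = len(location_vec)
    mean_other = [sum(v[i] for v in other_vecs) / len(other_vecs) for i in range(n)]
    return [
        location_weight * location_vec[i] + (1.0 - location_weight) * mean_other[i]
        for i in range(n)
    ]

def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def select_candidates(scene_vec, candidate_vecs, top_k=1):
    """Rank target-language candidate words by similarity to the scene vector."""
    ranked = sorted(
        candidate_vecs,
        key=lambda word: cosine(scene_vec, candidate_vecs[word]),
        reverse=True,
    )
    return ranked[:top_k]

# Hypothetical 2-d features: axis 0 ~ "restaurant", axis 1 ~ "finance".
scene = fuse([1.0, 0.0], [[0.8, 0.2]])
best = select_candidates(scene, {"menu": [1.0, 0.0], "bank": [0.0, 1.0]})
```

In this toy setup the selected words would then be passed to the translation model as constraints or hints; how that is done is not specified in the abstract.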

    DATA PROCESSING METHOD, DEVICE WAKE-UP METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM

    Publication number: US20240046920A1

    Publication date: 2024-02-08

    Application number: US18473701

    Application date: 2023-09-25

    CPC classification number: G10L15/08 G10L25/51 G10L2015/088

    Abstract: A data processing method, a device wake-up method, an electronic device, and a storage medium are provided. In the data processing method, speech to be processed is converted into a keyword phone sequence, and a similar pronunciation sequence generator acquires a similar phone sequence corresponding to the keyword phone sequence in a sequence generation manner, thereby acquiring a first data processed result corresponding to the speech to be processed. By replacing the search method of large-scale speech databases with this generation manner, effective coverage of possible real-life sounds can be achieved with a smaller model, thus improving the ability to distinguish confusing pronunciations. The above data processing method performed by the electronic device can be performed by an artificial intelligence (AI) model.
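
To make the idea of generating similar phone sequences concrete, here is a minimal sketch. The abstract describes a generator model; the hand-written confusion table used below is only a rule-based stand-in, and the function name, the table, and the ARPAbet-style phone symbols are all assumptions for illustration.

```python
def similar_phone_sequences(phones, confusable):
    """Return variants of `phones` that differ in exactly one position,
    using a table of confusable phones (hypothetical stand-in for the
    learned similar pronunciation sequence generator)."""
    variants = []
    for i, phone in enumerate(phones):
        for alt in confusable.get(phone, ()):
            variants.append(phones[:i] + [alt] + phones[i + 1:])
    return variants

# Illustrative keyword phone sequence and confusion table.
keyword = ["HH", "AY"]
table = {"AY": ["EY", "OY"]}
near_misses = similar_phone_sequences(keyword, table)
```

Variants produced this way could serve as confusable negatives when checking a wake-up keyword, which is the role the abstract assigns to the generated sequences.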

    APPARATUS AND METHOD FOR DETECTING BODY PARTS FROM USER IMAGE
    Type: Invention application (granted)

    Publication number: US20140307955A1

    Publication date: 2014-10-16

    Application number: US14249464

    Application date: 2014-04-10

    CPC classification number: G06K9/00362 G06K9/4642

    Abstract: An apparatus for detecting a body part from a user image may include an image acquirer to acquire a depth image, an extractor to extract the user image from a foreground of the acquired depth image, and a body part detector to detect the body part from the user image, using a classifier trained based on at least one of a single-user image sample and a multi-user image sample. The single-user image may be an image representing non-overlapping users, and the multi-user image may be an image representing overlapping users.
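
The extractor step can be sketched as a simple depth cutoff. The abstract does not say how the foreground is determined, so the fixed threshold, the function name, and the convention that a depth of 0 means "no reading" are assumptions; the trained body part classifier is not sketched here.

```python
def extract_foreground(depth_image, max_depth_mm):
    """Keep pixels with a valid depth no farther than `max_depth_mm`;
    set everything else to 0 (treated here as background)."""
    return [
        [d if 0 < d <= max_depth_mm else 0 for d in row]
        for row in depth_image
    ]

# Illustrative 2x3 depth image in millimetres.
depth = [
    [0, 800, 3000],
    [1200, 2500, 900],
]
user_image = extract_foreground(depth, max_depth_mm=1500)
```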

    DEVICE AND METHOD FOR ESTIMATING HEAD POSE
    Type: Invention application (granted)

    Publication number: US20140119655A1

    Publication date: 2014-05-01

    Application number: US14065833

    Application date: 2013-10-29

    Abstract: Provided are a device and a method for estimating a head pose that may obtain an excellent head pose recognition result unaffected by illumination changes. The device includes a head area extracting unit to extract a head area from an input depth image, a head pitch angle estimating unit to estimate a pitch angle of a head in the head area, a head yaw angle estimating unit to estimate a yaw angle of the head in the head area, and a head pose displaying unit to display a head pose based on the estimated pitch angle of the head and the estimated yaw angle of the head.
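
One way a displaying unit might combine the two estimated angles is to convert them into a facing direction. The sketch below assumes a coordinate convention that the abstract does not state (x to the right, y up, z pointing out of the face when both angles are zero); the function name is hypothetical.

```python
import math

def head_direction(pitch_deg, yaw_deg):
    """Convert pitch (rotation up/down) and yaw (rotation left/right), both
    in degrees, into a unit direction vector under the assumed axes."""
    pitch = math.radians(pitch_deg)
    yaw = math.radians(yaw_deg)
    x = math.sin(yaw) * math.cos(pitch)
    y = math.sin(pitch)
    z = math.cos(yaw) * math.cos(pitch)
    return (x, y, z)

# A head looking straight ahead points along +z under this convention.
forward = head_direction(0.0, 0.0)
```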

    MACHINE TRANSLATION METHOD, DEVICES, AND STORAGE MEDIA

    Publication number: US20230401391A1

    Publication date: 2023-12-14

    Application number: US18209790

    Application date: 2023-06-14

    CPC classification number: G06F40/47 G06F40/284

    Abstract: A method performed by an electronic device comprises acquiring information to be translated. The method includes determining, based on the information to be translated, a target domain adapter from a plurality of candidate domain adapters, the target domain adapter corresponding to the information to be translated, each candidate domain adapter from the plurality of candidate domain adapters corresponding to at least one domain. The method includes obtaining, based on the target domain adapter corresponding to the information to be translated, a translation result corresponding to the information to be translated.
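
The adapter selection step can be sketched as a routing function. The abstract does not describe how the target domain adapter is determined, so the keyword-overlap scoring, the function names, and the callable adapters below are hypothetical placeholders for whatever selection and adapter modules the patent actually claims.

```python
def select_adapter(text, adapter_keywords):
    """Pick the domain whose keyword set overlaps most with the input text.
    Ties resolve to the first domain in dictionary order."""
    tokens = set(text.lower().split())
    return max(adapter_keywords, key=lambda name: len(tokens & adapter_keywords[name]))

def translate(text, adapter_keywords, adapters):
    """Route the text to the selected domain adapter and return
    (domain name, adapter output)."""
    name = select_adapter(text, adapter_keywords)
    return name, adapters[name](text)

# Illustrative domains; each adapter is a stand-in callable.
keywords = {
    "medical": {"patient", "dose"},
    "legal": {"contract", "court"},
}
adapters = {
    "medical": lambda t: "[med] " + t,
    "legal": lambda t: "[law] " + t,
}
domain, result = translate("the court ruled", keywords, adapters)
```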
