METHOD AND APPARATUS FOR PRE-TRAINING SEMANTIC REPRESENTATION MODEL AND ELECTRONIC DEVICE

Invention Publication

US20230147550A1 METHOD AND APPARATUS FOR PRE-TRAINING SEMANTIC REPRESENTATION MODEL AND ELECTRONIC DEVICE 审中-公开

Please log in to see more content

Patent Title: METHOD AND APPARATUS FOR PRE-TRAINING SEMANTIC REPRESENTATION MODEL AND ELECTRONIC DEVICE
Application No.: US18051594

Application Date: 2022-11-01
Publication No.: US20230147550A1

Publication Date: 2023-05-11
Inventor: Dongliang HE , Errui DING
Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Applicant Address: CN Beijing
Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Current Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Current Assignee Address: CN Beijing
Priority: CN 2111307885.6 2021.11.05
Main IPC: G06V10/774
IPC: G06V10/774 ; G06V20/40 ; G06F40/30 ; G06V30/19

METHOD AND APPARATUS FOR PRE-TRAINING SEMANTIC REPRESENTATION MODEL AND ELECTRONIC DEVICE

Abstract:

A method for pre-training a semantic representation model includes: for each video-text pair in pre-training data, determining a mask image sequence, a mask character sequence, and a mask image-character sequence of the video-text pair; determining a plurality of feature sequences and mask position prediction results respectively corresponding to the plurality of feature sequences by inputting the mask image sequence, the mask character sequence, and the mask image-character sequence into an initial semantic representation model; and building a loss function based on the plurality of feature sequences, the mask position prediction results respectively corresponding to the plurality of feature sequences and true mask position results, and adjusting coefficients of the semantic representation model to realize training.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V10/00	图像或视频识别或理解的安排（图像或视频中的字符识别 G06V30/10）
G06V10/70	.使用模式识别或机器学习（光学模式识别或电子计算 G06V10/88）
G06V10/77	..处理特征空间中的图像或视频特征；使用数据集成或数据缩减，例如主成分分析 [PCA] 或独立成分分析 [ICA] 或自组织图 [SOM]；盲源分离
G06V10/774	...生成训练模式集；引导方法，例如捕获或促进