APPARATUS AND METHOD FOR SHARING AND PRUNING WEIGHTS FOR VISION AND LANGUAGE MODELS

Invention Publication

US20240119077A1 APPARATUS AND METHOD FOR SHARING AND PRUNING WEIGHTS FOR VISION AND LANGUAGE MODELS 审中-公开

Please log in to see more content

Patent Title: APPARATUS AND METHOD FOR SHARING AND PRUNING WEIGHTS FOR VISION AND LANGUAGE MODELS
Application No.: US18368353

Application Date: 2023-09-14
Publication No.: US20240119077A1

Publication Date: 2024-04-11
Inventor: Shangqian GAO , Burak UZKENT , Yilin SHEN , Hongxia JIN
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Applicant Address: KR Suwon-si
Assignee: SAMSUNG ELECTRONICS CO., LTD.
Current Assignee: SAMSUNG ELECTRONICS CO., LTD.
Current Assignee Address: KR Suwon-si
Main IPC: G06F16/33
IPC: G06F16/33 ; G06F16/583 ; G06N3/0985

APPARATUS AND METHOD FOR SHARING AND PRUNING WEIGHTS FOR VISION AND LANGUAGE MODELS

Abstract:

A method of performing a multimodal tasks by using a multimodal model that includes a text encoder and a vision encoder, may include obtaining a text feature from the query via the text encoder; obtaining an image feature from the one or more input images via the vision encoder; and outputting a response to the query based on similarity between the text feature and the image feature, wherein weights vectors of the text encoder and the vision encoder are pruned and shared according to a sharing vector and a pruning vector that are generated by a hypernetwork, and wherein the hypernetwork and the multimodal model are jointly trained to minimize at least one of a difference between the weight vectors in the text encoder and the vision encoder, a difference between the weight vectors in different layers of the text encoder, and a number of parameters in the multimodal model.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F16/00	信息检索；数据库结构；文件系统结构
G06F16/30	.•非结构文本数据（文档管理系统入G06F 16/93）
G06F16/33	..••查询