Unified vision and dialogue transformer with BERT

Invention Grant

US11562147B2 Unified vision and dialogue transformer with BERT (in force)

Patent Title: Unified vision and dialogue transformer with BERT
Application No.: US16929738

Application Date: 2020-07-15
Publication No.: US11562147B2

Publication Date: 2023-01-24
Inventor: Yue Wang, Chu Hong Hoi, Shafiq Rayhan Joty
Applicant: salesforce.com, inc.
Applicant Address: US CA San Francisco
Assignee: salesforce.com, inc.
Current Assignee: salesforce.com, inc.
Current Assignee Address: US CA San Francisco
Agency: Haynes and Boone LLP
Main IPC: G06F40/30
IPC: G06F40/30; G06F21/36; G06F40/35; G06N3/08; G06F40/284; G06K9/62

Abstract:

A visual dialogue model receives image input and text input, where the text input includes a dialogue history between the model and a human user and a current utterance by the human user. The model generates a unified contextualized representation using a transformer encoder network; the representation includes a token-level encoding of the image input and the text input. Using visual dialogue encoding layers, the model generates an encoded visual dialogue input from the unified contextualized representation. The encoded visual dialogue input includes a position-level encoding and a segment-type encoding. The model generates an answer prediction from the encoded visual dialogue input using either a first self-attention mask associated with discriminative settings or a second self-attention mask associated with generative settings. Dense annotation fine-tuning may be performed to increase the accuracy of the answer prediction. The model provides the answer prediction as a response to the current utterance of the human user.
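
The two self-attention masks mentioned in the abstract can be illustrated with a small sketch. The abstract does not spell out the mask patterns, so the layout below is an assumption: the discriminative mask is taken to be fully bidirectional, and the generative mask is taken to let answer tokens see the whole context plus earlier answer tokens only. The function names `bidirectional_mask` and `seq2seq_mask`, and the split of the sequence into `n_ctx` context positions followed by `n_ans` answer positions, are hypothetical and chosen only for illustration.

```python
def bidirectional_mask(n_ctx, n_ans):
    """Assumed discriminative mask: every position may attend to every position."""
    n = n_ctx + n_ans
    return [[1] * n for _ in range(n)]


def seq2seq_mask(n_ctx, n_ans):
    """Assumed generative mask.

    Rows are query positions, columns are key positions; 1 means "may attend".
    Context positions (image + dialogue history + current utterance) attend
    only to the context. Answer position i attends to the whole context and
    to answer positions up to and including itself.
    """
    n = n_ctx + n_ans
    mask = [[0] * n for _ in range(n)]
    for q in range(n):
        for k in range(n):
            if k < n_ctx:
                mask[q][k] = 1  # context columns are visible to every query
            elif q >= n_ctx and k <= q:
                mask[q][k] = 1  # left-to-right visibility inside the answer
    return mask


if __name__ == "__main__":
    # Print the assumed generative mask for 3 context and 2 answer positions.
    for row in seq2seq_mask(3, 2):
        print(row)
```

Under these assumptions, the same encoder weights could serve both settings, with only the mask switched at answer-prediction time.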

Public/Granted literature

US20210232773A1 Unified Vision and Dialogue Transformer with BERT, Public/Granted day: 2021-07-29

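IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F40/00	Handling natural language data (speech analysis or synthesis, speech recognition: see G10L)
G06F40/30	. Semantic analysis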