Training a model for performing abstractive text summarization

Invention Grant

US12164550B2 Training a model for performing abstractive text summarization (in force)

Patent Title: Training a model for performing abstractive text summarization
Application No.: US17651352

Application Date: 2022-02-16
Publication No.: US12164550B2

Publication Date: 2024-12-10
Inventor: Sajad Sotudeh Gharebagh, Hanieh Deilamsalehy, Franck Dernoncourt
Applicant: Adobe Inc.
Applicant Address: US CA San Jose
Assignee: Adobe Inc.
Current Assignee: Adobe Inc.
Current Assignee Address: US CA San Jose
Agency: Weaver Austin Villeneuve & Sampson LLP
Main IPC: G06F40/30
IPC: G06F40/30; G06F16/33; G06F16/34; G06F18/21; G06F18/22; G06N3/04; G06N3/08

Abstract:

Techniques for training for and performing abstractive text summarization are disclosed. Such techniques include, in some embodiments, obtaining textual content, and generating a reconstruction of the textual content using a trained language model, the reconstructed textual content comprising an abstractive summary of the textual content generated based on relative importance parameters associated with respective portions of the textual content. In some cases, the trained language model includes a neural network language model that has been trained by identifying a plurality of discrete portions of training textual content, receiving the plurality of discrete portions of the training textual content as input to the language model, and predicting relative importance parameters associated with respective ones of the plurality of discrete portions of the training textual content, the relative importance parameters each being based at least on one or more linguistic similarity measures with respect to a ground truth.
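
As a rough illustration of the training signal described in the abstract, the sketch below assigns each discrete portion of a training text (here, a sentence) a relative importance value based on its similarity to a ground-truth summary. The abstract does not name the similarity measure, so unigram-overlap F1 (a ROUGE-1-like stand-in) is an assumption, as are the function names `tokenize`, `unigram_f1`, and `importance_targets` and the choice to normalize the scores so they sum to 1. In a setup like the one described, values of this kind would serve as targets that the language model learns to predict; this is not the patented implementation.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def unigram_f1(candidate, reference):
    """Unigram-overlap F1 between two strings (assumed similarity measure)."""
    cand = Counter(tokenize(candidate))
    ref = Counter(tokenize(reference))
    overlap = sum((cand & ref).values())  # clipped count of shared tokens
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def importance_targets(sentences, reference_summary):
    """Score each sentence against the ground truth, normalized to sum to 1."""
    if not sentences:
        return []
    scores = [unigram_f1(s, reference_summary) for s in sentences]
    total = sum(scores)
    if total == 0:
        # No sentence overlaps the reference: fall back to uniform weights.
        return [1.0 / len(sentences)] * len(sentences)
    return [score / total for score in scores]


if __name__ == "__main__":
    sentences = [
        "The model summarizes long documents.",
        "Lunch was served at noon.",
        "Importance scores guide the summary.",
    ]
    reference = "The model uses importance scores to summarize documents."
    for sentence, weight in zip(sentences, importance_targets(sentences, reference)):
        print(f"{weight:.2f}  {sentence}")
```

Sentences that share no vocabulary with the reference summary receive a weight of zero, while sentences that overlap with it split the remaining weight in proportion to their scores.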

Public/Granted literature

US20230259544A1 TRAINING A MODEL FOR PERFORMING ABSTRACTIVE TEXT SUMMARIZATION Public/Granted day: 2023-08-17

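IPC classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models are classified in G06N)
G06F40/00	Handling natural language data (speech analysis or synthesis, speech recognition: G10L)
G06F40/30	. Semantic analysis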