Training text summarization neural networks with an extracted segments prediction objective

Invention Grant

US11803751B2 Training text summarization neural networks with an extracted segments prediction objective 有权

Please log in to see more content

Patent Title: Training text summarization neural networks with an extracted segments prediction objective
Application No.: US17140863

Application Date: 2021-01-04
Publication No.: US11803751B2

Publication Date: 2023-10-31
Inventor: Mohammad Saleh , Jingqing Zhang , Yao Zhao , Peter J. Liu
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Fish & Richardson P.C.
Main IPC: G06N3/08
IPC: G06N3/08 ; G06F40/30 ; G06N3/045

Training text summarization neural networks with an extracted segments prediction objective

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a text summarization neural network. One of the methods includes pre-training the text summarization neural network including learning values of a plurality of network parameters through self-supervised learning using unlabeled data comprising unlabeled first texts, the pre-training including: obtaining an unlabeled first text comprising a plurality of segments; selecting one or more of the plurality of segments; processing a masked first text that excludes the one or more selected segments to generate a prediction of the one or more selected segments; and determining, based on a difference between the prediction and the one or more selected segments, an update to the current values of the plurality of network parameters; adapting the pre-trained text summarization neural network for a specific text summarization task using labeled data comprising second texts and respective summaries of the second texts.

Public/Granted literature

US20210350229A1 TRAINING TEXT SUMMARIZATION NEURAL NETWORKS WITH AN EXTRACTED SEGMENTS PREDICTION OBJECTIVE Public/Granted day:2021-11-11

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法