Invention Application
- Patent Title: POINTER SENTINEL MIXTURE ARCHITECTURE
- Application No.: US15421016
- Application Date: 2017-01-31
- Publication No.: US20180082171A1
- Publication Date: 2018-03-22
- Inventor: Stephen Joseph MERITY, Caiming XIONG, James BRADBURY, Richard SOCHER
- Applicant: salesforce.com, inc.
- Applicant Address: US CA San Francisco
- Assignee: salesforce.com, inc.
- Current Assignee: salesforce.com, inc.
- Current Assignee Address: US CA San Francisco
- Main IPC: G06N3/04
- IPC: G06N3/04; G06N3/08; G06N7/00; G06F17/27

Abstract:
The technology disclosed provides a so-called “pointer sentinel mixture architecture” for neural network sequence models that can either reproduce a token from a recent context or produce a token from a predefined vocabulary. In one implementation, a pointer sentinel-LSTM architecture achieves state-of-the-art language modeling performance of 70.9 perplexity on the Penn Treebank dataset, while using far fewer parameters than a standard softmax LSTM.
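The mixing of "reproduce from context" and "produce from vocabulary" described in the abstract can be illustrated with a minimal sketch. It assumes one common formulation that the abstract itself does not spell out: a sentinel score is appended to the pointer attention scores, a single softmax is taken over both, and the probability mass that lands on the sentinel acts as the gate on the vocabulary distribution. The function names, argument names, and toy numbers below are hypothetical and chosen only for illustration; real scores would come from a trained LSTM.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def pointer_sentinel_mixture(vocab_logits, context_ids, pointer_scores, sentinel_score):
    """Mix a vocabulary softmax with a pointer distribution over recent context.

    Assumption (not stated in the abstract): the gate is the softmax mass
    assigned to a sentinel score appended to the pointer scores.
    """
    p_vocab = softmax(vocab_logits)
    # Joint softmax over the context positions plus the sentinel.
    joint = softmax(list(pointer_scores) + [sentinel_score])
    gate = joint[-1]  # mass handed to the vocabulary softmax
    # Start from the gated vocabulary distribution.
    mixed = [gate * p for p in p_vocab]
    # Add pointer mass to each token that appears in the recent context.
    for token_id, weight in zip(context_ids, joint[:-1]):
        mixed[token_id] += weight
    return mixed, gate


# Toy usage: vocabulary of 4 tokens, context holds token 2 twice and token 1 once.
probs, gate = pointer_sentinel_mixture(
    vocab_logits=[0.0, 0.0, 0.0, 0.0],
    context_ids=[2, 2, 1],
    pointer_scores=[1.0, 1.0, 0.0],
    sentinel_score=0.0,
)
```

Because the pointer weights and the gate come from the same softmax, the mixed distribution sums to one without extra normalization, and a token repeated in the context (token 2 here) accumulates mass from each occurrence.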
Public/Granted literature
- US10565493B2 Pointer sentinel mixture architecture (Public/Granted day: 2020-02-18)