Efficient decoding of output sequences using adaptive early exiting
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output sequences using auto-regressive decoder neural networks. In particular, during generation, adaptive early exiting is used to reduce the time required to generate the output sequence.
Information query
Patent Agency Ranking
0/0