-
Publication No.: US20200372356A1
Publication date: 2020-11-26
Application No.: US16883772
Filing date: 2020-05-26
Applicant: Google LLC
Inventor: William Chan , Mitchell Thomas Stern , Nikita Kitaev , Kelvin Gu , Jakob D. Uszkoreit
IPC: G06N3/08 , G06N3/04 , G06F40/237
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements are arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.
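As a rough illustration of the idea in this abstract, the sketch below grows a partial concatenated sequence into a final one by repeated insertions. It is not the claimed method: the `<sep>` marker, the slot-index convention, the helper name `apply_insertions`, and the hand-written insertion rounds (which a trained model would normally predict) are all assumptions made for the example.

```python
from typing import List, Tuple

SEP = "<sep>"  # hypothetical marker between source and target elements


def apply_insertions(seq: List[str], insertions: List[Tuple[int, str]]) -> List[str]:
    """Insert (slot, element) pairs into a copy of seq.

    Slot i means "before seq[i]"; slot len(seq) means "at the end".
    Assumes at most one insertion per slot in a single round.
    """
    out = list(seq)
    # Apply right to left so the remaining slot indices stay valid.
    for slot, element in sorted(insertions, key=lambda p: p[0], reverse=True):
        out.insert(slot, element)
    return out


# System input: three source elements and zero target elements.
source = ["a", "b", "c"]
target: List[str] = []

# Partial concatenated sequence: source, separator, then the partial target.
partial = source + [SEP] + target

# Hand-written insertion rounds standing in for model predictions.
rounds = [
    [(4, "y")],            # target becomes: y
    [(4, "x"), (5, "z")],  # target becomes: x y z
]
for insertions in rounds:
    partial = apply_insertions(partial, insertions)

final_concatenated = partial
final_target = final_concatenated[final_concatenated.index(SEP) + 1:]
```

Each round may place several elements at once, which is why the number of rounds can be smaller than the number of target elements.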
-
Publication No.: US10521701B2
Publication date: 2019-12-31
Application No.: US16417190
Filing date: 2019-05-20
Applicant: Google LLC
Inventor: Noam M. Shazeer , Jakob D. Uszkoreit , Mitchell Thomas Stern
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing parallel generation of output from an autoregressive sequence to sequence model. In one aspect, a blockwise parallel decoding method takes advantage of the fact that some architectures can score sequences in sublinear time. By generating predictions for multiple time steps at once then backing off to a longest prefix validated by the scoring model, the methods can substantially improve the speed of greedy decoding without compromising performance.
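The propose-then-verify loop described in this abstract can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented implementation: `propose` and `greedy_next` are hypothetical callables, the toy `TARGET` sequence is made up, and verification is done token by token here, whereas the speedup in the abstract depends on a model that scores a whole block in one parallel pass.

```python
from typing import Callable, List, Sequence, Tuple

Proposer = Callable[[Sequence[int]], List[int]]
Scorer = Callable[[Sequence[int]], int]


def blockwise_greedy_decode(
    propose: Proposer, greedy_next: Scorer, eos: int, max_len: int
) -> Tuple[List[int], int]:
    """Return (decoded tokens, number of propose/verify iterations)."""
    out: List[int] = []
    iterations = 0
    while len(out) < max_len and (not out or out[-1] != eos):
        iterations += 1
        # Guess a block of upcoming tokens; fall back to one greedy token.
        block = propose(out) or [greedy_next(out)]
        for guess in block:
            verified = greedy_next(out)  # what plain greedy decoding would emit
            out.append(verified)
            # Stop at the first disagreement: only the validated prefix
            # (plus the scorer's own token) is kept.
            if verified != guess or verified == eos or len(out) >= max_len:
                break
    return out, iterations


# Toy setup (assumption): greedy decoding would produce TARGET, ending in eos = 0.
TARGET = [5, 6, 7, 8, 9, 0]


def greedy_next(prefix: Sequence[int]) -> int:
    return TARGET[len(prefix)]


def propose(prefix: Sequence[int]) -> List[int]:
    """Guess the next three tokens, deliberately wrong at position 4."""
    n = len(prefix)
    return [-1 if n + i == 4 else t for i, t in enumerate(TARGET[n:n + 3])]


decoded, iterations = blockwise_greedy_decode(propose, greedy_next, eos=0, max_len=10)
```

In this toy run the output matches plain greedy decoding while using fewer iterations than tokens, which is the property the abstract describes.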
-
Publication No.: US12106064B2
Publication date: 2024-10-01
Application No.: US18082357
Filing date: 2022-12-15
Applicant: Google LLC
Inventor: Jakob D. Uszkoreit , Mitchell Thomas Stern , Jamie Ryan Kiros , William Chan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating network outputs using insertion operations.
-
Publication No.: US11681954B2
Publication date: 2023-06-20
Application No.: US16682611
Filing date: 2019-11-13
Applicant: Google LLC
Inventor: Noam M. Shazeer , Jakob D. Uszkoreit , Mitchell Thomas Stern
CPC classification number: G06N20/20 , G06F18/2185 , G06F18/22 , G06N7/00 , G06N20/00 , G06V10/764 , G06V10/82
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing parallel generation of output from an autoregressive sequence to sequence model. In one aspect, a blockwise parallel decoding method takes advantage of the fact that some architectures can score sequences in sublinear time. By generating predictions for multiple time steps at once then backing off to a longest prefix validated by the scoring model, the methods can substantially improve the speed of greedy decoding without compromising performance.
-
Publication No.: US20240028893A1
Publication date: 2024-01-25
Application No.: US18321696
Filing date: 2023-05-22
Applicant: Google LLC
Inventor: William Chan , Mitchell Thomas Stern , Nikita Kitaev , Kelvin Gu , Jakob D. Uszkoreit
IPC: G06N3/08 , G06F40/237 , G06N3/04 , G06N3/084
CPC classification number: G06N3/08 , G06F40/237 , G06N3/04 , G06N3/084
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements are arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.
-
Publication No.: US20210019477A1
Publication date: 2021-01-21
Application No.: US16988551
Filing date: 2020-08-07
Applicant: Google LLC
Inventor: Jakob D. Uszkoreit , Mitchell Thomas Stern , Jamie Ryan Kiros , William Chan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating network outputs using insertion operations.
-
Publication No.: US10740571B1
Publication date: 2020-08-11
Application No.: US16751167
Filing date: 2020-01-23
Applicant: Google LLC
Inventor: Jakob D. Uszkoreit , Mitchell Thomas Stern , Jamie Ryan Kiros , William Chan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating network outputs using insertion operations.
-
Publication No.: US20200082226A1
Publication date: 2020-03-12
Application No.: US16682611
Filing date: 2019-11-13
Applicant: Google LLC
Inventor: Noam M. Shazeer , Jakob D. Uszkoreit , Mitchell Thomas Stern
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing parallel generation of output from an autoregressive sequence to sequence model. In one aspect, a blockwise parallel decoding method takes advantage of the fact that some architectures can score sequences in sublinear time. By generating predictions for multiple time steps at once then backing off to a longest prefix validated by the scoring model, the methods can substantially improve the speed of greedy decoding without compromising performance.
-
Publication No.: US12086715B2
Publication date: 2024-09-10
Application No.: US18321696
Filing date: 2023-05-22
Applicant: Google LLC
Inventor: William Chan , Mitchell Thomas Stern , Nikita Kitaev , Kelvin Gu , Jakob D. Uszkoreit
IPC: G06F40/30 , G06F40/237 , G06N3/04 , G06N3/08 , G06N3/084
CPC classification number: G06N3/08 , G06F40/237 , G06N3/04 , G06N3/084
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements are arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.
-
Publication No.: US11657277B2
Publication date: 2023-05-23
Application No.: US16883772
Filing date: 2020-05-26
Applicant: Google LLC
Inventor: William Chan , Mitchell Thomas Stern , Nikita Kitaev , Kelvin Gu , Jakob D. Uszkoreit
CPC classification number: G06N3/08 , G06F40/237 , G06N3/04 , G06N3/084
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements are arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.