-
公开(公告)号:US11625572B2
公开(公告)日:2023-04-11
申请号:US16610466
申请日:2018-05-03
Applicant: GOOGLE LLC
Inventor: Chung-Cheng Chiu , Navdeep Jaitly , John Dieterich Lawson , George Jay Tucker
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a target sequence from a source sequence. In one aspect, the system includes a recurrent neural network configured to, at each time step, receive an input for the time step and process the input to generate a progress score and a set of output scores; and a subsystem configured to, at each time step, generate the recurrent neural network input and provide the input to the recurrent neural network; determine, from the progress score, whether or not to emit a new output at the time step; and, in response to determining to emit a new output, select an output using the output scores and emit the selected output as the output at a next position in the output order.
-
公开(公告)号:US20200151544A1
公开(公告)日:2020-05-14
申请号:US16610466
申请日:2018-05-03
Applicant: GOOGLE LLC
Inventor: Chung-Cheng Chiu , Navdeep Jaitly , John Dieterich Lawson , George Jay Tucker
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a target sequence from a source sequence. In one aspect, the system includes a recurrent neural network configured to, at each time step, receive an input for the time step and process the input to generate a progress score and a set of output scores; and a subsystem configured to, at each time step, generate the recurrent neural network input and provide the input to the recurrent neural network; determine, from the progress score, whether or not to emit a new output at the time step; and, in response to determining to emit a new output, select an output using the output scores and emit the selected output as the output at a next position in the output order.
-
公开(公告)号:US20250013881A1
公开(公告)日:2025-01-09
申请号:US18766415
申请日:2024-07-08
Applicant: Google LLC
Inventor: Yingjie Miao , John Dalton Co-Reyes , Esteban Alberto Real , George Jay Tucker , Aleksandra Faust
Abstract: Methods and systems for receiving training data for a machine learning (ML) task and searching, using the training data, for an optimized component of an ML algorithm for performing the ML task are described.
-
公开(公告)号:US20210201156A1
公开(公告)日:2021-07-01
申请号:US17056640
申请日:2019-05-20
Applicant: GOOGLE LLC
Inventor: Danijar Hafner , Jacob Buckman , Honglak Lee , Eugene Brevdo , George Jay Tucker
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for sample-efficient reinforcement learning. One of the methods includes maintaining an ensemble of Q networks, an ensemble of transition models, and an ensemble of reward models; obtaining a transition; generating, using the ensemble of transition models, M trajectories; for each time step in each of the trajectories: generating, using the ensemble of reward models, N rewards for the time step, generating, using the ensemble of Q networks, L Q values for the time step, and determining, from the rewards, the Q values, and the training reward, L*N candidate target Q values for the trajectory and for the time step; for each of the time steps, combining the candidate target Q values; determining a final target Q value; and training at least one of the Q networks in the ensemble using the final target Q value.
-
-
-