-
公开(公告)号:US20240378427A1
公开(公告)日:2024-11-14
申请号:US18661499
申请日:2024-05-10
Applicant: Google LLC
Inventor: Slav Petrov , Yonghui Wu , Andrew M. Dai , David Richard So , Dmitry Lepikhin , Erica Ann Moreira , Gaurav Mishra , Jonathan Hudson Clark , Maxim Krikun , Melvin Jose Johnson Premkumar , Nan Du , Orhan Firat , Rohan Anil , Siamak Shakeri , Xavier Garcia , Yanping Huang , Yong Cheng , Yuanzhong Xu , Yujing Zhang , Zachary Alexander Nado , Eric Jun Jie Ni , Kefan Xiao , Vladimir Feinberg , Jin Young Sohn , Aurko Roy
IPC: G06N3/0475 , G06F40/284
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform any one or more of a variety of machine learning tasks. For example, the neural network can be configured as a generative neural network, e.g., an autoregressive generative neural network.
-
公开(公告)号:US20250077895A1
公开(公告)日:2025-03-06
申请号:US18826005
申请日:2024-09-05
Applicant: Google LLC
Inventor: Xinyi Wang , John Frederick Wieting , Jonathan Hudson Clark
IPC: G06N3/0985 , G06N3/045
Abstract: Methods, systems, and apparatuses, including computer programs encoded on computer storage media, for configuring a set of language model neural networks, e.g., a first large language model and a second smaller-sized language model, and performing a machine learning task on new inputs using the set of language model neural networks. Configuring the language model neural networks and performing a machine learning task can include leveraging the ability of a first large language model to follow prompt-engineered instructions and perform chain-of-thought reasoning, while also fine-tuning a second, smaller language model neural network to optimize the machine learning task performance.
-
公开(公告)号:US20240378441A1
公开(公告)日:2024-11-14
申请号:US18661447
申请日:2024-05-10
Applicant: Google LLC
Inventor: Slav Petrov , Yonghui Wu , Andrew M. Dai , David Richard So , Dmitry Lepikhin , Erica Ann Moreira , Gaurav Mishra , Jonathan Hudson Clark , Maxim Krikun , Melvin Jose Johnson Premkumar , Nan Du , Orhan Firat , Rohan Anil , Siamak Shakeri , Xavier Garcia , Yanping Huang , Yong Cheng , Yuanzhong Xu , Yujing Zhang , Zachary Alexander Nado , Eric Jun Jie Ni , Kefan Xiao , Vladimir Feinberg , Jin Young Sohn , Aurko Roy
IPC: G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform any one or more of a variety of machine learning tasks. For example, the neural network can be configured as a generative neural network, e.g., an autoregressive generative neural network.
-
-