-
公开(公告)号:US20250005354A1
公开(公告)日:2025-01-02
申请号:US18698691
申请日:2022-10-05
Applicant: Google LLC
Inventor: Neil Zeghidour , Rachid Riad , Olivier Teboul , David Grangier
IPC: G06N3/08
Abstract: A method of training a machine learning model, includes receiving training data for the machine learning model, wherein the training data comprises a plurality of batches. The method also includes applying a downsampling layer of the machine learning model to the plurality of batches of the training data to determine a stride comprising a learnable parameter for the downsampling layer. Applying the downsampling layer of the machine learning model to a batch of the training data includes projecting an input in a spatial domain to a Fourier domain, constructing a mask in the Fourier domain based on a current value of the stride and dimensions of the input, applying the mask as a low-pass filter to the projected input to produce a tensor in the Fourier domain, cropping the tensor based on the mask, and transforming the cropped tensor to the spatial domain.