Skimming data sequences using recurrent neural networks

    公开(公告)号:US11048875B2

    公开(公告)日:2021-06-29

    申请号:US16865747

    申请日:2020-05-04

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing sequential data. In one aspect, a computer-implemented method includes receiving a request to generate a system output for an input data sequence, the input data sequence including a plurality of tokens. One or more tokens may be designated as tokens to be skipped. When a token has not been designated as a token to be skipped, the token is processed using a recurrent neural network to update a current internal state of the recurrent neural network. The system output is generated from the final internal state of the recurrent neural network.

    Systems and Methods for Pretraining Image Processing Models

    公开(公告)号:US20230281400A1

    公开(公告)日:2023-09-07

    申请号:US17685774

    申请日:2022-03-03

    Applicant: Google LLC

    CPC classification number: G06F40/58 G06F40/284 G06V10/766 G06V30/10

    Abstract: Example embodiments of the present disclosure relate to systems and methods for pretraining image-processing models on weakly-supervised image-text pairs. The pretraining can include receiving a training sequence for the machine-learned image-processing model. The training sequence can include text tokens and image tokens. A prefix sequence can contain the image tokens. A remainder sequence can include a remainder set of the text tokens. The pretraining can include determining, using the prefix sequence as an input to the machine-learned image-processing model, an objective based on recovery of the remainder sequence. The pretraining can include updating one or more learnable parameters of the machine-learned image-processing model based on the objective.

    Joint Architecture And Hyper-Parameter Search For Machine Learning Models

    公开(公告)号:US20210383223A1

    公开(公告)日:2021-12-09

    申请号:US17337834

    申请日:2021-06-03

    Applicant: Google LLC

    Abstract: The present disclosure provides a differentiable joint hyper-parameter and architecture search approach, with some implementations including the idea of discretizing the continuous space into a linear combination of multiple categorical basis. One example element of the proposed approach is the use of weight sharing across all architecture- and hyper-parameters which enables it to search efficiently over the large joint search space. Experimental results on MobileNet/ResNet/EfficientNet/BERT show that the proposed systems significantly improve the accuracy by up to 2% on ImageNet and the F1 by up to 0.4 on SQuAD, with search cost comparable to training a single model. Compared to other AutoML methods, such as random search or Bayesian method, the proposed techniques can achieve better accuracy with 10× less compute cost.

Patent Agency Ranking