-
公开(公告)号:US20210125038A1
公开(公告)日:2021-04-29
申请号:US17092837
申请日:2020-11-09
Applicant: Google LLC
Inventor: Samuel Bengio , Oriol Vinyals , Alexander Toshkov Toshev , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.
-
公开(公告)号:US10991074B2
公开(公告)日:2021-04-27
申请号:US16442365
申请日:2019-06-14
Applicant: Google LLC
Inventor: Konstantinos Bousmalis , Nathan Silberman , David Martin Dohan , Dumitru Erhan , Dilip Krishnan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using an image processing neural network system. One of the systems includes a domain transformation neural network implemented by one or more computers, wherein the domain transformation neural network is configured to: receive an input image from a source domain; and process a network input comprising the input image from the source domain to generate a transformed image that is a transformation of the input image from the source domain to a target domain that is different from the source domain.
-
公开(公告)号:US10970589B2
公开(公告)日:2021-04-06
申请号:US16321189
申请日:2016-07-28
Applicant: GOOGLE LLC
Inventor: Konstantinos Bousmalis , Nathan Silberman , Dilip Krishnan , George Trigeorgis , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using an image processing neural network system. One of the system includes a shared encoder neural network implemented by one or more computers, wherein the shared encoder neural network is configured to: receive an input image from a target domain; and process the input image to generate a shared feature representation of features of the input image that are shared between images from the target domain and images from a source domain different from the target domain; and a classifier neural network implemented by the one or more computers, wherein the classifier neural network is configured to: receive the shared feature representation; and process the shared feature representation to generate a network output for the input image that characterizes the input image.
-
公开(公告)号:US20190304065A1
公开(公告)日:2019-10-03
申请号:US16442365
申请日:2019-06-14
Applicant: Google LLC
Inventor: Konstantinos Bousmalis , Nathan Silberman , David Martin Dohan , Dumitru Erhan , Dilip Krishnan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using an image processing neural network system. One of the systems includes a domain transformation neural network implemented by one or more computers, wherein the domain transformation neural network is configured to: receive an input image from a source domain; and process a network input comprising the input image from the source domain to generate a transformed image that is a transformation of the input image from the source domain to a target domain that is different from the source domain.
-
公开(公告)号:US20240296313A1
公开(公告)日:2024-09-05
申请号:US18662584
申请日:2024-05-13
Applicant: Google LLC
Inventor: Samy Bengio , Oriol Vinyals , Alexander Toshkov Toshev , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.
-
公开(公告)号:US20230239499A1
公开(公告)日:2023-07-27
申请号:US18011922
申请日:2022-05-27
Applicant: Google LLC
Inventor: Mohammad Babaeizadeh , Chelsea Breanna Finn , Dumitru Erhan , Mohammad Taghi Saffar , Sergey Vladimir Levine , Suraj Nair
IPC: H04N19/59 , H04N19/117 , H04N19/176 , H04N19/42 , G06V10/70
CPC classification number: H04N19/59 , H04N19/117 , H04N19/176 , H04N19/42 , G06V10/70
Abstract: One aspect provides a machine-learned video prediction model configured to receive and process one or more previous video frames to generate one or more predicted subsequent video frames, wherein the machine-learned video prediction model comprises a convolutional variational auto encoder, and wherein the convolutional variational auto encoder comprises an encoder portion comprising one or more encoding cells and a decoder portion comprising one or more decoding cells.
-
公开(公告)号:US11361531B2
公开(公告)日:2022-06-14
申请号:US17222782
申请日:2021-04-05
Applicant: Google LLC
Inventor: Konstantinos Bousmalis , Nathan Silberman , Dilip Krishnan , George Trigeorgis , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using an image processing neural network system. One of the system includes a shared encoder neural network implemented by one or more computers, wherein the shared encoder neural network is configured to: receive an input image from a target domain; and process the input image to generate a shared feature representation of features of the input image that are shared between images from the target domain and images from a source domain different from the target domain; and a classifier neural network implemented by the one or more computers, wherein the classifier neural network is configured to: receive the shared feature representation; and process the shared feature representation to generate a network output for the input image that characterizes the input image.
-
公开(公告)号:US12014259B2
公开(公告)日:2024-06-18
申请号:US17092837
申请日:2020-11-09
Applicant: Google LLC
Inventor: Samy Bengio , Oriol Vinyals , Alexander Toshkov Toshev , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.
-
公开(公告)号:US20200042866A1
公开(公告)日:2020-02-06
申请号:US16538712
申请日:2019-08-12
Applicant: Google LLC
Inventor: Samuel Bengio , Oriol Vinyals , Alexander Toshkov Toshev , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.
-
公开(公告)号:US10417557B2
公开(公告)日:2019-09-17
申请号:US15856453
申请日:2017-12-28
Applicant: Google LLC
Inventor: Samy Bengio , Oriol Vinyals , Alexander Toshkov Toshev , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating descriptions of input images. One of the methods includes obtaining an input image; processing the input image using a first neural network to generate an alternative representation for the input image; and processing the alternative representation for the input image using a second neural network to generate a sequence of a plurality of words in a target natural language that describes the input image.
-
-
-
-
-
-
-
-
-