-
公开(公告)号:US12190403B2
公开(公告)日:2025-01-07
申请号:US17792062
申请日:2020-01-13
Applicant: GOOGLE LLC
Inventor: Ruohan Zhan , Feng Yang , Xiyang Luo , Peyman Milanfar , Huiwen Chang , Ce Liu
Abstract: Methods, systems, and computer programs encoded on a computer storage medium, that relate to extracting digital watermarks from images, irrespective of distortions introduced into these images. Methods can include inputting a first data item into a channel encoder that can generate a first encoded data item that is greater in length than the first data item and that (1) includes the input data item and (2) new data this is redundant of the input data item. Based on the first encoded data item and a first image, an encoder model can generate a first encoded image into which the first encoded data is embedded as a digital watermark. A decoder model can decode the first encoded data item to generate a second data, which can be decoded by the channel decoder to generate data that is predicted to be the first data.
-
公开(公告)号:US20240249422A1
公开(公告)日:2024-07-25
申请号:US17907529
申请日:2021-08-05
Applicant: Google LLC
Inventor: Varun Jampani , Huiwen Chang , Kyle Sargent , Abhishek Kar , Richard Tucker , Dominik Kaeser , Brian L. Curless , David Salesin , William T. Freeman , Michael Krainin , Ce Liu
CPC classification number: G06T7/50 , G06T5/60 , G06T5/77 , G06T2207/20081
Abstract: A method includes determining, based on an image having an initial viewpoint, a depth image, and determining a foreground visibility map including visibility values that are inversely proportional to a depth gradient of the depth image. The method also includes determining, based on the depth image, a background disocclusion mask indicating a likelihood that pixel of the image will be disoccluded by a viewpoint adjustment. The method additionally includes generating, based on the image, the depth image, and the background disocclusion mask, an inpainted image and an inpainted depth image. The method further includes generating, based on the depth image and the inpainted depth image, respectively, a first three-dimensional (3D) representation of the image and a second 3D representation of the inpainted image, and generating a modified image having an adjusted viewpoint by combining the first and second 3D representation based on the foreground visibility map.
-
公开(公告)号:US20240020788A1
公开(公告)日:2024-01-18
申请号:US18256783
申请日:2021-03-24
Applicant: Google LLC
Inventor: Xiyang Luo , Feng Yang , Ce Liu , Huiwen Chang , Peyman Milanfar , Yinxiao Li
IPC: G06T1/00
CPC classification number: G06T1/0085 , G06T2201/0083
Abstract: Systems and methods of the present disclosure are directed to a computing system. The computing system can obtain a message vector and video data comprising a plurality of video frames. The computing system can process the input video with a transformation portion of a machine-learned watermark encoding model to obtain a three-dimensional feature encoding of the input video. The computing system can process the three-dimensional feature encoding of the input video and the message vector with an embedding portion of the machine-learned watermark encoding model to obtain spatial-temporal watermark encoding data descriptive of the message vector. The computing system can generate encoded video data comprising a plurality of encoded video frames, wherein at least one of the plurality of encoded video frames includes the spatial-temporal watermark encoding data.
-
公开(公告)号:US20230325961A1
公开(公告)日:2023-10-12
申请号:US18008544
申请日:2021-06-21
Applicant: Google LLC
Inventor: Dake He , Tianhao Zhang , Elnaz Barshan Tashnizi , Xiyang Luo , Huiwen Chang , Feng Yang , Ryan Matthew Haggarty
CPC classification number: G06T1/005 , G06T7/11 , G06T3/40 , G06T2201/0083 , G06T2207/20081 , G06T2201/0065
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining a visually imperceptible or a visually perceptible watermark and outputting a result based on the determination. A watermark decoder receives an input image. The watermark decoder applies a decoder machine learning model to decode a watermarks at different levels of zoom. The water mark decoder determines whether a watermark was decoded to obtain a decoded watermark. The watermark decoder outputs a result based on the determination whether the watermark was decoded through application of the decoder machine learning model to the input image that includes outputting a zoomed output decoded through application of the decoder machine learning model to the input image.
-
公开(公告)号:US20240355017A1
公开(公告)日:2024-10-24
申请号:US18302508
申请日:2023-04-18
Applicant: Google LLC
Inventor: Shiran Elyahu Zada , Bahjat Kawar , Oran Lang , Omer Tov , Huiwen Chang , Tali Dekel , Inbar Mosseri
CPC classification number: G06T11/60 , G06T3/4053
Abstract: Methods and systems for editing an image are disclosed herein. The method includes receiving an input image and a target text, the target text indicating a desired edit for the input image and obtaining, by the computing system, a target text embedding based on the target text. The method also includes obtaining, by the computing system, an optimized text embedding based on the target text embedding and the input image and fine-tuning, by the computing system, a diffusion model based on the optimized text embedding. The method can further include interpolating, by the computing system, the target text embedding and the optimized text embedding to obtain an interpolated embedding and generating, by the computing system, an edited image including the desired edit using the diffusion model based on the input image and the interpolated embedding.
-
公开(公告)号:US20240311960A1
公开(公告)日:2024-09-19
申请号:US18028063
申请日:2022-05-20
Applicant: GOOGLE LLC
Inventor: Xiao Feng , Yuanzhen LI , Yihui Wang , Omer Gimenez , Han Xu , Mengjie Wang , Huiwen Chang , AJ Maschinot , Dilip Krishnan
IPC: G06T3/4046
CPC classification number: G06T3/4046
Abstract: To adjust an aspect ratio of an image to match the aspect ratio of a display area for presenting the image, a computing device receives an image having a first aspect ratio, and obtains a second aspect ratio for a display area of a display in which to present the image, where the second aspect ratio is different from the first aspect ratio. The computing device extends the image to include one or more additional features which were not included in the image. Additionally, the computing device automatically crops the extended image around an identified region of interest by selecting a portion of the extended image that has an aspect ratio which matches the second aspect ratio of the display area, and provides the cropped image for presentation within the display area of the display.
-
公开(公告)号:US20240013497A1
公开(公告)日:2024-01-11
申请号:US18252118
申请日:2020-12-21
Applicant: Google LLC
Inventor: Deqing Sun , Varun Jampani , Gengshan Yang , Daniel Vlasic , Huiwen Chang , Forrester H. Cole , Ce Liu , William Tafel Freeman
CPC classification number: G06T19/20 , G06T7/55 , G06T17/20 , G06T7/20 , G06T7/40 , G06T2207/30244 , G06T2207/10016 , G06T2207/20084 , G06T2219/2021 , G06T2207/20081
Abstract: A computing system and method can be used to render a 3D shape from one or more images. In particular, the present disclosure provides a general pipeline for learning articulated shape reconstruction from images (LASR). The pipeline can reconstruct rigid or nonrigid 3D shapes. In particular, the pipeline can automatically decompose non-rigidly deforming shapes into rigid motions near rigid-bones. This pipeline incorporates an analysis-by-synthesis strategy and forward-renders silhouette, optical flow, and color images which can be compared against the video observations to adjust the internal parameters of the model. By inverting a rendering pipeline and incorporating optical flow, the pipeline can recover a mesh of a 3D model from the one or more images input by a user.
-
公开(公告)号:US20230325959A1
公开(公告)日:2023-10-12
申请号:US17926213
申请日:2021-06-21
Applicant: Google LLC
Inventor: Dake He , Tianhao Zhang , Elnaz Barshan Tashnizi , Xiyang Luo , Huiwen Chang , Feng Yang , Ryan Matthew Haggarty
IPC: G06T1/00 , G06T3/40 , G06T5/20 , G06V10/764
CPC classification number: G06T1/0021 , G06T3/40 , G06T5/20 , G06V10/764 , G06T2201/0065 , G06T2207/20081
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting and decoding a visually imperceptible or perceptible watermark. A watermark detection apparatus determines whether the particular image includes a visually imperceptible or perceptible watermark using detector a machine learning model. If the watermark detection apparatus detects a watermark, the particular image is routed to a watermark decoder. If the watermark detection apparatus cannot detect a watermark in the particular image, the particular image is filtered from further processing. The watermark decoder decodes the visually imperceptible or perceptible watermark detected in the particular image. After decoding, an item depicted in the particular image is validated based data extracted from the decoded visually imperceptible or perceptible watermark.
-
公开(公告)号:US20230111326A1
公开(公告)日:2023-04-13
申请号:US17792062
申请日:2020-01-13
Applicant: GOOGLE LLC
Inventor: Ruohan Zhan , Feng Yang , Xiyang Luo , Peyman Milanfar , Huiwen Chang , Ce Liu
Abstract: Methods, systems, and computer programs encoded on a computer storage medium, that relate to extracting digital watermarks from images, irrespective of distortions introduced into these images. Methods can include inputting a first data item into a channel encoder that can generate a first encoded data item that is greater in length than the first data item and that (1) includes the input data item and (2) new data this is redundant of the input data item. Based on the first encoded data item and a first image, an encoder model can generate a first encoded image into which the first encoded data is embedded as a digital watermark. A decoder model can decode the first encoded data item to generate a second data, which can be decoded by the channel decoder to generate data that is predicted to be the first data.
-
-
-
-
-
-
-
-