-
1.
Publication No.: US11726637B1
Publication date: 2023-08-15
Application No.: US17978086
Filing date: 2022-10-31
Applicant: Google LLC
Inventor: Matthias Grundmann, Jokubas Zukerman, Marco Paglia, Kenneth Conley, Karthik Raveendran, Reed Morse
IPC: G06F3/0482, G11B27/029, G06F3/0485, G06F3/04883, H04N5/262, G11B27/028
CPC classification number: G06F3/0482, G06F3/0485, G06F3/04883, G11B27/028, G11B27/029, H04N5/2628
Abstract: The technology disclosed herein includes a user interface for viewing and combining media items into a video. An example method includes presenting a user interface that displays media items in a first portion of the user interface; receiving user input in the first portion that comprises a selection of a first media item; upon receiving the user input, adding the first media item to a set of selected media items and updating the user interface to comprise a control element and a second portion, wherein the first and second portions are concurrently displayed and are each scrollable along a different axis, and the second portion displays image content of the set and the control element enables a user to initiate the creation of the video based on the set of selected media items; and creating the video based on video content of the set of selected media items.
-
2.
Publication No.: US20210133508A1
Publication date: 2021-05-06
Application No.: US16668303
Filing date: 2019-10-30
Applicant: Google LLC
Inventor: Valentin Bazarevsky, Yury Kartynnik, Andrei Vakunov, Karthik Raveendran, Matthias Grundmann
Abstract: A computing system is disclosed including a convolutional neural network configured to receive an input that describes a facial image and generate a facial object recognition output that describes one or more facial feature locations with respect to the facial image. The convolutional neural network can include a plurality of convolutional blocks. At least one of the convolutional blocks can include one or more separable convolutional layers configured to apply a depthwise convolution and a pointwise convolution during processing of an input to generate an output. The depthwise convolution can be applied with a kernel size that is greater than 3×3. At least one of the convolutional blocks can include a residual shortcut connection from its input to its output.
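Illustrative note: the convolutional block described in this abstract (a depthwise convolution with a kernel larger than 3×3, followed by a pointwise convolution, with a residual shortcut from input to output) might be sketched as below. This is a minimal pure-Python sketch under stated assumptions, not the patented implementation: the function names, the zero padding, the 5×5 kernel size, and the equal input and output channel counts are all hypothetical choices made for illustration.

```python
# Illustrative sketch only: names, padding, and shapes are assumptions,
# not details taken from the patent.

def depthwise_conv(x, kernels):
    """Apply one k x k kernel per channel (zero padding, same output size).

    x: list of C channels, each an H x W list of lists.
    kernels: list of C kernels, each k x k with k odd (e.g. 5 for 5x5).
    Uses the cross-correlation form that is common in deep learning.
    """
    out = []
    for channel, kernel in zip(x, kernels):
        h, w, k = len(channel), len(channel[0]), len(kernel)
        pad = k // 2
        result = [[0.0] * w for _ in range(h)]
        for i in range(h):
            for j in range(w):
                acc = 0.0
                for di in range(k):
                    for dj in range(k):
                        ii, jj = i + di - pad, j + dj - pad
                        if 0 <= ii < h and 0 <= jj < w:
                            acc += channel[ii][jj] * kernel[di][dj]
                result[i][j] = acc
        out.append(result)
    return out


def pointwise_conv(x, weights):
    """1 x 1 convolution: mix channels at each pixel.

    weights[o][c] is the weight from input channel c to output channel o.
    """
    h, w = len(x[0]), len(x[0][0])
    return [
        [[sum(row[c] * x[c][i][j] for c in range(len(x))) for j in range(w)]
         for i in range(h)]
        for row in weights
    ]


def separable_residual_block(x, kernels, weights):
    """Depthwise then pointwise convolution, plus a shortcut from input to output."""
    y = pointwise_conv(depthwise_conv(x, kernels), weights)
    # The shortcut adds the input element-wise, so this sketch assumes
    # the output has the same number of channels as the input.
    h, w = len(x[0]), len(x[0][0])
    return [
        [[y[c][i][j] + x[c][i][j] for j in range(w)] for i in range(h)]
        for c in range(len(x))
    ]
```

For C channels and a k×k kernel, the depthwise step uses k·k·C weights and the pointwise step C·C, compared with k·k·C·C for a full convolution, which is why enlarging the depthwise kernel beyond 3×3 adds relatively little cost.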
-
3.
Publication No.: US20250138704A1
Publication date: 2025-05-01
Application No.: US19011541
Filing date: 2025-01-06
Applicant: Google LLC
Inventor: Matthias Grundmann, Jokubas Zukerman, Marco Paglia, Kenneth Conley, Karthik Raveendran, Reed Morse
IPC: G06F3/0482, G06F3/0485, G06F3/04883, G11B27/028, G11B27/029, H04N5/262
Abstract: An example method includes presenting a user interface facilitating a creation of a video from an image associated with a first media item of a plurality of media items, wherein the first media item comprises the image and a video clip that are captured concurrently, receiving user input via the user interface, wherein the user input comprises a selection of a selectable control element presented in the user interface, and upon receiving the user input, presenting the video clip of the first media item in the user interface, wherein the video clip of the first media item is played in the user interface and comprises video content from before and after the image is captured.
-
4.
Publication No.: US20230384911A1
Publication date: 2023-11-30
Application No.: US18233823
Filing date: 2023-08-14
Applicant: Google LLC
Inventor: Matthias Grundmann, Jokubas Zukerman, Marco Paglia, Kenneth Conley, Karthik Raveendran, Reed Morse
IPC: G06F3/0482, G11B27/029, G06F3/0485, G06F3/04883, H04N5/262, G11B27/028
CPC classification number: G06F3/0482, G11B27/029, G06F3/0485, G06F3/04883, H04N5/2628, G11B27/028
Abstract: The technology disclosed herein includes a user interface for viewing and combining media items into a video. An example method includes presenting a user interface that displays media items in a first portion of the user interface; receiving user input in the first portion that comprises a selection of a first media item; upon receiving the user input, adding the first media item to a set of selected media items in a second portion of the user interface, and presenting a selectable control element in the second portion of the user interface, wherein the control element enables a user to initiate an operation pertaining to the creation of the video based on the set of selected media items, and creating the video based on video content of the set of selected media items.
-
5.
Publication No.: US11487407B1
Publication date: 2022-11-01
Application No.: US17536350
Filing date: 2021-11-29
Applicant: Google LLC
Inventor: Matthias Grundmann, Jokubas Zukerman, Marco Paglia, Kenneth Conley, Karthik Raveendran, Reed Morse
IPC: G06F3/0482, G06F3/04883, H04N5/262, G06F3/0485, G11B27/028, G11B27/029
Abstract: The technology disclosed herein includes a user interface for viewing and combining media items into a video. An example method includes presenting a user interface that displays media items in a first portion of the user interface; receiving user input in the first portion that comprises a selection of a first media item; upon receiving the user input, adding the first media item to a set of selected media items and updating the user interface to comprise a control element and a second portion, wherein the first and second portions are concurrently displayed and are each scrollable along a different axis, and the second portion displays image content of the set and the control element enables a user to initiate the creation of the video based on the set of selected media items; and creating the video based on video content of the set of selected media items.
-
6.
Publication No.: US11449714B2
Publication date: 2022-09-20
Application No.: US16668303
Filing date: 2019-10-30
Applicant: Google LLC
Inventor: Valentin Bazarevsky, Yury Kartynnik, Andrei Vakunov, Karthik Raveendran, Matthias Grundmann
Abstract: A computing system is disclosed including a convolutional neural network configured to receive an input that describes a facial image and generate a facial object recognition output that describes one or more facial feature locations with respect to the facial image. The convolutional neural network can include a plurality of convolutional blocks. At least one of the convolutional blocks can include one or more separable convolutional layers configured to apply a depthwise convolution and a pointwise convolution during processing of an input to generate an output. The depthwise convolution can be applied with a kernel size that is greater than 3×3. At least one of the convolutional blocks can include a residual shortcut connection from its input to its output.
-
7.
Publication No.: US10191920B1
Publication date: 2019-01-29
Application No.: US14833887
Filing date: 2015-08-24
Applicant: Google LLC
Inventor: Matthias Grundmann, Karthik Raveendran, Daniel Castro Chin
IPC: G06F17/30, G06F3/0484, G06F3/0482, G06K9/00, G06T11/60
Abstract: A computing device is described that includes a camera configured to capture an image of a user of the computing device, a memory configured to store the image of the user, at least one processor, and at least one module. The at least one module is operable by the at least one processor to obtain, from the memory, an indication of the image of the user of the computing device, determine, based on the image, a first emotion classification tag, and identify, based on the first emotion classification tag, at least one graphical image from a database of pre-classified images that has an emotional classification that is associated with the first emotion classification tag. The at least one module is further operable by the at least one processor to output, for display, the at least one graphical image.
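Illustrative note: the flow in this abstract (derive an emotion classification tag from the user's image, then look up graphical images pre-classified with a matching emotion) might be sketched as below. This is a hedged sketch, not the patented implementation: the `IMAGE_DB` contents, the function name `images_for_user_photo`, and the `classify` callback are hypothetical, and the emotion classifier itself is out of scope and passed in as a stub.

```python
from typing import Callable, Dict, List

# Hypothetical database of pre-classified graphical images:
# image name -> emotional classification.
IMAGE_DB: Dict[str, str] = {
    "sticker_smile.png": "happy",
    "sticker_party.png": "happy",
    "sticker_tear.png": "sad",
}


def images_for_user_photo(
    photo: bytes,
    classify: Callable[[bytes], str],
    db: Dict[str, str],
) -> List[str]:
    """Return the names of images whose classification matches the photo's tag.

    classify stands in for whatever model maps a captured image to an
    emotion classification tag; it is an assumption of this sketch.
    """
    tag = classify(photo)
    # Sort so the result is deterministic regardless of dictionary order.
    return sorted(name for name, emotion in db.items() if emotion == tag)
```

A caller would pass the captured image bytes and a real classifier; the returned names are the candidates to output for display.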
-
8.
Publication No.: US12189921B2
Publication date: 2025-01-07
Application No.: US18233823
Filing date: 2023-08-14
Applicant: Google LLC
Inventor: Matthias Grundmann, Jokubas Zukerman, Marco Paglia, Kenneth Conley, Karthik Raveendran, Reed Morse
IPC: G06F3/0482, G06F3/0485, G06F3/04883, G11B27/028, G11B27/029, H04N5/262
Abstract: The technology disclosed herein includes a user interface for viewing and combining media items into a video. An example method includes presenting a user interface that displays media items in a first portion of the user interface; receiving user input in the first portion that comprises a selection of a first media item; upon receiving the user input, adding the first media item to a set of selected media items in a second portion of the user interface, and presenting a selectable control element in the second portion of the user interface, wherein the control element enables a user to initiate an operation pertaining to the creation of the video based on the set of selected media items, and creating the video based on video content of the set of selected media items.
-
9.
Publication No.: US11694087B2
Publication date: 2023-07-04
Application No.: US17947816
Filing date: 2022-09-19
Applicant: Google LLC
Inventor: Valentin Bazarevsky, Yury Kartynnik, Andrei Vakunov, Karthik Raveendran, Matthias Grundmann
IPC: G06N20/10, G06N3/084, G06N3/04, G06N3/08, G06V40/16, G06F18/21, G06V10/764, G06V10/82, G06V10/44
CPC classification number: G06N3/084, G06F18/217, G06N3/04, G06N3/08, G06V10/454, G06V10/764, G06V10/82, G06V40/165, G06V40/171
Abstract: A computing system is disclosed including a convolutional neural network configured to receive an input that describes a facial image and generate a facial object recognition output that describes one or more facial feature locations with respect to the facial image. The convolutional neural network can include a plurality of convolutional blocks. At least one of the convolutional blocks can include one or more separable convolutional layers configured to apply a depthwise convolution and a pointwise convolution during processing of an input to generate an output. The depthwise convolution can be applied with a kernel size that is greater than 3×3. At least one of the convolutional blocks can include a residual shortcut connection from its input to its output.
-
10.
Publication No.: US20230017459A1
Publication date: 2023-01-19
Application No.: US17947816
Filing date: 2022-09-19
Applicant: Google LLC
Inventor: Valentin Bazarevsky, Yury Kartynnik, Andrei Vakunov, Karthik Raveendran, Matthias Grundmann
Abstract: A computing system is disclosed including a convolutional neural network configured to receive an input that describes a facial image and generate a facial object recognition output that describes one or more facial feature locations with respect to the facial image. The convolutional neural network can include a plurality of convolutional blocks. At least one of the convolutional blocks can include one or more separable convolutional layers configured to apply a depthwise convolution and a pointwise convolution during processing of an input to generate an output. The depthwise convolution can be applied with a kernel size that is greater than 3×3. At least one of the convolutional blocks can include a residual shortcut connection from its input to its output.