-
公开(公告)号:US10936877B2
公开(公告)日:2021-03-02
申请号:US16715498
申请日:2019-12-16
Applicant: Google LLC
Inventor: Filip Pavetic , Matthias Konrad , Roman Vorushin
IPC: G06K9/00 , G06F16/783
Abstract: Methods, systems, and media for analyzing spherical video content are provided. More particularly, methods, systems, and media for detecting two-dimensional videos placed on a sphere in abusive spherical video content by tiling the sphere are provided. In some embodiments, the method comprises: receiving an identifier of a spherical video content item, wherein the spherical video content item has a plurality of views and wherein the spherical video content item is encoded into a plurality of two-dimensional video frames; selecting a first frame of the plurality of two-dimensional video frames associated with the spherical video content item; dividing the first frame into a plurality of tiles spanning the first frame of the spherical video content item; calculating, for each tile of the plurality of tiles, a probability that the tile includes content of a particular type of content; determining, for each tile, whether the probability exceeds a predetermined threshold; in response to determining, for a particular tile, that the probability exceeds the predetermined threshold, causing the content associated with the tile to be analyzed using a video fingerprinting technique; and in response to determining, using the video fingerprinting technique, that the content associated with the tile matches a reference content item of a plurality of reference content items, generating an indication of the match in association with the identifier of the spherical video content item.
-
12.
公开(公告)号:US20190354767A1
公开(公告)日:2019-11-21
申请号:US16418399
申请日:2019-05-21
Applicant: Google LLC
Inventor: Filip Pavetic
Abstract: Methods, systems, and media for analyzing spherical video content are provided. More particularly, methods, systems, and media for detecting two-dimensional videos placed on a sphere in abusive spherical video content are provided. In some embodiments, the method comprises: receiving an identifier of a spherical video content item, wherein the spherical video content item has a plurality of views; selecting a first frame of the spherical video content item; projecting the first frame of the spherical video content item to a two-dimensional region using a projection defined by a mapping according to which neighboring points of the first frame are mapped to respective neighboring points of the region, and one or more contiguous portions of the frame are each mapped to a corresponding plurality of contiguous portions of the region; identifying an area within the region which meets a criterion indicative of the region having a likelihood above a threshold of including a particular type of content; in response to identifying the area within the region which meets the criterion, analyzing the identified area of the region using a video fingerprinting technique; and, in response to determining that content associated with the identified area of the region matches a reference content item of a plurality of reference content items using the video fingerprinting technique, generating an indication of the match in association with the identifier of the spherical video content item.
-
公开(公告)号:US20240169715A1
公开(公告)日:2024-05-23
申请号:US18518075
申请日:2023-11-22
Applicant: GOOGLE LLC
Inventor: Lucas Klaus Beyer , Pavel Izmailov , Simon Kornblith , Alexander Kolesnikov , Mathilde Caron , Xiaohua Zhai , Matthias Johannes Lorenz Minderer , Ibrahim Alabdulmohsin , Michael Tobias Tschannen , Filip Pavetic
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network that is configured to process an input image to generate a network output for the input image. In one aspect, a method comprises, at each of a plurality of training steps: obtaining a plurality of training images for the training step; obtaining, for each of the plurality of training images, a respective target output; and selecting, from a plurality of image patch generation schemes, an image patch generation scheme for the training step, wherein, given an input image, each of the plurality of image patch generation schemes generates a different number of patches of the input image, and wherein each patch comprises a respective subset of the pixels of the input image.
-
14.
公开(公告)号:US20210374418A1
公开(公告)日:2021-12-02
申请号:US17403804
申请日:2021-08-16
Applicant: Google LLC
Inventor: Filip Pavetic , King Hong Thomas Leung , Dmitrii Tochilkin
Abstract: A system and methods are disclosed for using a trained machine learning model to identify constituent images within composite images. A method may include providing pixel data of a first image as input to the trained machine learning model, obtaining one or more outputs from the trained machine learning model, and extracting, from the one or more outputs, an indication that the first image is a composite image that includes a constituent image, wherein at least a portion of the constituent image is in a spatial area of the first image.
-
公开(公告)号:US10509965B2
公开(公告)日:2019-12-17
申请号:US15380535
申请日:2016-12-15
Applicant: Google LLC
Inventor: Filip Pavetic , Matthias Konrad , Roman Vorushin
IPC: G06K9/00 , G06F16/783
Abstract: Methods, systems, and media for analyzing spherical video content are provided. More particularly, methods, systems, and media for detecting two-dimensional videos placed on a sphere in abusive spherical video content by tiling the sphere are provided. In some embodiments, the method comprises: receiving an identifier of a spherical video content item, wherein the spherical video content item has a plurality of views and wherein the spherical video content item is encoded into a plurality of two-dimensional video frames; selecting a first frame of the plurality of two-dimensional video frames associated with the spherical video content item; dividing the first frame into a plurality of tiles spanning the first frame of the spherical video content item; calculating, for each tile of the plurality of tiles, a probability that the tile includes content of a particular type of content; determining, for each tile, whether the probability exceeds a predetermined threshold; in response to determining, for a particular tile, that the probability exceeds the predetermined threshold, causing the content associated with the tile to be analyzed using a video fingerprinting technique; and in response to determining, using the video fingerprinting technique, that the content associated with the tile matches a reference content item of a plurality of reference content items, generating an indication of the match in association with the identifier of the spherical video content item.
-
公开(公告)号:US10452891B2
公开(公告)日:2019-10-22
申请号:US15966713
申请日:2018-04-30
Applicant: Google LLC
Inventor: Pedro Gonnet Anders , Filip Pavetic
IPC: G06K9/00
Abstract: A method of identifying similar media items is described. The method include identifying a first multiplicity of fingerprints representative of content segments of variable duration for a first media item and a second multiplicity of fingerprints representative of content segments of variable duration for a second media item. The method further includes comparing, by a processing device, a first group of the first multiplicity of fingerprints to a second group of the second multiplicity of fingerprints to generate a first similarity score indicative of a similarity between the first group of fingerprints and the second group of fingerprints. The method also includes determining an alignment score for the first multiplicity of fingerprints and the second multiplicity of fingerprints using the first similarity score.
-
公开(公告)号:US20180247104A1
公开(公告)日:2018-08-30
申请号:US15966713
申请日:2018-04-30
Applicant: Google LLC
Inventor: Pedro Gonnet Anders , Filip Pavetic
IPC: G06K9/00
Abstract: A method of identifying similar media items is described. The method include identifying a first multiplicity of fingerprints representative of content segments of variable duration for a first media item and a second multiplicity of fingerprints representative of content segments of variable duration for a second media item. The method further includes comparing, by a processing device, a first group of the first multiplicity of fingerprints to a second group of the second multiplicity of fingerprints to generate a first similarity score indicative of a similarity between the first group of fingerprints and the second group of fingerprints. The method also includes determining an alignment score for the first multiplicity of fingerprints and the second multiplicity of fingerprints using the first similarity score.
-
18.
公开(公告)号:US20180232845A1
公开(公告)日:2018-08-16
申请号:US15949424
申请日:2018-04-10
Applicant: GOOGLE LLC
Inventor: Filip Pavetic , Matthias Rochus Konrad , Hanna Pasula
CPC classification number: G06T1/0021 , G06F21/10 , G06K9/00744 , G06K9/4604 , G06K9/6202 , G06T7/11 , G06T7/20 , G06T7/90 , G06T2207/10016 , G06T2207/10024 , H04N21/23418 , H04N21/2541 , H04N21/2743 , H04N21/8352
Abstract: Fingerprinting a video including video frames is disclosed. A method includes receiving the video, generating sub-images, generating sub-fingerprints for the video using the sub-images, and matching the video to a reference video using the sub-fingerprints. Generating sub-images includes, for a video frame of some of the video frames, generating a binary image for the video frame, identifying a first region of the binary image, and identifying a sub-image of the video frame that is co-extensive and co-located with the first region of the binary image. A pixel of the video frame is identified in the binary image by a first value or a second value where the first value indicates a motion pixel and the second value indicates a still pixel. The first region includes more of the first value than the second value, and the first region is indicative of a motion in the video frame.
-
19.
公开(公告)号:US20180176619A1
公开(公告)日:2018-06-21
申请号:US15837583
申请日:2017-12-11
Applicant: Google LLC
Inventor: Valerii Zamaraiev , Filip Pavetic
IPC: H04N21/254 , H04N21/222 , H04N21/4627 , H04N13/00 , H04N19/46
Abstract: Methods, systems, and media for identifying content in stereoscopic videos and, more particularly, for detecting abusive stereoscopic videos by generating fingerprints for multiple portions of a video frame are provided. The method comprises: receiving, from a user device, a video content item for uploading to a content provider; selecting a frame from a plurality of frames of the video content item for generating one or more fingerprints corresponding to the video content item; generating a first fingerprint corresponding to the selected frame, a second fingerprint corresponding to a first encoded portion of the selected frame, and a third fingerprint corresponding to a second encoded portion of the selected frame; comparing each of the first fingerprint, the second fingerprint, and the third fingerprint to a plurality of reference fingerprints corresponding to reference video content items; determining whether at least one of the first fingerprint, the second fingerprint, and the third fingerprint match a reference fingerprint of the plurality of reference fingerprints; and, in response to determining that at least one of the first fingerprint, the second fingerprint, and the third fingerprint match the reference fingerprint, causing an indication of the match to be presented on the user device.
-
公开(公告)号:US20240054102A1
公开(公告)日:2024-02-15
申请号:US17886860
申请日:2022-08-12
Applicant: Google LLC
Inventor: Filip Pavetic , David Simcha , Alexander-Teodor Voicu , Felix Chern , Philip Wenjie Sun , Ruiqi Guo , Hanna Maria Pasula , Martin Ulrich Seiler
CPC classification number: G06F16/13 , G06F3/0649 , G06F3/0611 , G06F3/0685
Abstract: Provided is a scalable and cost-efficient storage architecture for large-scale datasets, such as Internet-scale datasets that include very large numbers (e.g., billions) of data elements. More particularly, provided is a bifurcated storage architecture that includes a first data index stored by a first set of storage media and a second data index stored by a second set of storage media, where the first set of storage media has a lower latency than the second set of storage media.
-
-
-
-
-
-
-
-
-