-
Publication number: US11810353B2
Publication date: 2023-11-07
Application number: US17163841
Application date: 2021-02-01
Applicant: Google LLC
Inventor: Filip Pavetic
CPC classification number: G06V20/46 , G06F21/16 , G06T3/0031 , G06V20/48 , G06F2221/0737
Abstract: Methods, systems, and media for analyzing spherical video content are provided. More particularly, methods, systems, and media for detecting two-dimensional videos placed on a sphere in abusive spherical video content are provided. In some embodiments, the method comprises: receiving an identifier of a spherical video content item, wherein the spherical video content item has a plurality of views; selecting a first frame of the spherical video content item; projecting the first frame of the spherical video content item to a two-dimensional region using a projection defined by a mapping according to which neighboring points of the first frame are mapped to respective neighboring points of the region, and one or more contiguous portions of the frame are each mapped to a corresponding plurality of contiguous portions of the region; identifying an area within the region which meets a criterion indicative of the region having a likelihood above a threshold of including a particular type of content; in response to identifying the area within the region which meets the criterion, analyzing the identified area of the region using a video fingerprinting technique; and, in response to determining that content associated with the identified area of the region matches a reference content item of a plurality of reference content items using the video fingerprinting technique, generating an indication of the match in association with the identifier of the spherical video content item.
-
Publication number: US11734908B2
Publication date: 2023-08-22
Application number: US17228750
Application date: 2021-04-13
Applicant: GOOGLE LLC
Inventor: Mayank Kandpal , Bakhodir Ashirmatov , Filip Pavetic
CPC classification number: G06V10/25 , G06N3/08 , G06V10/26 , G06V10/462
Abstract: Generating a training image for use in training a region-of-interest detector that is trained to detect regions-of-interest within images includes generating a closed geometric shape; filling the closed geometric shape with a filler to obtain a blob; overlaying the blob on an edge of an image to obtain the training image, where the image includes a region-of-interest and a background region, and where the edge separates the region-of-interest from the background region; and using the training image to train the region-of-interest detector to detect a boundary of the region-of-interest. An input to the region-of-interest detector in a training phase includes the training image and a first indication of coordinates of the region-of-interest in the training image. An output of the region-of-interest detector includes a second indication of an area of the training image and a probability of the area of the training image being the region-of-interest.
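The blob-overlay step described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes a grayscale image stored as a 2D list, a rectangular region-of-interest, a circle as the closed geometric shape, and a constant value as the filler. The names `edge_points` and `make_training_example` are hypothetical.

```python
import random


def edge_points(roi):
    """Return every (row, col) pixel on the boundary of a rectangular ROI.

    roi is (top, left, bottom, right) with inclusive bounds (an assumption).
    """
    top, left, bottom, right = roi
    points = []
    for col in range(left, right + 1):
        points.append((top, col))
        points.append((bottom, col))
    for row in range(top, bottom + 1):
        points.append((row, left))
        points.append((row, right))
    return points


def make_training_example(image, roi, radius, fill_value, rng):
    """Overlay a filled circle (the "blob") centred on a random ROI edge pixel.

    Returns the training image together with the ROI coordinates, which play
    the role of the "first indication" given to the detector during training.
    """
    center_row, center_col = rng.choice(edge_points(roi))
    out = [row[:] for row in image]  # copy so the source image is untouched
    for r in range(len(out)):
        for c in range(len(out[0])):
            inside = (r - center_row) ** 2 + (c - center_col) ** 2 <= radius ** 2
            if inside:
                out[r][c] = fill_value  # constant filler, assumed for simplicity
    return {"image": out, "roi": roi, "blob_center": (center_row, center_col)}


# Usage: a 10x10 blank image whose ROI spans rows and columns 2..7.
blank = [[0] * 10 for _ in range(10)]
example = make_training_example(blank, (2, 2, 7, 7), 1, 9, random.Random(0))
```

Placing the blob on the edge hides part of the boundary between the region-of-interest and the background, which is the condition under which the abstract says the detector is trained to find that boundary.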
-
Publication number: US11093751B2
Publication date: 2021-08-17
Application number: US16813686
Application date: 2020-03-09
Applicant: Google LLC
Inventor: Filip Pavetic , King Hong Thomas Leung , Dmitrii Tochilkin
Abstract: A system and methods are disclosed for using a trained machine learning model to identify constituent images within composite images. A method may include providing pixel data of a first image as input to the trained machine learning model, obtaining one or more outputs from the trained machine learning model, and extracting, from the one or more outputs, a level of confidence that (i) the first image is a composite image that includes a constituent image, and (ii) at least a portion of the constituent image is in a particular spatial area of the first image.
-
Publication number: US20200162770A1
Publication date: 2020-05-21
Application number: US16615366
Application date: 2017-12-13
Applicant: Google LLC
Inventor: Filip Pavetic , Hanna Pasula
IPC: H04N21/234 , G06F21/10 , G06F16/783
Abstract: Methods, systems, and media for detecting and transforming rotated video content items are provided. The method comprises: receiving a video having a plurality of frames, wherein the video is associated with a first fingerprint; determining a rotation value associated with at least a portion of the plurality of frames to obtain a plurality of rotation values; determining an overall rotation value associated with the video based on a portion of the plurality of rotation values; determining whether at least one additional fingerprint of the video should be generated based on the overall rotation value; in response to determining that the at least one additional fingerprint of the video should be generated based on the overall rotation value, selecting a rotation transform based on the overall rotation value that rotates the plurality of frames of the video to an initial rotation position; applying the rotation transform to at least a portion of the plurality of frames of the video; generating a second fingerprint that represents the transformed video; and comparing the second fingerprint of the transformed video to a plurality of fingerprints associated with reference videos to determine whether the video corresponding to the transformed video matches one of the reference videos.
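The rotation-handling steps in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the claimed method: per-frame rotation values are assumed to be multiples of 90 degrees clockwise, the overall value is taken by majority vote (the abstract does not say how it is derived), and frames are 2D lists. The function names are hypothetical.

```python
from collections import Counter


def overall_rotation(per_frame_rotations):
    """Pick the most common per-frame rotation value (majority vote, assumed)."""
    return Counter(per_frame_rotations).most_common(1)[0][0]


def rotate_cw(frame, degrees):
    """Rotate a 2D list clockwise by a multiple of 90 degrees."""
    for _ in range((degrees // 90) % 4):
        # Reversing the rows and transposing is a 90-degree clockwise turn.
        frame = [list(row) for row in zip(*frame[::-1])]
    return frame


def restore_initial_rotation(frames, per_frame_rotations):
    """Return frames rotated back to the initial position, or None.

    None means the overall rotation is 0, so no additional fingerprint
    would need to be generated in this sketch.
    """
    overall = overall_rotation(per_frame_rotations)
    if overall == 0:
        return None
    # Undo a clockwise rotation of `overall` by completing the full turn.
    return [rotate_cw(frame, 360 - overall) for frame in frames]


# Usage: a frame uploaded rotated by 90 degrees is restored before it is
# fingerprinted a second time.
original = [[1, 2], [3, 4]]
uploaded = rotate_cw(original, 90)
restored = restore_initial_rotation([uploaded], [90, 90, 0])
```

The restored frames would then be passed to whatever fingerprinting technique is in use, and the resulting second fingerprint compared against the reference fingerprints.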
-
Publication number: US10462506B2
Publication date: 2019-10-29
Application number: US15837583
Application date: 2017-12-11
Applicant: Google LLC
Inventor: Valerii Zamaraiev , Filip Pavetic
IPC: H04N21/24 , H04N21/25 , H04N21/27 , H04N21/64 , H04N21/65 , H04N21/254 , H04N19/46 , H04N21/222 , H04N21/4627 , H04N13/161 , G06F21/10 , G06K9/00 , H04N21/835 , H04N13/194 , H04N21/231 , H04N21/234 , H04N21/2743 , H04N21/81 , G06F16/783
Abstract: Methods, systems, and media for identifying content in stereoscopic videos and, more particularly, for detecting abusive stereoscopic videos by generating fingerprints for multiple portions of a video frame are provided. The method comprises: receiving, from a user device, a video content item for uploading to a content provider; selecting a frame from a plurality of frames of the video content item for generating one or more fingerprints corresponding to the video content item; generating a first fingerprint corresponding to the selected frame, a second fingerprint corresponding to a first encoded portion of the selected frame, and a third fingerprint corresponding to a second encoded portion of the selected frame; comparing each of the first fingerprint, the second fingerprint, and the third fingerprint to a plurality of reference fingerprints corresponding to reference video content items; determining whether at least one of the first fingerprint, the second fingerprint, and the third fingerprint match a reference fingerprint of the plurality of reference fingerprints; and, in response to determining that at least one of the first fingerprint, the second fingerprint, and the third fingerprint match the reference fingerprint, causing an indication of the match to be presented on the user device.
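The three-fingerprint comparison in this abstract can be sketched as below. Several things here are assumptions made only for illustration: the frame is a 2D list of 0-255 values, the two encoded portions are the left and right halves of a side-by-side stereoscopic frame, and a SHA-256 digest stands in for a real video fingerprint (which would be robust to re-encoding, unlike an exact hash). The function names are hypothetical.

```python
import hashlib


def fingerprint(pixels):
    """Stand-in fingerprint: SHA-256 over the raw pixel values (exact match only)."""
    data = bytes(value for row in pixels for value in row)
    return hashlib.sha256(data).hexdigest()


def frame_fingerprints(frame):
    """Fingerprint the whole frame plus its left and right halves.

    The side-by-side split is an assumption; a top-bottom layout would
    split the rows instead of the columns.
    """
    half = len(frame[0]) // 2
    left = [row[:half] for row in frame]
    right = [row[half:] for row in frame]
    return [fingerprint(frame), fingerprint(left), fingerprint(right)]


def matches_reference(frame, reference_fingerprints):
    """True if at least one of the three fingerprints is a known reference."""
    return any(fp in reference_fingerprints for fp in frame_fingerprints(frame))


# Usage: a reference frame embedded as the left half of a stereoscopic frame
# is still detected, although the whole-frame fingerprint does not match.
reference = [[1, 2], [3, 4]]
references = {fingerprint(reference)}
stereo_frame = [[1, 2, 9, 9], [3, 4, 9, 9]]
found = matches_reference(stereo_frame, references)
```

Fingerprinting the portions separately is what lets a match surface when the reference content occupies only one eye's view of the frame.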
-