Abstract:
Systems, methods, and non-transitory computer-readable media can obtain a first video frame and a second video frame. The first video frame can be processed using a convolutional neural network to output a first set of feature maps. The second video frame can be processed using the convolutional neural network to output a second set of feature maps. The first set of feature maps and the second set of feature maps can be processed using a spatial matching layer of the convolutional neural network to determine an optical flow for at least one pixel.
Abstract:
Systems, methods, and non-transitory computer readable media configured to receive an advertisement including an image. A fraud assessment value for the advertisement can be determined. An image assessment value for the image can be determined. The fraud assessment value and a threshold value for fraud assessment can be compared. The image assessment value and a threshold value for image assessment can be compared. Fraud associated with the advertisement can be determined based on comparison of the fraud assessment value and the threshold value for fraud assessment and comparison of the image assessment value and the threshold value for image assessment.
Abstract:
Systems, methods, and non-transitory computer-readable media can receive a first image including a representation of a first user. A second image including a representation of a second user can be received. A first set of poselets associated with the first user can be detected in the first image. A second set of poselets associated with the second user can be detected in the second image. The first image including the first set of poselets can be inputted into a first instance of a neural network to generate a first multi-dimensional vector. The second image including the second set of poselets can be inputted into a second instance of the neural network to generate a second multi-dimensional vector. A first distance metric between the first multi-dimensional vector and the second multi-dimensional vector can be determined.
Abstract:
Systems, methods, and non-transitory computer-readable media can obtain a first video frame and a second video frame. The first video frame can be processed using a convolutional neural network to output a first set of feature maps. The second video frame can be processed using the convolutional neural network to output a second set of feature maps. The first set of feature maps and the second set of feature maps can be processed using a spatial matching layer of the convolutional neural network to determine an optical flow for at least one pixel.
Abstract:
Systems, methods, and non-transitory computer-readable media can receive a first image including a representation of a first user. A second image including a representation of a second user can be received. A first set of poselets associated with the first user can be detected in the first image. A second set of poselets associated with the second user can be detected in the second image. The first image including the first set of poselets can be inputted into a first instance of a neural network to generate a first multi-dimensional vector. The second image including the second set of poselets can be inputted into a second instance of the neural network to generate a second multi-dimensional vector. A first distance metric between the first multi-dimensional vector and the second multi-dimensional vector can be determined.
Abstract:
Systems, methods, and non-transitory computer-readable media can identify a set of regions corresponding to a geographical area. A collection of training images can be acquired. Each training image in the collection can be associated with one or more respective recognized objects and with a respective region in the set of regions. Histogram metrics for a plurality of object categories within each region in the set of regions can be determined based at least in part on the collection of training images. A neural network can be developed based at least in part on the histogram metrics for the plurality of object categories within each region in the set of regions and on the collection of training images.
Abstract:
Systems, methods, and non-transitory computer-readable media can acquire video content for which video feature descriptors are to be determined. The video content can be processed based at least in part on a convolutional neural network including a set of two-dimensional convolutional layers and a set of three-dimensional convolutional layers. One or more outputs can be generated from the convolutional neural network. A plurality of video feature descriptors for the video content can be determined based at least in part on the one or more outputs from the convolutional neural network.
Abstract:
Systems, methods, and non-transitory computer-readable media can receive a first content item having a set of frames. A binary hash code that represents the first content item is generated using at least an aggregation model and an iterative quantization hash model, the binary hash code being determined based at least in part on the set of frames of the first content item. The binary hash code is stored, wherein a similarity between the first content item and a second content item is capable of being measured based at least in part on a comparison of the binary hash code of the first content item and a binary hash code of the second content item.
Abstract:
Systems, methods, and non-transitory computer-readable media can detect an operation that causes a challenge response process to be initiated. An image category associated with a recognized category label can be identified. At least one image associated with the image category can be displayed during the challenge response process. The operation can be executed when the challenge response process, based on the at least one image, is successfully completed.
Abstract:
Systems, methods, and non-transitory computer-readable media can identify a set of regions corresponding to a geographical area. A collection of training images can be acquired. Each training image in the collection can be associated with one or more respective recognized objects and with a respective region in the set of regions. Histogram metrics for a plurality of object categories within each region in the set of regions can be determined based at least in part on the collection of training images. A neural network can be developed based at least in part on the histogram metrics for the plurality of object categories within each region in the set of regions and on the collection of training images.