摘要:
An information processing system, a computer readable storage medium, and a method for providing a recommendation for a plaything as a recommended item can include analyzing information received from a person monitoring system to provide an analysis for providing the recommendation for the plaything, and based on the analysis, sending a representation of the recommended item in a signal to a shopping cart such as an online shopping cart. The system can include an analysis module that receives information from a person monitoring system, and at least one processor configured to analyze information received from the person monitoring system (for one or more persons) to provide an analysis. The analysis provides a recommendation for a plaything. The processor can further send a representation of the recommended item in a signal to a shopping cart to upload into the shopping cart based on the analysis. Other embodiments are disclosed.
摘要:
An approach for re-identifying an object in a test image is presented. Similarity measures between the test image and training images captured by a first camera are determined. The similarity measures are based on Bhattacharyya distances between feature representations of an estimated background region of the test image and feature representations of background regions of the training images. A transformed test image based on the Bhattacharyya distances has a brightness that is different from the test image's brightness and matches a brightness of training images captured by a second camera. An appearance of the transformed test image resembles an appearance of a capture of the test image by the second camera. Another image included in test images captured by the second camera is identified as being closest in appearance to the transformed test image and another object in the identified other image is a re-identification of the object.
摘要:
Machine logic that pre-processes and post-processes images for visual object detection by performing the following steps: receiving a set of image(s); filtering the set of image(s) using a set of multimodal integral filter(s), thereby removing at least a portion of the set of image(s) and resulting in a filtered set of image(s); performing object detection on the filtered set of image(s) to generate a set of object-detected image(s); assembling a first plurality of object-detected image(s) from the set of object-detected image(s); and upon assembling the first plurality of object-detected image(s), performing non-maximum suppression on the assembled first plurality of object-detected image(s).
摘要:
An embodiment of the invention provides a method for finding missing persons by learning features for person attribute classification based on deep learning. A first component of a neural network identifies geographic locations of training images; and, a second component of the neural network identifies weather information for each of the identified geographic locations. A third component of the neural network generates image pairs from the training images. For each image pair of the image pairs, the third component of the neural network determines whether images of the image pair include the same person. The neural network generates neural network parameters with the identified geographic locations, the weather information for each of the identified geographic locations, and the determination of whether the images of the image pairs include the same person.
摘要:
An aspect of automated color adjustment of media files includes receiving profile data corresponding to a subject of image capture. The profile data indicates color values of an element associated with the subject. An aspect also includes storing the profile data in a memory device coupled to a computer processor, capturing an image of the subject, and processing the image and adjusting color aspects based on the color values associated with the element.
摘要:
An approach for re-identifying an object in a first test image is presented. Brightness transfer functions (BTFs) between respective pairs of training images are determined. Respective similarity measures are determined between the first test image and each of the training images captured by the first camera (first training images). A weighted brightness transfer function (WBTF) is determined by combining the BTFs weighted by weights of the first training images. The weights are based on the similarity measures. The first test image is transformed by the WBTF to better match one of the training images captured by the second camera. Another test image, captured by the second camera, is identified because it is closer in appearance to the transformed test image than other test images captured by the second camera. An object in the identified test image is a re-identification of the object in the first test image.
摘要:
Images are retrieved and ranked according to relevance to attributes of a multi-attribute query through training image attribute detectors for different attributes annotated in a training dataset. Pair-wise correlations are learned between pairs of the annotated attributes from the training dataset of images. Image datasets may are searched via the trained attribute detectors for images comprising attributes in a multi-attribute query. The retrieved images are ranked as a function of comprising attributes that are not within the query subset plurality of attributes but are paired to one of the query subset plurality of attributes by the pair-wise correlations, wherein the ranking is an order of likelihood that the different ones of the attributes will appear in an image with the paired one of the query subset plurality of attributes.
摘要:
Objects within two-dimensional video data are modeled by three-dimensional models as a function of object type and motion through manually calibrating a two-dimensional image to the three spatial dimensions of a three-dimensional modeling cube. Calibrated three-dimensional locations of an object in motion in the two-dimensional image field of view of a video data input are determined and used to determine a heading direction of the object as a function of the camera calibration and determined movement between the determined three-dimensional locations. The two-dimensional object image is replaced in the video data input with an object-type three-dimensional polygonal model having a projected bounding box that best matches a bounding box of an image blob, the model oriented in the determined heading direction. The bounding box of the replacing model is then scaled to fit the object image blob bounding box, and rendered with extracted image features.
摘要:
A camera at a fixed vertical height positioned above a reference plane, with an axis of a camera lens at an acute angle with respect to a perpendicular of the reference plane. One or more processors receive camera images of a multiplicity of people of unknown height and vertical axis of the images are transformed into pixel counts. The known heights of people from a known statistical distribution of heights of people are received by one or more processors and transformed to a normalized measurement of pixel counts, based in part on a focal length of the camera lens, the angle of the camera, and an objective function summing differences between pixel counts of the known heights of people and the unknown heights of people. The fixed vertical height of the camera is determined by adjusting the estimated camera height to minimize the objective function.
摘要:
Techniques, systems, and articles of manufacture for multispectral detection of attributes for video surveillance. A method includes generating one or more training sets of one or more multispectral images, generating a group of one or more multispectral box features, using the one or more training sets to select one or more of the one or more multispectral box features to generate a multispectral attribute detector, and using the multispectral attribute detector to identify a location of an attribute in video surveillance, wherein using the multispectral attribute detector comprises, for one or more locations on each spectral band level of the multispectral image, applying the multispectral attribute detector and producing an output indicating attribute detection or an output indicating no attribute detection, and wherein the attribute corresponds to the multispectral attribute detector.