Abstract:
Embodiments relate to a method for determining a search region, the method including acquiring object information of a target object included in an image query, generating a set of non-image features of the target object based on the object information, setting a search candidate region based on a user input, acquiring information associated with the search candidate region from a region database, and determining a search region based on at least one of the information associated with the search candidate region or at least part of the set of non-image features. Embodiments also relate to a system for performing the method.
Abstract:
Disclosed is a method for facial age simulation based on an age of each facial part and environmental factors, which includes: measuring an age of each facial part on the basis of an input face image; designating a personal environmental factor; transforming an age of each facial part by applying an age transformation model according to the age of each facial part and the environmental factor; reconstructing the image transformed for each facial part; and composing the reconstructed images to generate an age-transformed face. Accordingly, it is possible to transform a face realistically based on an age measured for each facial part and an environmental factor.
Abstract:
Disclosed are a device and a method for inferring a correlation between objects through image recognition. The device according to an embodiment comprises a communicator and an interaction inferencer configured to select a main object interacting with a target object at a predetermined distance in an input image, or to generate a social graph including the main object and the target object.
Abstract:
A kiosk for providing a recommendation service according to an embodiment displays an orderer's past ordered product as a recommended product on the screen of the kiosk. The past ordered product is read based on a similarity calculation result between a current input attribute, which represents a contextual feature of the current order status, and a past input attribute stored in memory.
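The similarity-based lookup described above can be illustrated with a minimal sketch. The abstract does not say how input attributes are encoded or which similarity measure is used, so the numeric attribute vectors, the choice of cosine similarity, and the names cosine_similarity and recommend are all assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector carries no contextual information
    return dot / (norm_a * norm_b)


def recommend(current_attr, past_orders):
    """Return the product of the past order most similar to the current context.

    past_orders is a list of (attribute_vector, product) pairs, standing in
    for the past input attributes stored in memory (hypothetical layout).
    """
    if not past_orders:
        return None
    best = max(past_orders, key=lambda o: cosine_similarity(current_attr, o[0]))
    return best[1]


# Hypothetical stored history: attribute vector -> product ordered in that context.
history = [
    ([1.0, 0.0, 0.0], "iced coffee"),
    ([0.0, 1.0, 1.0], "hot tea"),
]
suggestion = recommend([0.9, 0.1, 0.0], history)
```

Here the current context lies closest to the first stored context, so the product ordered in that context is the one the kiosk would display.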
Abstract:
Embodiments relate to a dynamic image capturing method and apparatus using an arbitrary viewpoint image generation technology, in which an image of background content, either displayed on a background content display unit or implemented in a virtual space through a chroma key screen, is generated with a view that matches the view of a subject as seen from the viewpoint of a camera, and a final image including the image of the background content and a subject area is obtained.
Abstract:
Disclosed are an X-ray image reading support method and an X-ray image reading support system performing the method. The method includes the steps of: acquiring a target X-ray image captured by transmitting or reflecting X-rays in a reading space in which an object to be read is disposed; applying the target X-ray image to a reading model that extracts features from an input image; and identifying the object to be read as an object corresponding to the classified class when the object to be read is classified as a set class based on a first feature set extracted from the target X-ray image.
Abstract:
Embodiments relate to a method and system for determining a situation of a facility by imaging sensing data of the facility, including receiving sensing data through a plurality of sensors at a query time, generating a situation image that shows the situation of the facility at the query time based on the sensing data, and determining whether an abnormal situation occurred at the query time by applying the situation image to a pre-learned situation determination model.
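One way to picture the "imaging" step is the sketch below, which scales each sensor reading to an 8-bit intensity and lays the values out as a small grid. The abstract does not specify the mapping, so the min/max scaling, the row-major layout in sorted sensor-id order, and the names to_pixel, situation_image, and is_abnormal are assumptions. The model passed to is_abnormal is a trivial stand-in, not the pre-learned situation determination model.

```python
def to_pixel(value, lo, hi):
    """Scale a reading in [lo, hi] to an 8-bit intensity, clamping outliers."""
    if hi <= lo:
        raise ValueError("sensor range must satisfy lo < hi")
    ratio = (value - lo) / (hi - lo)
    ratio = min(1.0, max(0.0, ratio))
    return round(ratio * 255)


def situation_image(readings, ranges, width):
    """Build a 2D grid of intensities from sensor readings at one query time.

    readings: dict of sensor id -> value; ranges: dict of sensor id -> (lo, hi).
    Sensors are placed row-major in sorted id order (assumed layout), and the
    last row is padded with zeros.
    """
    pixels = [to_pixel(readings[s], *ranges[s]) for s in sorted(readings)]
    while len(pixels) % width:
        pixels.append(0)
    return [pixels[i:i + width] for i in range(0, len(pixels), width)]


def is_abnormal(image, model):
    """Apply a situation determination model (any callable) to the image."""
    return bool(model(image))


# Hypothetical readings from three sensors at one query time.
readings = {"p1": 0.0, "t1": 20.0, "t2": 100.0}
ranges = {"p1": (0.0, 10.0), "t1": (0.0, 100.0), "t2": (0.0, 100.0)}
image = situation_image(readings, ranges, width=2)


def saturation_rule(img):
    """Stand-in model: flag the situation if any pixel is near saturation."""
    return max(max(row) for row in img) >= 250


flag = is_abnormal(image, saturation_rule)
```

In practice the callable would be a trained classifier that takes the situation image as input; the rule above only shows where such a model plugs in.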
Abstract:
A method of multi-view deblurring for 3-dimensional (3D) shape reconstruction includes: receiving images captured by multiple synchronized cameras at multiple viewpoints; iteratively estimating a depth map, a latent image, and 3D motion at each viewpoint for the received images; determining whether image deblurring at each viewpoint is completed; and performing 3D reconstruction based on the final depth maps and latent images at each viewpoint. Accordingly, it is possible to achieve accurate deblurring and 3D reconstruction even from motion-blurred images.
Abstract:
A video deblurring method based on a layered blur model includes estimating a latent image, an object motion and a mask for each layer in each frame using images consisting of a combination of layers during an exposure time of a camera when receiving a blurred video frame, applying the estimated latent image, object motion and mask for each layer in each frame to the layered blur model to generate a blurry frame, comparing the generated blurry frame and the received blurred video frame, and outputting a final latent image based on the estimated object motion and mask for each layer in each frame, when the generated blurry frame and the received blurred video frame match. Accordingly, by modeling a blurred image as an overlap of images consisting of a combination of foreground and background during exposure, more accurate deblurring results at object boundaries can be obtained.
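The generate-and-compare step described above can be sketched in one dimension. This is a simplified illustration only: it assumes a single foreground layer over a static background, integer per-sample displacements during the exposure, and a plain average over those samples. The names shift, synthesize_blur, and frames_match are hypothetical, and the abstract does not define the model at this level of detail.

```python
def shift(row, dx, fill=0.0):
    """Translate a 1D signal by dx samples, filling uncovered positions."""
    n = len(row)
    out = [fill] * n
    for x, v in enumerate(row):
        if 0 <= x + dx < n:
            out[x + dx] = v
    return out


def synthesize_blur(background, foreground, mask, offsets):
    """Average the layer composites seen during the exposure.

    At each sampled instant the foreground and its mask are displaced by the
    given offset and composited over the background; the blurry frame is the
    mean of these composites.
    """
    if not offsets:
        raise ValueError("at least one exposure sample is required")
    n = len(background)
    acc = [0.0] * n
    for dx in offsets:
        fg = shift(foreground, dx)
        m = shift(mask, dx)
        for x in range(n):
            acc[x] += m[x] * fg[x] + (1.0 - m[x]) * background[x]
    return [v / len(offsets) for v in acc]


def frames_match(a, b, tol=1e-6):
    """Compare a generated blurry frame with the received one."""
    return max(abs(x - y) for x, y in zip(a, b)) <= tol


# A bright one-pixel object moving one sample to the right over a dark background.
generated = synthesize_blur(
    background=[0.0, 0.0, 0.0, 0.0],
    foreground=[1.0, 0.0, 0.0, 0.0],
    mask=[1.0, 0.0, 0.0, 0.0],
    offsets=[0, 1],
)
```

The object's intensity is spread evenly over the two positions it occupied, which is the kind of boundary mixing of foreground and background that the layered model is meant to capture.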
Abstract:
Embodiments relate to a human behavior recognition system using hierarchical class learning considering safety, the human behavior recognition system including a behavior class definer configured to form a plurality of behavior classes by sub-setting a plurality of images each including a subject according to pre-designated behaviors and assign a behavior label to the plurality of images, a safety class definer configured to calculate a safety index for the plurality of images, form a plurality of safety classes by sub-setting the plurality of images based on the safety index, and additionally assign a safety label to the plurality of images, and a trainer configured to train a human recognition model by using the plurality of images defined as hierarchical classes by assigning the behavior label and the safety label as training images.
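The two-level labeling described above can be sketched as follows. The abstract does not state how the safety index is calculated or how safety classes are delimited, so the sketch takes the index as given and bins it with fixed thresholds. The thresholds, the record fields, and the names safety_class, assign_safety_labels, and group_by_hierarchy are assumptions for illustration.

```python
from bisect import bisect_right
from collections import defaultdict


def safety_class(index, thresholds):
    """Map a safety index to a class number using ascending thresholds."""
    return bisect_right(thresholds, index)


def assign_safety_labels(samples, thresholds):
    """Add a safety label to samples that already carry a behavior label."""
    return [
        {**s, "safety_label": safety_class(s["safety_index"], thresholds)}
        for s in samples
    ]


def group_by_hierarchy(labeled):
    """Group image ids by their (behavior label, safety label) pair."""
    groups = defaultdict(list)
    for s in labeled:
        groups[(s["behavior_label"], s["safety_label"])].append(s["image"])
    return dict(groups)


# Hypothetical samples; the safety index values are made up for the example.
samples = [
    {"image": "img_001", "behavior_label": "walking", "safety_index": 0.1},
    {"image": "img_002", "behavior_label": "walking", "safety_index": 0.9},
    {"image": "img_003", "behavior_label": "climbing", "safety_index": 0.5},
]
labeled = assign_safety_labels(samples, thresholds=[0.3, 0.7])
groups = group_by_hierarchy(labeled)
```

Each training image then carries both labels, so a trainer can treat every (behavior, safety) pair as one hierarchical class.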