Abstract:
Disclosed are a method and an apparatus for generating metadata of immersive media, and an apparatus and a method for transmitting metadata-related information. The apparatus includes: at least one of a camera module that photographs or captures an image, a gyro module that senses horizontality, a global positioning system (GPS) module that calculates a position by receiving a satellite signal, and an audio module that records audio; a network module that receives sensor effect information from a sensor aggregator through a wireless communication network; and an application that generates metadata by performing timer-synchronization of an image captured by the camera module, a sensor effect collected using the gyro module or the GPS module, or audio collected by the audio module.
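One possible reading of the timer-synchronization step is sketched below: each captured frame is paired with the sensor sample whose timestamp is closest to the frame time. The abstract does not specify the synchronization rule, so the nearest-timestamp policy, the millisecond timestamps, and the names `nearest_sample` and `build_metadata` are all assumptions made for illustration.

```python
from bisect import bisect_left

def nearest_sample(samples, t):
    """Return the (timestamp, value) pair closest in time to t.

    `samples` must be a non-empty list sorted by timestamp.
    """
    times = [ts for ts, _ in samples]
    i = bisect_left(times, t)
    # Only the neighbours on either side of the insertion point can be nearest.
    candidates = samples[max(i - 1, 0): i + 1]
    return min(candidates, key=lambda s: abs(s[0] - t))

def build_metadata(frame_times, gyro, gps):
    """Attach the nearest gyro and GPS readings to every frame time (assumed policy)."""
    return [
        {"t": t, "gyro": nearest_sample(gyro, t)[1], "gps": nearest_sample(gps, t)[1]}
        for t in frame_times
    ]

# Hypothetical data: timestamps in milliseconds, values are placeholder labels.
frame_times = [0, 1000]
gyro = [(100, "g0"), (900, "g1")]
gps = [(0, "p0"), (1500, "p1")]
meta = build_metadata(frame_times, gyro, gps)
```

Audio and sensor effect information received over the network could be aligned in the same way, provided every source carries a timestamp from a common clock.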
Abstract:
The present disclosure provides a method and a device for training a neural network model for use in analyzing captured images, and an intelligent image capturing apparatus employing the same. The neural network model can be trained by performing image reconstruction and image classification based on image data received from a plurality of image capturing devices installed in a monitoring area, calculating at least one loss function based on data processed by the neural network model or the neural network model training device, and determining parameters that minimize the loss function. In addition, the neural network model can be updated through re-training that takes newly acquired image data into account. Accordingly, the image analysis neural network model can operate with high precision and accuracy.
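The abstract names reconstruction and classification as the two training tasks but does not say how their losses are combined. A minimal sketch, assuming a mean squared error for reconstruction, a cross-entropy for classification, and a weighted sum of the two (all three choices are assumptions, as are the function names), could look like this:

```python
import math

def reconstruction_loss(original, reconstructed):
    """Mean squared error between input pixels and reconstructed pixels."""
    return sum((o - r) ** 2 for o, r in zip(original, reconstructed)) / len(original)

def classification_loss(probabilities, label):
    """Cross-entropy for a single sample: negative log-probability of the true class."""
    return -math.log(probabilities[label])

def total_loss(original, reconstructed, probabilities, label, weight=1.0):
    """Weighted sum of both terms; `weight` is a hypothetical balancing factor."""
    return (reconstruction_loss(original, reconstructed)
            + weight * classification_loss(probabilities, label))

# Toy values: a two-pixel "image" and a two-class prediction.
loss = total_loss([0.0, 1.0], [0.0, 0.5], [0.25, 0.75], label=1)
```

Training would then search for the model parameters that minimize `total_loss` over the collected image data, and re-training would repeat that search once new images are added.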
Abstract:
Disclosed is a sensory information providing apparatus. The sensory information providing apparatus may comprise a learning model database that stores a plurality of learning models related to sensory effect information with respect to a plurality of videos; and a video analysis engine that generates the plurality of learning models by analyzing the plurality of videos and sensory effect meta information of the plurality of videos to extract sensory effect association information, and that extracts sensory information corresponding to an input video stream by analyzing the input video stream based on the plurality of learning models.
Abstract:
Disclosed is an apparatus for managing energy. The apparatus includes a user recognition unit configured to recognize each user of an energy consuming device; an energy usage receiving unit configured to receive the energy usage consumed by each user; and a communication unit configured to communicate with an energy volume-rate server that manages the energy usage of each user. The apparatus further includes a controlling unit configured to control the driving of the energy consuming device according to regulatory guidance for the energy consuming device that is received via the communication unit. The regulatory guidance is determined by the energy volume-rate server.
Abstract:
Provided are an apparatus and a method for controlling a terminal based on a living pattern. The apparatus includes an interface configured to read n sets of physical terminal use information from a database and to select, from among the n sets, m sets of physical terminal use information for which the length of time between a set reference time and a start time or a termination time is less than or equal to the length of a set interval time, each of n and m denoting a natural number. The apparatus also includes a processor configured to generate a virtual terminal profile based on the m sets of physical terminal use information and to control a virtual terminal included in the generated virtual terminal profile.
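The selection of m sets out of n can be sketched as a simple filter. The record layout (`UseRecord`), the use of minutes since midnight as the time unit, and the function name `select_records` are assumptions made for illustration; the abstract only states the comparison against the reference time and interval.

```python
from dataclasses import dataclass

@dataclass
class UseRecord:
    terminal_id: str
    start: int  # minutes since midnight (assumed unit)
    end: int    # minutes since midnight (assumed unit)

def select_records(records, reference_time, interval):
    """Keep records whose start or termination time lies within
    `interval` minutes of `reference_time`."""
    return [
        r for r in records
        if abs(r.start - reference_time) <= interval
        or abs(r.end - reference_time) <= interval
    ]

# n = 3 hypothetical use records read from the database.
records = [
    UseRecord("lamp", 420, 450),     # 07:00 to 07:30
    UseRecord("tv", 1140, 1320),     # 19:00 to 22:00
    UseRecord("kettle", 405, 420),   # 06:45 to 07:00
]
selected = select_records(records, reference_time=420, interval=30)

# A stand-in for the virtual terminal profile: the terminals seen in the m records.
profile = sorted({r.terminal_id for r in selected})
```

This sketch does not handle intervals that wrap around midnight; a fuller version would compare times modulo 24 hours.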
Abstract:
An image encoding method using a Binary Partition Tree (BPT) includes performing the BPT on a reference frame, detecting, based on a result of the BPT of the reference frame, blocks in a current frame whose difference in pixel value exceeds a threshold value, and performing the BPT of the current frame on the detected blocks. In accordance with the present invention, block partitioning is not applied to all frames; instead, a partial partitioning method based on the difference between the pixel values of a reference frame and a current frame to be encoded is provided. Accordingly, the encoding speed for P frames or B frames can be improved. Furthermore, the PSNR of a corresponding frame can be maintained within a specific range of the PSNR of the reference frame, and the compression effect can be improved.
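The block detection step can be illustrated with a short sketch. It assumes the reference frame's partition is available as a list of `(x, y, width, height)` rectangles and that the "difference in a pixel value" is measured as the mean absolute difference per block; the abstract fixes neither the block representation nor the metric, so both are assumptions, as is the name `changed_blocks`.

```python
def changed_blocks(reference, current, blocks, threshold):
    """Return the blocks whose mean absolute pixel difference between
    the reference frame and the current frame exceeds `threshold`."""
    changed = []
    for (x, y, w, h) in blocks:
        total = 0
        for row in range(y, y + h):
            for col in range(x, x + w):
                total += abs(current[row][col] - reference[row][col])
        if total / (w * h) > threshold:
            changed.append((x, y, w, h))
    return changed

# 4x4 grayscale frames as nested lists; only the top-right block changes.
reference = [[10] * 4 for _ in range(4)]
current = [row[:] for row in reference]
current[0][2] = 90
current[1][3] = 90

# Hypothetical partition produced for the reference frame.
blocks = [(0, 0, 2, 2), (2, 0, 2, 2), (0, 2, 4, 2)]
result = changed_blocks(reference, current, blocks, threshold=20)
```

Only the blocks in `result` would be partitioned again for the current frame; the remaining blocks would reuse the reference frame's partition, which is where the speed gain described above would come from.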
Abstract:
The present invention relates to an image encoding method using an adaptive preprocessing scheme, including loading an input image for each frame, determining an encoding type of each of the frames, determining the size of a block to be encoded in each frame according to the determined encoding type, determining blocks that can be replicated from the blocks having the determined size and performing an intra-picture replication preprocessing or inter-picture replication preprocessing procedure on the determined blocks according to the encoding types of the frames, and encoding the frames on which the preprocessing procedure has been performed.