Abstract:
A method for encoding a video stream generated from at least one ultra-high resolution camera capturing sequential image frames from a fixed viewpoint of a scene includes decomposing the sequential image frames into a quasi-static background and dynamic image features, and distinguishing between different objects represented by the dynamic image features by recognizing characteristics and tracking movement of the objects in the sequential image frames. The dynamic image features are formatted into a sequence of miniaturized image frames that reduces at least one of inter-frame movement of the objects and high spatial frequency data. The sequence of miniaturized image frames is compressed into a dynamic data layer, and the quasi-static background is compressed into a quasi-static data layer. The dynamic data layer and the quasi-static data layer are encoded together with setting metadata pertaining to the scene and the at least one ultra-high resolution camera, and with corresponding consolidated formatting metadata pertaining to the decomposing and formatting procedures.
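
The decomposing and formatting steps can be illustrated with a minimal sketch. The abstract does not state how the quasi-static background is estimated or how the miniaturized image frames are formed, so everything below is an assumption made for illustration: a per-pixel temporal median stands in for the background, a fixed intensity threshold marks dynamic pixels, and each miniaturized frame is a crop around the dynamic pixels with its position kept as formatting metadata. The function names and the `threshold` parameter are hypothetical, and object recognition, tracking, and compression are omitted.

```python
from statistics import median


def estimate_background(frames):
    """Per-pixel temporal median (assumed stand-in for the quasi-static background)."""
    height, width = len(frames[0]), len(frames[0][0])
    return [
        [median(frame[y][x] for frame in frames) for x in range(width)]
        for y in range(height)
    ]


def dynamic_mask(frame, background, threshold=20):
    """Mark pixels whose intensity differs from the background by more than threshold."""
    return [
        [abs(pixel - bg) > threshold for pixel, bg in zip(row, bg_row)]
        for row, bg_row in zip(frame, background)
    ]


def bounding_box(mask):
    """Return (top, left, bottom, right) of the dynamic pixels, or None if there are none."""
    coords = [(y, x) for y, row in enumerate(mask) for x, on in enumerate(row) if on]
    if not coords:
        return None
    ys = [y for y, _ in coords]
    xs = [x for _, x in coords]
    return min(ys), min(xs), max(ys), max(xs)


def crop(frame, box):
    """Cut the boxed region out of a frame to form a miniaturized image frame."""
    top, left, bottom, right = box
    return [row[left:right + 1] for row in frame[top:bottom + 1]]


def decompose(frames, threshold=20):
    """Split frames into a background and a list of object-centered patches.

    Each entry of the dynamic list holds the patch plus its position; the
    position plays the role of formatting metadata needed to put the patch back.
    """
    background = estimate_background(frames)
    dynamic = []
    for frame in frames:
        box = bounding_box(dynamic_mask(frame, background, threshold))
        dynamic.append(None if box is None else {"box": box, "patch": crop(frame, box)})
    return background, dynamic
```

For a small bright object moving across an otherwise constant scene, the patches returned by `decompose` are identical from frame to frame while only the recorded box changes. This is one way to read the reduction of inter-frame movement: the motion is carried in the metadata rather than in the pixel data that is later compressed.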