摘要:
An image processing apparatus and method generate a three dimensional representation of a scene which includes a plurality of objects disposed on a plane. The three dimensional representation is generated from one or more video images of the scene, which include the objects on the plane produced from a view of the scene by a video camera. The method comprises processing the captured video images so as to extract one or more image features from each object, comparing the one or more image features with sample image features from a predetermined set of possible example objects which the video images may contain, and identifying the objects from the comparison of the image features with the stored image features of the possible example objects. The method also includes generating object path data, which includes object identification data for each object which identifies the respective object; and provides a position of the object on the plane in the video images with respect to time. The method further includes calculating a projection matrix for projecting the position of each of the objects according to the object path data from the plane into a three dimensional model of the plane. As such a three dimensional representation of the scene which includes a synthesised representation of each of the plurality of objects on the plane can be produced, by projecting the position of the objects according to the object path data into the plane of the three dimensional model of the scene using the projection matrix and a predetermined assumption of the height of each of the objects. Accordingly, a three dimensional representation of a live video image of, for example, a football match can be generated, or tracking information included on the live video images. As such, a change in a relative view of the generated three dimensional representation can be made, so that a view can be provided in the three dimensional representation of the scene from a view point at which no camera is actually present to capture video images of the live scene.
摘要:
Method and system for determining whether a spherical element impacts with a component of a playing field, or arranged on or proximate thereto The method comprises: a) acquiring images of a surveillance area of a field that covers at least part of said component, such as a delimiting perimeter line (L) of a game area (Z1-Z10) or a target (A1-A8); b) performing an approximate detection of an impact of a spherical element relative to that component or proximate thereto, with an object detection and recognition system that can discern when the detected object is indeed a spherical element; c) automatically selecting one of the images acquired for the same point in time and that includes the area where said impact has occurred; and d) analysing the selected image to check if the spherical element has impacted or not with the component. The system is adapted to implement the method of the invention.
摘要:
A method of processing image data comprises the steps of: • providing an image sequence such as a video sequence, or a camera transition, • identifying a region-of-interest (1) in at least one image of the image sequence, • defining a transition region around the region-of-interest (1) and defining a remaining portion of the image to be a default region or background region, • applying different image effects to the region-of-interest (1), the transition region and the background region.
摘要:
The invention concerns a device and an automatic resetting method using electronic means intended for a geometric model of a scene over a picture of the scene including fixed references, the electronic means comparing the picture with model having been adjusted in perspective by homography for superimposition of the references. According to the invention, the electronic device calculates a fine homography function Hf for resetting into three phases: a first preliminary phase of determination of an average resetting homography consisting in determining an average homography function Hm applicable to the model with average adjustment over a sample of pictures of the scene taken previously, a second, rough resetting phase consisting after application of the average homography function Hm, to the model in determining a rough homography function Hg, and a third, fine resetting phase consisting after application of the rough homography function Hg to the model in determining the fine homography function Hf.
摘要:
An information processing apparatus, comprising: receiver circuitry configured to receive i) a video stream comprising a plurality of images, each image containing at least one object located at a position in a plane and the image having associated image timing information indicating its temporal position in the video stream and ii) a stream of object tracking information each having associated object timing information indicating its temporal position in the stream of object tracking information wherein the stream of object tracking information corresponds to the video stream and separately defines the position of the object in the plane captured in each image in the video stream; and controller circuitry configured to perform pattern matching on the position of each of the objects in the video stream and the object tracking information, and when there is a closest matching position, synchronising the image timing information and the object timing information.
摘要:
A method for encoding a video stream generated from at least one ultra-high resolution camera capturing sequential image frames from a fixed viewpoint of a scene includes decomposing the sequential image frames into quasi-static background and dynamic image features; distinguishing between different objects represented by the dynamic image features by recognizing characteristics and tracking movement of the objects in the sequential image frames. The dynamic image features are formatted into a sequence of miniaturized image frames that reduces at least one of: inter-frame movement of the objects; and high spatial frequency data. The sequence is compressed into a dynamic data layer and the quasi-static background into a quasi-static data layer. The dynamic data layer and the quasi-static data layer are encoded with setting metadata pertaining to the scene and the at least one ultra-high resolution camera, and corresponding consolidated formatting metadata pertaining to the decomposing and formatting procedures.
摘要:
The invention relates to a method for recording and playing back a sequence of events, in particular a sporting event, on a three-dimensional playing field, such as e.g. a soccer pitch, in which image data of the playing field is continuously generated by at least two cameras (2) and said image data is transmitted to a processing unit (3) having a ROI-sampling unit. The ROI-sampling unit selects algorithm-based relevant image regions of the particular camera (2), from which local feature data is then generated and subsequently relayed to a higher-level central evaluating unit (4). In the central evaluating unit (4) the local feature data for determining an object part is combined, in order by repetition of this step for local feature data in adjoining image regions of the cameras to construct a virtual 3D object (5) in a virtual 3D space (6). As a result, a real event that has been recorded by the cameras (2) is reconstructed in the virtual 3D space (6) using the constructed 3D objects (5) and finally displayed in the form of an image sequence.
摘要:
In one aspect, lines in image data of an event are automatically found and repaired. For example, the event may be a sporting event which is played on a field, and the line segment is a field line on the field which may be obscured by a player, game ball or other object. The line segment is automatically detected in a mask image, and a portion of the line segment which is occluded by the object is automatically determined, and the object is automatically removed. The line segment can also be repaired. Optionally, a virtual viewpoint of the event is provided from the image, with the line repaired and the object removed. In another aspect, an object in an image of an event is automatically located by detecting blobs in the image which meet at least one specified criterion, such as size, aspect ratio, density or color profile.