摘要:
A method for pairing and operating a multi-mode pointing device is provided. A pointing device may be automatically paired with an image display device. A pairing request signal may be transmitted on a prescribed frequency channel, and a signal may be received indicating information of a plurality of frequency channels. One of the frequency channels may be selected as a pairing frequency channel for operating in a radio frequency mode.
摘要:
An apparatus, for use in a low bit-rate image signal encoder, for detecting a foreground region in an image signal, wherein the image signal has a number of objects therein and the foreground region has at least one moving object therein, which comprises: a previous frame memory for storing a previous frame of the image signal; a current frame memory for storing a current frame of the image signal; a change detector for comparing the current frame and the previous frame to detect changed pixels, each changed pixel having a differential luminance value; a region detector for detecting an object region for one of the objects in the current frame; and a foreground/background determinator for counting the number of all of the pixels contained in the object region to determine a first size of the object region; and for counting the number of changed pixels which are located in the object region to determine a second size of the object region; and for comparing the first size with the second size to determine whether the object region is the foreground region.
摘要:
An apparatus for determining a set of motion vectors for a set of feature points located on a contour of a moving object of a video signal comprises a contour displacement detection circuit for detecting a contour displacement range between a previous contour of the moving object contained in the previous frame and a current contour of the moving object contained in the current frame; and a feature point motion estimation circuit for comparing one of feature point blocks, having one of the feature points, with each of candidate blocks derived from a corresponding feature point search region adaptively regulated by the contour displacement range and for assigning a spatial displacements between one of the feature point block and a most similar candidate block thereto as a motion vector for a feature point corresponding to the one to thereby generating the set of motion vectors for the set of feature points.
摘要:
A motion-compensated video signal encoder has a circuit for determining a predicted current frame based on a current frame and a previous frame of a digital video signal. The circuit includes a region detection circuit for detecting a processing region encompassing a moving object from the previous frame based on a difference between the current and the previous frames to generate region information representing the detected processing region. Therefore, a number of pixels is selected from the pixels contained in the detected processing region as feature points based on the region information. A first set of motion vectors between the current and the previous frames, each of the first set of motion vectors representing a motion for each of the selected pixels is then detected. The first set of motion vectors is used for predicting the predicted current frame and transmitted as a set of motion vectors of the video signal together with the region information.
摘要:
A method for determining a set of motion vectors between a current frame and a reference frame of video signals, which comprises the steps of: (a) one dimensionally comparing a search block from the current frame with said plurality of candidate blocks included in a search region corresponding to the search block, on a block-by-block basis, by employing a one-dimensional error function using horizontal and vertical integral projections to select a predetermined number of candidate blocks in an ascending order of the one-dimensional error; (b) two dimensionally comparing the search block with the predetermined number of candidate blocks selected in step(a) above, on a block-by-block basis, by employing a two-dimensional error function to thereby select a most similar candidate block and derive a motion vector representing the displacement of pixels between the search block and the most similar candidate block so as to assign the derived motion vector as the motion vector for the search block; and (c) repeating steps (a) and (b) until all of the search blocks constituting the current frame are searched against the entire reference frame, to thereby derive a set of motion vectors.
摘要:
A method, for use in a segmentation-based coding system, for detecting motion vectors between a current frame and a previous frame of digital video signal comprises the steps of (a) generating segmentation data for each of the segmented regions, the segmentation data including contour information representing the shape and location for each segmented region and texture information representing a mean gray level of all pixels contained in each segmented region, (b) generating candidate segmentation data for each of the candidate regions, (c) producing a minimum-sized predetermined geometric pattern for encompassing a segmented region, (d) producing a minimum-sized predetermined geometric pattern for encompassing each of the candidate regions, (e) calculating an error value between the segmented region and each of the candidate regions, (f) selecting a candidate region yielding a smallest error value as a search region, (g) detecting one or more motion vectors between the segmented region and the search region; and (h) repeating said steps (c) to (g) for all the segmented regions.
摘要:
An inventive apparatus estimates correct motion vectors between a current and its previous frames for feature points selected from the previous frame. A motion vector detector first determines a set of initial motion vectors for the feature points. Subsequently, a displacement generator provides N number of displacements to an initial motion vector of a selected feature point to thereby produce a set of N candidate motion vectors for the selected feature point, wherein the selected feature point is surrounded by its, e.g., six neighboring feature points to form a previous hexagon having a set of six triangles. And then, a predictor provides quasi-feature points in the current frame from the neighboring feature points through the use of the initial motion vectors thereof; and also generates a set of N selected quasi-feature points from the selected feature point by using the N candidate motion vectors. And a weight factor generator selectively generates one from a multiplicity of weigh factors based on position data of each of the pixels in the current hexagon and each of the N subject quasi-feature points. Finally, a motion vector selector selects a motion vector from the N candidate motion vectors corresponding to a set of six triangles in the current hexagon which yields a minimum error value among N error values obtained through the use of the weight factors between the current and the previous hexagons.
摘要:
An apparatus, for use in an image signal encoding system, encodes a bidirectionally predictive frame (B frame) of a digital video signal to generate an encoded B frame signal. The apparatus selects two groups of pixels from pixels contained in a detected processing regions of a first and a second reference frames as a first and a second sets of feature points, respectively, by using a feature points selection circuit; and then determines a first set of motion vectors between the current and the first reference frames and a second set of motion vectors between the current and the second reference frames through the use of a feature point motion estimation circuit. Thereafter, the apparatus selects a set of motion vectors for the set of feature points from the first and the second sets of motion vectors to thereby generate a selected set of motion vectors and selection information representing the selected set as the encoded B frame.
摘要:
A method for adaptively selecting feature points according to motions of objects between the current frame and the previous frame, comprising the steps of: subtracting a pixel value of the previous frame from a pixel value of the current frame to thereby provide a differential pixel value; comparing the differential pixel value with a first threshold value TH1 and selecting processing regions having differential pixel values larger than the first threshold value TH1; detecting a motion vector for each pixel of the processing region in the previous frame; providing a pixel value of the current frame based on the motion vector for each pixel of the processing region in the previous frame; subtracting a pixel value of said each pixel of the processing region in the previous frame from the pixel value of the current frame to thereby provide an error pixel value; comparing the error pixel value with a second threshold value TH2 and selecting grid regions having error pixel values larger than the second threshold value TH2; generating a first grid for each of the processing regions in the previous frame and generating a second grid for said each of the grid regions; detecting edge points from the previous frame; and selecting, as the feature points, a multiplicity of pixels in the previous frame based on the first and the second grids and the edge points.
摘要:
In a three-dimensional model-based coding system, a shape of mouth, especially, a shape of lip, among a shape of face, has an important visual information. As each person has the different shape of lip, however, it is desirable to transform the shape of lip based on an individual person. First, a characteristic three-dimensional lip model based on the individual person is generated by matching a three-dimensional lip model to a basic lip of the individual person. Subsequently, the characteristic three-dimensional lip model is converted into a two-dimensional lip model and rotated according an angle of a actual lip image. In case that the person is talking, the two-dimensional lip model is reconstructed based on a recognized syllable. Finally, a difference between the two-dimensional lip model and the actual lip image is calculated. As not a whole real lip image but the difference between the actual image and the two-dimensional lip model similar to the actual lip image is transmitted, the amount of transmission data can be reduced.