摘要:
Simple, computational efficient, and robust audio features are applied in a uniform program indexing method for picking up video segments relating to highlight plays in a recorded program worthy of being reviewed. By focusing on certain frequencies in an audio sequence of the program, a computational complexity of the uniform program indexing method is significantly decreased. With the aid of MFCC coefficients and a DFBE coefficient generated from the MFCC coefficients, audio patterns may be utilized for differentiating exciting events in the program from other unnecessary information. Scores corresponding to various audio segments are regarded as standards for picking up video segments in the program worthy of being chosen in a recorded highlight collection. Some low-level-feature parameters, some video segments having highlight-related visual characteristics, and a re-ranking procedure are utilized for enhancing precision of the scores for providing video segments worthy of being reviewed.
摘要:
A camera motion parameter retrieving system and solving process begins with using the equation of that the vector field divergence volume integral is equal to the total outgoing throughput passing through the surface area of the volume through the vector in conjunction with camera motion detection to solve the Motion Vectors (MVs) in the domain, followed by operation to further solve camera motion parameter respectively contains that for PAN, TILT and ZOOM while the value of the parameter represents the motion value; upon obtaining those three parameters, the moving direction of the camera motion of the present image being indicated after simple addition and subtraction operation.
摘要:
A method of generating pixel data of a missing block in an image frame is disclosed. Edge points are detected from neighboring image side adjacent to the missing block. A direction is calculated for each edge point. Edge lines are formed from edge points based on the direction thereof to partition the missing block into a plurality of missing regions. Data for missing pixels in each missing region are then calculated using reference pixels from neighboring image sides adjacent to the missing region.
摘要:
A method of indexing final pitching shots for each batter in a video recording of a baseball game is disclosed. The method includes locating pitching video frames in the video, identifying individual pitching shots contained in the video, determining which of the pitching shots is a final pitching shot for each batter in the baseball game, and creating an index of the final pitching shots.
摘要:
A method of generating pixel data of a missing block in an image frame is disclosed. Edge points are detected from neighboring image side adjacent to the missing block. A direction is calculated for each edge point. Edge lines are formed from edge points based on the direction thereof to partition the missing block into a plurality of missing regions. Data for missing pixels in each missing region are then calculated using reference pixels from neighboring image sides adjacent to the missing region.
摘要:
An apparatus for detecting highlights of a media stream, the apparatus including: a video processing module, an audio processing module, a shot change detector, and a post processor. The video processing module determines a video threshold value; the audio processing module determines at least one audio threshold value; the shot change detector is electrically connected to the video processing module and the audio processing module for deciding a shot change to inform the video processing module and the audio processing module; and the post processor is electrically connected to the video processing module and the audio processing module for determining video highlights according to video parameters and the video threshold value, and audio highlights according to audio parameters and the audio threshold value, and then deciding the highlights of the media stream according to the video highlights and the audio highlights.
摘要:
A method and apparatus for encrypting and decrypting digital data employing multiple Huffman tables and at least one encryption key to enhance security of the digital data. At least one image parameter for characterizing the digital data, such as a motion vector table or DC-luminance, is selected as an image parameter. All possible Huffman tables according to the image parameter are then generated by Huffman tree mutation. A predetermined number of active Huffman tables from all possible Huffman tables are selected using a first encryption key and a hash function. Afterward, a coding sequence for the active Huffman tables is generated using a second encryption key and the hash function. Finally, the digital data is encrypted into an encrypted bit stream by the active Huffman tables with the coding sequence. Encrypted symbols of the image parameter can be reduced by symbol statistic analysis, thus reducing computation effort.
摘要:
A camera motion parameter retrieving system and solving process begins with using the equation of that the vector field divergence volume integral is equal to the total outgoing throughput passing through the surface area of the volume through the vector in conjunction with camera motion detection to solve the Motion Vectors (MVs) in the domain, followed by operation to further solve camera motion parameter respectively contains that for PAN, TILT and ZOOM while the value of the parameter represents the motion value; upon obtaining those three parameters, the moving direction of the camera motion of the present image being indicated after simple addition and subtraction operation.
摘要:
A method of identifying a target synchronization point pair including a source synchronization point corresponding to a source end and a destination synchronization point corresponding to a destination end is provided. The method includes receiving a source scan-line image corresponding to a first video source from the source end; selecting at least a comparing frame out of the source scan-line image; providing a destination scan-line image corresponding to a second video source at the destination end; and determining the target synchronization point pair by comparing the comparing frame and the destination scan-line image.
摘要:
A method to detect anchorperson segment in news reporting by using visual characteristics to provide the basis to divide news into various categories includes steps of providing news image for skin color detection on the image with color space; applying morphology depending on whether the object in the image subject to skin color detection is moving to eliminate noise surrounding the image of the face to solve the region of the face of the anchorperson; and performing anchorperson detection once again by detecting the probable anchorperson segment.