摘要:
A method and system for preprocessing text containing region of a video The invention provides a method and system for preprocessing the text containing region of video for improving the optical character recognition input.
摘要:
A computerized method for recognition of a logo is described herein. The method comprises obtaining a plurality of feed frame of a feed video wherein the feed video has a logo embedded therein. At least one feed frame from the plurality of feed frames is compared with each template from a plurality of templates. For each template compared with the feed frame, a correlation parameter is computed and the logo is recognized based on the computing.
摘要:
A method and system for preprocessing text containing region of a video The invention provides a method and system for preprocessing the text containing region of video for improving the optical character recognition input.
摘要:
A computerized method for recognition of a logo is described herein. The method comprises obtaining a plurality of feed frame of a feed video wherein the feed video has a logo embedded therein. At least one feed frame from the plurality of feed frames is compared with each template from a plurality of templates. For each template compared with the feed frame, a correlation parameter is computed and the logo is recognized based on the computing.
摘要:
A computer-based system and method to improve the multimodal fusion output at the decision level is disclosed. The method proposes computation of a confidence weighted measure for the individual score values obtained for each modality and fuse these new updated scores to get the final decision. These confidence weights are the performance parameters (measured in terms of F-measure) during the offline training step. The process significantly increases the accuracy of the multimodal system.
摘要:
The system and method of the present invention are described for automatic detection of error in the entry of particular category of individuals, especially referring to gender and age classification either real time while creating a database of such information or on an existing database on the record of individuals by analyzing their biometric characteristics like speech, image or face and other related demographic information like name of the individual in order to accord each individual with a unique identification.
摘要:
A system for adaptive learning based human detection for channel input of captured human image signals, the system comprising: a sensor for tracking real-time images of an environment of interest; a feature extraction and classifiers generation processor for extracting a plurality of features and classifying the features associated with time-space descriptors of image comprising background modeling, Histogram of Oriented Gradients (HOG) and Haar like wavelet; a processor configured to process extracted feature classifiers associated with plurality of real-time images; combine the plurality of feature classifiers of time-space descriptors; evaluate a linear probability of human detection based on a predetermined threshold value of the feature classifiers in a time window having at least one image frame; a counter for counting the number of humans in the real-time images; and a transmission device configured to send the final human detection decision and number thereof to a storage device.
摘要:
A method and system for broadcast of additional content such as metadata required for client specific interactive application in an analog domain along with conventional audio, video and PSI or SI data is disclosed. The present invention enables transmission of encoded audio data or EPG data, timestamp information required for audio video synchronization referred to as metadata by embedding such metadata in the pixels of video pixels and then encoding by the standard video encoder to generate an encoded stream. The encoded stream is decoded using the standard video decoder at the receiving station to generate a Composite Video Blanking and Sync (CVBS) analog video signal. From the CVBS signal, the RGB or YUV pixels of the videos are extracted. Finally a data extractor module retrieves the embedded metadata from the RGB or YUV pixels.
摘要:
The present invention provides a method and system for broadcast of additional content such as metadata required for client specific interactive application in an analog domain along with conventional audio, video and PSI or SI data. The present invention enables transmission of encoded audio data or EPG data, timestamp information required for audio video synchronization referred to as metadata by embedding such metadata in the pixels of video pixels and then encoding by the standard video encoder to generate an encoded stream. The encoded stream is decoded using the standard video decoder at the receiving station to generate a Composite Video Blanking and Sync (CVBS) analog video signal. From the CVBS signal, the RGB or YUV pixels of the videos are extracted. Finally a data extractor module retrieves the embedded metadata from the RGB or YUV pixels.
摘要:
A method and system for multiplexing of multiple channels of video data through a single analog broadcasting channel is disclosed. The method enables a spatial and temporal multiplexing of videos of each of the multiple channels. The multiplexed content is created as a result of multiplexing that is encoded to generate digital transport stream that is transmitted through analog medium. The system enables a STB receiver to decode each of the videos from the stream. At least one video from the multiple videos is played on the television based on user selection.