Method and System for Association and Decision Fusion of Multimodal Inputs
    5.
    发明申请
    Method and System for Association and Decision Fusion of Multimodal Inputs 有权
    多模态输入的关联和决策融合方法与系统

    公开(公告)号:US20120290526A1

    公开(公告)日:2012-11-15

    申请号:US13219345

    申请日:2011-08-26

    IPC分类号: G06N5/02

    摘要: A computer-based system and method to improve the multimodal fusion output at the decision level is disclosed. The method proposes computation of a confidence weighted measure for the individual score values obtained for each modality and fuse these new updated scores to get the final decision. These confidence weights are the performance parameters (measured in terms of F-measure) during the offline training step. The process significantly increases the accuracy of the multimodal system.

    摘要翻译: 公开了一种基于计算机的系统和方法,用于在决策级别改进多模态融合输出。 该方法提出了对于每个模态获得的各个得分值的置信度加权度量的计算,并且将这些新的更新得分融合以获得最终决定。 这些置信度权重是在离线训练步骤期间的性能参数(以F度测量)。 该过程显着提高了多模态系统的准确性。

    System and method for human detection and counting using background modeling, HOG and Haar features
    7.
    发明授权
    System and method for human detection and counting using background modeling, HOG and Haar features 有权
    使用背景建模,HOG和Haar功能进行人体检测和计数的系统和方法

    公开(公告)号:US09001199B2

    公开(公告)日:2015-04-07

    申请号:US13160743

    申请日:2011-06-15

    IPC分类号: H04N9/47 G06K9/00

    CPC分类号: G06K9/00369

    摘要: A system for adaptive learning based human detection for channel input of captured human image signals, the system comprising: a sensor for tracking real-time images of an environment of interest; a feature extraction and classifiers generation processor for extracting a plurality of features and classifying the features associated with time-space descriptors of image comprising background modeling, Histogram of Oriented Gradients (HOG) and Haar like wavelet; a processor configured to process extracted feature classifiers associated with plurality of real-time images; combine the plurality of feature classifiers of time-space descriptors; evaluate a linear probability of human detection based on a predetermined threshold value of the feature classifiers in a time window having at least one image frame; a counter for counting the number of humans in the real-time images; and a transmission device configured to send the final human detection decision and number thereof to a storage device.

    摘要翻译: 一种用于基于捕获的人类图像信号的信道输入的用于自适应学习的人类检测的系统,所述系统包括:用于跟踪感兴趣的环境的实时图像的传感器; 特征提取和分类器生成处理器,用于提取多个特征并对与图像的时间 - 空间描述符相关联的特征进行分类,该图像包括背景建模,定向梯度(HOG)直方图和哈尔像小波; 处理器,被配置为处理与多个实时图像相关联的提取的特征分类器; 组合时空描述符的多个要素分类器; 基于具有至少一个图像帧的时间窗中的特征分类器的预定阈值来评估人类检测的线性概率; 用于计数实时图像中的人数的计数器; 以及发送装置,被配置为将最终的人类检测决定及其数量发送到存储装置。

    METHOD AND SYSTEM FOR EMBEDDING METADATA IN MULTIPLEXED ANALOG VIDEOS BROADCASTED THROUGH DIGITAL BROADCASTING MEDIUM
    8.
    发明申请
    METHOD AND SYSTEM FOR EMBEDDING METADATA IN MULTIPLEXED ANALOG VIDEOS BROADCASTED THROUGH DIGITAL BROADCASTING MEDIUM 审中-公开
    用于通过数字广播介质广播的多路复用模拟视频中嵌入元数据的方法和系统

    公开(公告)号:US20140208379A1

    公开(公告)日:2014-07-24

    申请号:US14238728

    申请日:2012-08-23

    IPC分类号: H04N21/236

    摘要: A method and system for broadcast of additional content such as metadata required for client specific interactive application in an analog domain along with conventional audio, video and PSI or SI data is disclosed. The present invention enables transmission of encoded audio data or EPG data, timestamp information required for audio video synchronization referred to as metadata by embedding such metadata in the pixels of video pixels and then encoding by the standard video encoder to generate an encoded stream. The encoded stream is decoded using the standard video decoder at the receiving station to generate a Composite Video Blanking and Sync (CVBS) analog video signal. From the CVBS signal, the RGB or YUV pixels of the videos are extracted. Finally a data extractor module retrieves the embedded metadata from the RGB or YUV pixels.

    摘要翻译: 公开了一种用于广播附加内容的方法和系统,例如模拟域中的客户特定交互应用所需的元数据以及常规音频,视频和PSI或SI数据。 本发明能够通过将视频像素的像素嵌入这些元数据,然后通过标准视频编码器进行编码以产生编码流,从而传送编码的音频数据或EPG数据,将音频视频同步所需的时间戳信息称为元数据。 在接收站使用标准视频解码器解码编码的流,以产生复合视频消隐和同步(CVBS)模拟视频信号。 从CVBS信号中,提取视频的RGB或YUV像素。 最后,数据提取器模块从RGB或YUV像素检索嵌入的元数据。

    Method and system for embedding metadata in multiplexed analog videos broadcasted through digital broadcasting medium

    公开(公告)号:US10097869B2

    公开(公告)日:2018-10-09

    申请号:US14238728

    申请日:2012-08-23

    IPC分类号: H04N21/236 H04N7/025 H04N7/08

    摘要: The present invention provides a method and system for broadcast of additional content such as metadata required for client specific interactive application in an analog domain along with conventional audio, video and PSI or SI data. The present invention enables transmission of encoded audio data or EPG data, timestamp information required for audio video synchronization referred to as metadata by embedding such metadata in the pixels of video pixels and then encoding by the standard video encoder to generate an encoded stream. The encoded stream is decoded using the standard video decoder at the receiving station to generate a Composite Video Blanking and Sync (CVBS) analog video signal. From the CVBS signal, the RGB or YUV pixels of the videos are extracted. Finally a data extractor module retrieves the embedded metadata from the RGB or YUV pixels.