专利检索 ap:("Rohit Kumar Gupta" OR "Sandeep Gattani" OR "Aniruddha Sinha" OR "Ayan Chaki" OR "Arpan Pal") AND inv:"Aniruddha Sinha" 第 1 页

1.

发明申请
Method and System for Association and Decision Fusion of Multimodal Inputs 有权
标题翻译：多模态输入的关联和决策融合方法与系统

公开(公告)号：US20120290526A1

公开(公告)日：2012-11-15

申请号：US13219345

申请日：2011-08-26

申请人： Rohit Kumar Gupta , Sandeep Gattani , Aniruddha Sinha , Ayan Chaki , Arpan Pal

发明人： Rohit Kumar Gupta , Sandeep Gattani , Aniruddha Sinha , Ayan Chaki , Arpan Pal

IPC分类号： G06N5/02

CPC分类号： G06N7/00 , G06K9/6293 , G06N99/005

摘要： A computer-based system and method to improve the multimodal fusion output at the decision level is disclosed. The method proposes computation of a confidence weighted measure for the individual score values obtained for each modality and fuse these new updated scores to get the final decision. These confidence weights are the performance parameters (measured in terms of F-measure) during the offline training step. The process significantly increases the accuracy of the multimodal system.

摘要翻译： 公开了一种基于计算机的系统和方法，用于在决策级别改进多模态融合输出。该方法提出了对于每个模态获得的各个得分值的置信度加权度量的计算，并且将这些新的更新得分融合以获得最终决定。这些置信度权重是在离线训练步骤期间的性能参数（以F度测量）。该过程显着提高了多模态系统的准确性。

2.

发明授权
Method and system for association and decision fusion of multimodal inputs 有权
标题翻译：多模态输入的关联和决策融合的方法和系统

公开(公告)号：US08700557B2

公开(公告)日：2014-04-15

申请号：US13219345

申请日：2011-08-26

申请人： Rohit Kumar Gupta , Sandeep Gattani , Aniruddha Sinha , Ayan Chaki , Arpan Pal

发明人： Rohit Kumar Gupta , Sandeep Gattani , Aniruddha Sinha , Ayan Chaki , Arpan Pal

IPC分类号： G06N5/02

CPC分类号： G06N7/00 , G06K9/6293 , G06N99/005

摘要： A computer-based system and method to improve the multimodal fusion output at the decision level is disclosed. The method proposes computation of a confidence weighted measure for the individual score values obtained for each modality and fuse these new updated scores to get the final decision. These confidence weights are the performance parameters (measured in terms of F-measure) during the offline training step. The process significantly increases the accuracy of the multimodal system.

摘要翻译： 公开了一种基于计算机的系统和方法，用于在决策级别改进多模态融合输出。该方法提出了对于每个模态获得的各个得分值的置信度加权度量的计算，并且将这些新的更新得分融合以获得最终决定。这些置信度权重是在离线训练步骤期间的性能参数（以F度测量）。该过程显着提高了多模态系统的准确性。

3.

发明授权
System and method for human detection and counting using background modeling, HOG and Haar features 有权
标题翻译：使用背景建模，HOG和Haar功能进行人体检测和计数的系统和方法

公开(公告)号：US09001199B2

公开(公告)日：2015-04-07

申请号：US13160743

申请日：2011-06-15

申请人： Aniruddha Sinha , Rohit Gupta , Ayan Chaki , Arpan Pal

发明人： Aniruddha Sinha , Rohit Gupta , Ayan Chaki , Arpan Pal

IPC分类号： H04N9/47 , G06K9/00

CPC分类号： G06K9/00369

摘要： A system for adaptive learning based human detection for channel input of captured human image signals, the system comprising: a sensor for tracking real-time images of an environment of interest; a feature extraction and classifiers generation processor for extracting a plurality of features and classifying the features associated with time-space descriptors of image comprising background modeling, Histogram of Oriented Gradients (HOG) and Haar like wavelet; a processor configured to process extracted feature classifiers associated with plurality of real-time images; combine the plurality of feature classifiers of time-space descriptors; evaluate a linear probability of human detection based on a predetermined threshold value of the feature classifiers in a time window having at least one image frame; a counter for counting the number of humans in the real-time images; and a transmission device configured to send the final human detection decision and number thereof to a storage device.

摘要翻译： 一种用于基于捕获的人类图像信号的信道输入的用于自适应学习的人类检测的系统，所述系统包括：用于跟踪感兴趣的环境的实时图像的传感器; 特征提取和分类器生成处理器，用于提取多个特征并对与图像的时间 - 空间描述符相关联的特征进行分类，该图像包括背景建模，定向梯度（HOG）直方图和哈尔像小波; 处理器，被配置为处理与多个实时图像相关联的提取的特征分类器; 组合时空描述符的多个要素分类器; 基于具有至少一个图像帧的时间窗中的特征分类器的预定阈值来评估人类检测的线性概率; 用于计数实时图像中的人数的计数器; 以及发送装置，被配置为将最终的人类检测决定及其数量发送到存储装置。

4.

发明申请
SYSTEM AND METHOD FOR HUMAN DETECTION AND COUNTING USING BACKGROUND MODELING, HOG AND HAAR FEATURES 有权
标题翻译：用于人类检测和计数的系统和方法，使用背景建模，HOG和HAAR特征

公开(公告)号：US20120274755A1

公开(公告)日：2012-11-01

申请号：US13160743

申请日：2011-06-15

申请人： Aniruddha Sinha , Rohit Gupta , Ayan Chaki , Arpan Pal

发明人： Aniruddha Sinha , Rohit Gupta , Ayan Chaki , Arpan Pal

IPC分类号： G06K9/00 , H04N7/18

CPC分类号： G06K9/00369

摘要： A system for adaptive learning based human detection for channel input of captured human image signals, the system comprising: a sensor for tracking real-time images of an environment of interest; a feature extraction and classifiers generation processor for extracting a plurality of features and classifying the features associated with time-space descriptors of image comprising background modeling, Histogram of Oriented Gradients (HOG) and Haar like wavelet; a processor configured to process extracted feature classifiers associated with plurality of real-time images; combine the plurality of feature classifiers of time-space descriptors; evaluate a linear probability of human detection based on a predetermined threshold value of the feature classifiers in a time window having at least one image frame; a counter for counting the number of humans in the real-time images; and a transmission device configured to send the final human detection decision and number thereof to a storage device.

摘要翻译： 一种用于基于捕获的人类图像信号的信道输入的用于自适应学习的人类检测的系统，所述系统包括：用于跟踪感兴趣的环境的实时图像的传感器; 特征提取和分类器生成处理器，用于提取多个特征并对与图像的时间 - 空间描述符相关联的特征进行分类，该图像包括背景建模，定向梯度（HOG）直方图和哈尔像小波; 处理器，被配置为处理与多个实时图像相关联的提取的特征分类器; 组合时空描述符的多个要素分类器; 基于具有至少一个图像帧的时间窗中的特征分类器的预定阈值来评估人类检测的线性概率; 用于计数实时图像中的人数的计数器; 以及发送装置，被配置为将最终的人类检测决定及其数量发送到存储装置。

5.

发明申请
METHOD AND SYSTEM FOR PREPROCESSING THE REGION OF VIDEO CONTAINING TEXT 有权
标题翻译：用于预处理视频包含文本区域的方法和系统

公开(公告)号：US20120242897A1

公开(公告)日：2012-09-27

申请号：US13395754

申请日：2010-12-29

申请人： Tanushyam Chattopadhyay , Aniruddha Sinha , Arpan Pal

发明人： Tanushyam Chattopadhyay , Aniruddha Sinha , Arpan Pal

IPC分类号： H04N11/00

CPC分类号： G06K9/325 , G06K9/348 , G06K2209/01

摘要： A method and system for preprocessing text containing region of a video The invention provides a method and system for preprocessing the text containing region of video for improving the optical character recognition input.

摘要翻译： 一种用于预处理包含视频区域的文本的方法和系统本发明提供一种用于预处理包含视频区域的文本以改善光学字符识别输入的方法和系统。

6.

发明授权
Method for gender verification of individuals based on multimodal data analysis utilizing an individual's expression prompted by a greeting 有权
标题翻译：基于通过问候语提示的个人表情的基于多模态数据分析的个人性别验证方法

公开(公告)号：US09135562B2

公开(公告)日：2015-09-15

申请号：US14007421

申请日：2012-04-12

申请人： Aniruddha Sinha , Prateep Misra , Snehasis Banerjee , Arpan Pal

发明人： Aniruddha Sinha , Prateep Misra , Snehasis Banerjee , Arpan Pal

IPC分类号： G06N5/04 , G06N7/00 , G06Q10/10

CPC分类号： G06N5/048 , G06N7/005 , G06Q10/10

摘要： The system and method of the present invention are described for automatic detection of error in the entry of particular category of individuals, especially referring to gender and age classification either real time while creating a database of such information or on an existing database on the record of individuals by analyzing their biometric characteristics like speech, image or face and other related demographic information like name of the individual in order to accord each individual with a unique identification.

摘要翻译： 描述了本发明的系统和方法，用于自动检测特定类别个人的进入中的错误，特别是在创建这种信息的数据库的同时，或者在现有数据库的记录上实时地参考性别和年龄分类个人通过分析他们的生物特征，如语言，图像或面部和其他相关的人口信息，如个人的姓名，以使每个人具有唯一的身份。

7.

发明申请
METHOD AND SYSTEM FOR EMBEDDING METADATA IN MULTIPLEXED ANALOG VIDEOS BROADCASTED THROUGH DIGITAL BROADCASTING MEDIUM 审中-公开
标题翻译：用于通过数字广播介质广播的多路复用模拟视频中嵌入元数据的方法和系统

公开(公告)号：US20140208379A1

公开(公告)日：2014-07-24

申请号：US14238728

申请日：2012-08-23

申请人： Aniruddha Sinha , Arindam Saha , Arpan Pal

发明人： Aniruddha Sinha , Arindam Saha , Arpan Pal

IPC分类号： H04N21/236

CPC分类号： H04N21/23614 , H04N7/025 , H04N7/08

摘要： A method and system for broadcast of additional content such as metadata required for client specific interactive application in an analog domain along with conventional audio, video and PSI or SI data is disclosed. The present invention enables transmission of encoded audio data or EPG data, timestamp information required for audio video synchronization referred to as metadata by embedding such metadata in the pixels of video pixels and then encoding by the standard video encoder to generate an encoded stream. The encoded stream is decoded using the standard video decoder at the receiving station to generate a Composite Video Blanking and Sync (CVBS) analog video signal. From the CVBS signal, the RGB or YUV pixels of the videos are extracted. Finally a data extractor module retrieves the embedded metadata from the RGB or YUV pixels.

摘要翻译： 公开了一种用于广播附加内容的方法和系统，例如模拟域中的客户特定交互应用所需的元数据以及常规音频，视频和PSI或SI数据。本发明能够通过将视频像素的像素嵌入这些元数据，然后通过标准视频编码器进行编码以产生编码流，从而传送编码的音频数据或EPG数据，将音频视频同步所需的时间戳信息称为元数据。在接收站使用标准视频解码器解码编码的流，以产生复合视频消隐和同步（CVBS）模拟视频信号。从CVBS信号中，提取视频的RGB或YUV像素。最后，数据提取器模块从RGB或YUV像素检索嵌入的元数据。

8.

发明授权
Method and system for embedding metadata in multiplexed analog videos broadcasted through digital broadcasting medium 有权

公开(公告)号：US10097869B2

公开(公告)日：2018-10-09

申请号：US14238728

申请日：2012-08-23

申请人： Aniruddha Sinha , Arindam Saha , Arpan Pal

发明人： Aniruddha Sinha , Arindam Saha , Arpan Pal

IPC分类号： H04N21/236 , H04N7/025 , H04N7/08

摘要： The present invention provides a method and system for broadcast of additional content such as metadata required for client specific interactive application in an analog domain along with conventional audio, video and PSI or SI data. The present invention enables transmission of encoded audio data or EPG data, timestamp information required for audio video synchronization referred to as metadata by embedding such metadata in the pixels of video pixels and then encoding by the standard video encoder to generate an encoded stream. The encoded stream is decoded using the standard video decoder at the receiving station to generate a Composite Video Blanking and Sync (CVBS) analog video signal. From the CVBS signal, the RGB or YUV pixels of the videos are extracted. Finally a data extractor module retrieves the embedded metadata from the RGB or YUV pixels.

9.

发明授权
System and method for multiplexing video contents from multiple broadcasting channels into single broadcasting channel 有权
标题翻译：将来自多个广播频道的视频内容复用为单个广播频道的系统和方法

公开(公告)号：US09001276B2

公开(公告)日：2015-04-07

申请号：US14130103

申请日：2012-06-26

申请人： Arpan Pal , Aniruddha Sinha , Arindam Saha , Hiranmay Ghosh , Gautam Shroff

发明人： Arpan Pal , Aniruddha Sinha , Arindam Saha , Hiranmay Ghosh , Gautam Shroff

IPC分类号： H04N5/38 , H04N7/14 , G06F21/00 , H04N7/173 , H04N7/08 , H04N21/2343 , H04N21/61 , H04N5/455

CPC分类号： H04N7/0806 , H04N5/38 , H04N5/455 , H04N21/234363 , H04N21/234381 , H04N21/6143

摘要： A method and system for multiplexing of multiple channels of video data through a single analog broadcasting channel is disclosed. The method enables a spatial and temporal multiplexing of videos of each of the multiple channels. The multiplexed content is created as a result of multiplexing that is encoded to generate digital transport stream that is transmitted through analog medium. The system enables a STB receiver to decode each of the videos from the stream. At least one video from the multiple videos is played on the television based on user selection.

摘要翻译： 公开了一种通过单个模拟广播信道复用多个视频数据通道的方法和系统。该方法能够对多个通道中的每一个的视频进行空间和时间复用。作为多路复用的结果产生多路复用的内容，其被编码以产生通过模拟媒体传输的数字传输流。该系统使得STB接收机能够从流中解码每个视频。基于用户选择，在电视上播放来自多个视频的至少一个视频。

10.

发明申请
SYSTEM AND METHOD FOR DEMOGRAPHIC ANALYTICS BASED ON MULTIMODAL INFORMATION 有权
标题翻译：基于多模态信息的人脸分析系统与方法

公开(公告)号：US20140025624A1

公开(公告)日：2014-01-23

申请号：US14007421

申请日：2012-04-12

申请人： Aniruddha Sinha , Prateep Misra , Snehasis Banerjee , Arpan Pal

发明人： Aniruddha Sinha , Prateep Misra , Snehasis Banerjee , Arpan Pal

IPC分类号： G06N5/04

CPC分类号： G06N5/048 , G06N7/005 , G06Q10/10

摘要： The system and method of the present invention are described for automatic detection of error in the entry of particular category of individuals, especially referring to gender and age classification either real time while creating a database of such information or on an existing database on the record of individuals by analyzing their biometric characteristics like speech, image or face and other related demographic information like name of the individual in order to accord each individual with a unique identification.

摘要翻译： 描述了本发明的系统和方法，用于自动检测特定类别个人的进入中的错误，特别是在创建这种信息的数据库的同时，或者在现有数据库的记录上实时地参考性别和年龄分类个人通过分析他们的生物特征，如语言，图像或面部和其他相关的人口信息，如个人的姓名，以使每个人具有唯一的身份。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类