-
公开(公告)号:US20170309297A1
公开(公告)日:2017-10-26
申请号:US15135671
申请日:2016-04-22
Applicant: XEROX CORPORATION
Inventor: Harish Arsikere , Arunasish Sen , Prathosh Aragulla Prasad
CPC classification number: G10L25/51 , G10L25/18 , G10L25/21 , G10L25/87 , G10L25/90 , G10L25/93 , G10L2025/932 , G10L2025/937
Abstract: The disclosed embodiments illustrate a method for classifying one or more audio segments of an audio signal. The method includes determining one or more first features of a first audio segment of the one or more audio segments. The method further includes determining one or more second features based on the one or more first features. The method includes determining one or more third features of the first audio segment, wherein each of the one or more third features is determined based on a second feature of the one or more second features of the first audio segment and at least one second feature associated with a second audio segment. Additionally, the method includes classifying the first audio segment either in an interrogative category or a non-interrogative category based on one or more of the one or more second features and the one or more third features.
-
公开(公告)号:US09785834B2
公开(公告)日:2017-10-10
申请号:US14798499
申请日:2015-07-14
Applicant: XEROX CORPORATION
Inventor: Arijit Biswas , Harish Arsikere , Kundan Shrivastava , Om D Deshmukh
CPC classification number: G06K9/00577 , G06F17/3079 , G06K9/00302 , G06K9/00362 , G06K9/00751
Abstract: According to embodiments illustrated herein, a method and system is provided for indexing a multimedia content. The method includes extracting, by one or more processors, a set of frames from the multimedia content, wherein the set of frames comprises at least one of a human object and an inanimate object. Thereafter, a body language information pertaining to the human object is determined from the set of frames by utilizing one or more image processing techniques. Further, an interaction information is determined from the set of frames. The interaction information is indicative of an action performed by the human object on the inanimate object. Thereafter, the multimedia content is indexed in a content database based at least on the body language information and the interaction information.
-
公开(公告)号:US20170017838A1
公开(公告)日:2017-01-19
申请号:US14798499
申请日:2015-07-14
Applicant: XEROX CORPORATION
Inventor: Arijit Biswas , Harish Arsikere , Kundan Shrivastava , Om D. Deshmukh
CPC classification number: G06K9/00577 , G06F17/3079 , G06K9/00302 , G06K9/00362 , G06K9/00751
Abstract: According to embodiments illustrated herein, a method and system is provided for indexing a multimedia content. The method includes extracting, by one or more processors, a set of frames from the multimedia content, wherein the set of frames comprises at least one of a human object and an inanimate object. Thereafter, a body language information pertaining to the human object is determined from the set of frames by utilizing one or more image processing techniques. Further, an interaction information is determined from the set of frames. The interaction information is indicative of an action performed by the human object on the inanimate object. Thereafter, the multimedia content is indexed in a content database based at least on the body language information and the interaction information.
Abstract translation: 根据本文所示的实施例,提供了一种用于索引多媒体内容的方法和系统。 该方法包括由一个或多个处理器提取来自多媒体内容的一组帧,其中该组帧包括人类对象和无生命对象中的至少一个。 此后,通过利用一个或多个图像处理技术从该组帧确定与人类对象有关的身体语言信息。 此外,从该组帧确定交互信息。 交互信息指示人类对象在无生命物体上执行的动作。 此后,多媒体内容至少基于身体语言信息和交互信息被索引到内容数据库中。
-
-