-
公开(公告)号:US09799325B1
公开(公告)日:2017-10-24
申请号:US15098343
申请日:2016-04-14
Applicant: XEROX CORPORATION
Inventor: Vivek Tyagi , Prathosh Aragulla Prasad
CPC classification number: G10L15/14 , G10L15/142 , G10L2015/022 , G10L2015/088
Abstract: The disclosed embodiments relate to a method of keyword recognition in a speech signal. The method includes determining a first likelihood score and a second likelihood score of one or more features of a frame of said speech signal being associated with one or more states in a first model and one or more states in a second model, respectively. The one or more states in the first model corresponds to one or more tied triphone states and the one or more states in the second model corresponds to one or more monophone states of a keyword to be recognized in the speech signal. The method further includes determining a third likelihood score based on the first likelihood score and the second likelihood score. The first likelihood score and the third likelihood score are utilizable to determine presence of the keyword in the speech signal.
-
公开(公告)号:US20170309297A1
公开(公告)日:2017-10-26
申请号:US15135671
申请日:2016-04-22
Applicant: XEROX CORPORATION
Inventor: Harish Arsikere , Arunasish Sen , Prathosh Aragulla Prasad
CPC classification number: G10L25/51 , G10L25/18 , G10L25/21 , G10L25/87 , G10L25/90 , G10L25/93 , G10L2025/932 , G10L2025/937
Abstract: The disclosed embodiments illustrate a method for classifying one or more audio segments of an audio signal. The method includes determining one or more first features of a first audio segment of the one or more audio segments. The method further includes determining one or more second features based on the one or more first features. The method includes determining one or more third features of the first audio segment, wherein each of the one or more third features is determined based on a second feature of the one or more second features of the first audio segment and at least one second feature associated with a second audio segment. Additionally, the method includes classifying the first audio segment either in an interrogative category or a non-interrogative category based on one or more of the one or more second features and the one or more third features.
-
3.
公开(公告)号:US20170278009A1
公开(公告)日:2017-09-28
申请号:US15077049
申请日:2016-03-22
Applicant: XEROX CORPORATION
Inventor: Vaibhav Rajan , Sakyajit Bhattacharya , Vijay Huddar , Abhishek Sengupta , James D. Kirkendall , Stephen Fullerton , Katerina Sinclair , Bhupendra Singh Solanki , Prathosh Aragulla Prasad
CPC classification number: G06N20/00 , G06F16/24578 , G06N7/005 , H04W4/70
Abstract: A method and a system for predicting admission of a human subject to a first ward in a medical center are disclosed. A patient dataset is generated based on at least a measure of one or more physiological parameters associated with one or more first human subjects and a first information pertaining to the admission of each of the one or more first human subjects to the first ward. For a first human subject of the one or more first human subjects, a first score at each of the one or more first time instants is determined. Further, one or more second time instants from the one or more first time instants are identified. Further, a second score at each of the one or more second time instants is determined. In an embodiment, the first classifier is trained based on at least the second score, and the first information.
-
公开(公告)号:US20170301341A1
公开(公告)日:2017-10-19
申请号:US15098343
申请日:2016-04-14
Applicant: XEROX CORPORATION
Inventor: Vivek Tyagi , Prathosh Aragulla Prasad
CPC classification number: G10L15/14 , G10L15/142 , G10L2015/022 , G10L2015/088
Abstract: The disclosed embodiments relate to a method of keyword recognition in a speech signal. The method includes determining a first likelihood score and a second likelihood score of one or more features of a frame of said speech signal being associated with one or more states in a first model and one or more states in a second model, respectively. The one or more states in the first model corresponds to one or more tied triphone states and the one or more states in the second model corresponds to one or more monophone states of a keyword to be recognized in the speech signal. The method further includes determining a third likelihood score based on the first likelihood score and the second likelihood score. The first likelihood score and the third likelihood score are utilizable to determine presence of the keyword in the speech signal.
-
公开(公告)号:US09861302B1
公开(公告)日:2018-01-09
申请号:US15197080
申请日:2016-06-29
Applicant: Xerox Corporation
Inventor: Avishek Chatterjee , Prathosh Aragulla Prasad , Pragathi Praveena
CPC classification number: A61B5/113 , A61B5/0077 , A61B5/1128 , G06T7/20 , G06T2207/10016 , G06T2207/10024 , G06T2207/30004 , G06T2207/30196
Abstract: What is disclosed is a system and method for determining a respiration rate from a video of a subject breathing. One embodiment of the present method involves the following. A video is received of a subject breathing which comprises a first portion of N image frames, and a second portion of M image frames, N+M=T and N≧10 seconds of video. For each image frame of the first portion, flow vectors Ft are determined for each (x,y) pixel location. A correlated flow field V is then calculated for the first portion of video. For each image frame of the second portion, flow vectors Ft(x,y) are determined for each (x,y) pixel location and a projection of Ft along V is calculated to obtain a velocity of thoracoabdominal motion in the direction of V. The velocity is integrated to obtain an integrated signal. Respiration rate is determined from the integrated signal.
-
公开(公告)号:US20170286569A1
公开(公告)日:2017-10-05
申请号:US15083466
申请日:2016-03-29
Applicant: Xerox Corporation
Inventor: Abhishek Sengupta , Prathosh Aragulla Prasad , Satya Narayan Shukla , Vaibhav Rajan , Katerina Sinclair , Stephen Fullerton
CPC classification number: G06F17/5009 , G06F17/18 , G06F2217/10 , G06K9/00536 , G06N5/04 , G06N20/00
Abstract: Systems and methods of modeling irregularly sampled time series signals with unknown temporal dynamics are disclosed wherein a temporal difference variable (TDV) is introduced to model irregular time differences between subsequent measurements. A hierarchical model is designed comprising two linear dynamical systems that model the effects of evolving TDV on temporal observations. All the parameters of the model, including the temporal dynamics are statistically estimated using historical data.
-
公开(公告)号:US20170206915A1
公开(公告)日:2017-07-20
申请号:US15000068
申请日:2016-01-19
Applicant: XEROX CORPORATION
Inventor: Prathosh Aragulla Prasad , Vivek Tyagi
Abstract: A method and a system for detecting sentiment of a human based on an analysis of human speech are disclosed. In an embodiment, one or more time instances of glottal closure are determined from a speech signal of the human. A voice source signal based on the determined one or more time instances of glottal closure is generated. A set of relative harmonic strengths is determined based on one or more harmonic contours of the voice source signal. The RHS is indicative of a deviation of the one or more harmonics of the voice source signal from a fundamental frequency of the voice source signal. A set of feature vectors is determined based on the RHS. The set of feature vectors are utilizable to detect the sentiment of the human.
-
-
-
-
-
-