-
公开(公告)号:US11669743B2
公开(公告)日:2023-06-06
申请号:US16874478
申请日:2020-05-14
申请人: Niamul Quader , Juwei Lu , Peng Dai , Wei Li
发明人: Niamul Quader , Juwei Lu , Peng Dai , Wei Li
IPC分类号: H04N21/462 , G06K9/62 , H04N21/466 , G06N3/08 , G06V20/40 , G06N3/04 , H04N21/4402 , G06F9/50
CPC分类号: H04N21/4621 , G06F9/5055 , G06K9/6277 , G06N3/0454 , G06N3/08 , G06V20/41 , H04N21/440227 , H04N21/4666 , G06V20/44
摘要: An adaptive action recognizer for video that performs multiscale spatiotemporal decomposition of video to generate lower complexity video. The adaptive action recognizer has a number of processing pathways, one for each level of video complexity with each processing pathway having a different computational cost. The adaptive action recognizer applies a decision making scheme that encourages using low average computational costs while retaining high accuracy.
-
公开(公告)号:US11902548B2
公开(公告)日:2024-02-13
申请号:US17203613
申请日:2021-03-16
申请人: Deepak Sridhar , Niamul Quader , Srikanth Muralidharan , Yaoxin Li , Juwei Lu , Peng Dai
发明人: Deepak Sridhar , Niamul Quader , Srikanth Muralidharan , Yaoxin Li , Juwei Lu , Peng Dai
摘要: Systems, methods, and computer media of processing a video are disclosed. An example method may include: receiving a plurality of video frames of a video; generating a plurality of first input features based on the plurality of video frames; generating a plurality of second input features based on reversing a temporal order of the plurality of first input features; generating a first set of joint attention features based on the plurality of first input features; generating a second set of joint attention features based on the plurality of second input features; and concatenating the first set of joint attention features and the second set of joint attention features to generate a final set of joint attention features.
-
公开(公告)号:US11698926B2
公开(公告)日:2023-07-11
申请号:US17524862
申请日:2021-11-12
申请人: Arnab Kumar Mondal , Deepak Sridhar , Niamul Quader , Juwei Lu , Peng Dai , Chao Xing
发明人: Arnab Kumar Mondal , Deepak Sridhar , Niamul Quader , Juwei Lu , Peng Dai , Chao Xing
IPC分类号: G06F16/30 , G06F16/732 , G06N3/04 , G06F16/783 , G06V20/40
CPC分类号: G06F16/7343 , G06F16/783 , G06N3/04 , G06V20/40
摘要: Methods and systems are described for performing video retrieval together with video grounding. A word-based query for a video is and encoded into a query representation using a trained query encoder. One or more similar video representations are identified, from a plurality of video representations that are similar to the query representation. Each similar video representation represents a respective relevant video. A grounding is generated for each relevant video by forward propagating each respective similar video representation together with the query representation through a trained grounding module. The relevant videos or identifiers of the relevant videos are outputted together with the grounding generated for each relevant video.
-
公开(公告)号:US11023710B2
公开(公告)日:2021-06-01
申请号:US16280760
申请日:2019-02-20
申请人: Peng Dai , Juwei Lu , Bharath Sekar , Wei Li , Jianpeng Xu , Ruiwen Li
发明人: Peng Dai , Juwei Lu , Bharath Sekar , Wei Li , Jianpeng Xu , Ruiwen Li
摘要: System and method for classifying data objects occurring in an unstructured dataset, comprising: extracting feature vectors from the unstructured dataset, each feature vector representing an occurrence of a data object in the unstructured dataset; classifying the feature vectors into feature vector sets that each correspond to a respective object class from a plurality of object classes; for each feature vector set: performing multiple iterations of a clustering operation, each iteration including clustering feature vectors from the feature vector set into clusters of similar feature vectors and identifying outlier feature vectors, wherein for at least one iteration after a first iteration of the clustering operation, outlier feature vectors identified in a previous iteration are excluded from the clustering operation; and outputting a key cluster for the feature vector set from a final iteration of the multiple iterations, the key cluster including a greater number of similar feature vectors than any of the other clusters of the final iteration; and assembling a dataset that includes at least the feature vectors from the key clusters of the feature vector sets.
-
公开(公告)号:US20200265218A1
公开(公告)日:2020-08-20
申请号:US16280760
申请日:2019-02-20
申请人: Peng Dai , Juwei Lu , Bharath Sekar , Wei Li , Jianpeng Xu , Ruiwen Li
发明人: Peng Dai , Juwei Lu , Bharath Sekar , Wei Li , Jianpeng Xu , Ruiwen Li
摘要: System and method for classifying data objects occurring in an unstructured dataset, comprising: extracting feature vectors from the unstructured dataset, each feature vector representing an occurrence of a data object in the unstructured dataset; classifying the feature vectors into feature vector sets that each correspond to a respective object class from a plurality of object classes; for each feature vector set: performing multiple iterations of a clustering operation, each iteration including clustering feature vectors from the feature vector set into clusters of similar feature vectors and identifying outlier feature vectors, wherein for at least one iteration after a first iteration of the clustering operation, outlier feature vectors identified in a previous iteration are excluded from the clustering operation; and outputting a key cluster for the feature vector set from a final iteration of the multiple iterations, the key cluster including a greater number of similar feature vectors than any of the other clusters of the final iteration; and assembling a dataset that includes at least the feature vectors from the key clusters of the feature vector sets.
-
公开(公告)号:US11195046B2
公开(公告)日:2021-12-07
申请号:US16441918
申请日:2019-06-14
申请人: Varshanth Ravindra Rao , Uzair Ahmad , Peng Dai , Juwei Lu , Wei Li , Jianpeng Xu
发明人: Varshanth Ravindra Rao , Uzair Ahmad , Peng Dai , Juwei Lu , Wei Li , Jianpeng Xu
IPC分类号: G06K9/46 , G06K9/40 , G06F16/538 , G06N20/00 , G06T3/40
摘要: Methods and systems for processing an image are described. A saliency map is generated from the image. The saliency map indicates one or more salient portions of the image that have saliency values satisfying a saliency criterion. A scene graph is generated for at least the one or more salient portions. The scene graph represents a plurality of objects detected in the image. The scene graph further represents one or more relationships between each respective object pairs. One or more dataset entries associated with the image are generated. Each of the one or more relationships for each of the one or more object pairs is indicated by a respective dataset entry. The one or more dataset entries are stored in a first dataset.
-
公开(公告)号:US10979761B2
公开(公告)日:2021-04-13
申请号:US16279230
申请日:2019-02-19
申请人: Bharath Sekar , Juwei Lu , Peng Dai , Wei Li , Jianpeng Xu
发明人: Bharath Sekar , Juwei Lu , Peng Dai , Wei Li , Jianpeng Xu
IPC分类号: H04N21/433 , H04N21/4545 , H04N21/4722 , H04N21/4402 , H04N21/234
摘要: A method and system for displaying data content associated with a video, comprising: receiving video data for a video at the user equipment; playing the video in a user interface rendered on a display device of the user equipment; pausing the video at a selected frame; accessing, based on the selected frame, content data associated with the selected frame; and displaying information about the content data associated with the selected frame in the user interface.
-
公开(公告)号:US20190289359A1
公开(公告)日:2019-09-19
申请号:US16279230
申请日:2019-02-19
申请人: Bharath Sekar , Juwei Lu , Peng Dai , Wei Li , Jianpeng Xu
发明人: Bharath Sekar , Juwei Lu , Peng Dai , Wei Li , Jianpeng Xu
IPC分类号: H04N21/433 , H04N21/4545 , H04N21/234 , H04N21/4402 , H04N21/4722
摘要: A method and system for displaying data content associated with a video, comprising: receiving video data for a video at the user equipment; playing the video in a user interface rendered on a display device of the user equipment; pausing the video at a selected frame; accessing, based on the selected frame, content data associated with the selected frame; and displaying information about the content data associated with the selected frame in the user interface.
-
公开(公告)号:US10963702B1
公开(公告)日:2021-03-30
申请号:US16566179
申请日:2019-09-10
申请人: Ruiwen Li , Peng Dai , Varshanth Ravindra Rao , Juwei Lu , Wei Li , Jianpeng Xu
发明人: Ruiwen Li , Peng Dai , Varshanth Ravindra Rao , Juwei Lu , Wei Li , Jianpeng Xu
IPC分类号: G06K9/00 , G11B27/031 , G11B27/30 , G06F40/205
摘要: Methods and systems for video segmentation and scene recognition are described. A video having a plurality of frames and a subtitle file associated with the video are received. Segmentation is performed on the video to generate a first set video frames comprising one or more video frames based on a frame-by-frame comparison of features in the frames of the video. Each video frame in the first includes a frame indicator which indicates at least a first start frame of the video frame. The subtitle file associated with the video is parsed to generate one or more subtitle segments based on a start and an end time of each dialogue in the subtitle file. A second set of video frames comprising one or more second video frames are generated based on the video frames of the first set of video frames and the e or more subtitle segments.
-
公开(公告)号:US12001613B2
公开(公告)日:2024-06-04
申请号:US17827939
申请日:2022-05-30
申请人: Juwei Lu , Sayem Mohammad Siam , Wei Zhou , Peng Dai , Xiaofei Wu , Songcen Xu
发明人: Juwei Lu , Sayem Mohammad Siam , Wei Zhou , Peng Dai , Xiaofei Wu , Songcen Xu
摘要: Methods and systems for gesture-based control of a device are described. A virtual gesture-space is determined in a received input frame. The virtual gesture-space is associated with a primary user from a ranked user list of users. The received input frame is processed in only the virtual gesture-space, to detect and track a hand. Using a hand bounding box generated by detecting and tracking the hand, gesture classification is performed to determine a gesture input associated with the hand. A command input associated with the determined gesture input is processed. The device may be a smart television, a smart phone, a tablet, etc.
-
-
-
-
-
-
-
-
-