-
公开(公告)号:US11900260B2
公开(公告)日:2024-02-13
申请号:US16810524
申请日:2020-03-05
申请人: Deepak Sridhar , Juwei Lu
发明人: Deepak Sridhar , Juwei Lu
摘要: Methods, devices and processor-readable media for an integrated teacher-student machine learning system. One or more teacher-student modules are trained as part of the teacher neural network training. Each student sub-network uses a portion of the teacher neural network to generate an intermediate feature map, then provides the intermediate feature map to a student sub-network to generate inferences. The student sub-network may use a feature enhancement block to map the intermediate feature map to a subsequent feature map. A compression block may be used to compress intermediate feature map data for transmission in some embodiments.
-
公开(公告)号:US11698926B2
公开(公告)日:2023-07-11
申请号:US17524862
申请日:2021-11-12
申请人: Arnab Kumar Mondal , Deepak Sridhar , Niamul Quader , Juwei Lu , Peng Dai , Chao Xing
发明人: Arnab Kumar Mondal , Deepak Sridhar , Niamul Quader , Juwei Lu , Peng Dai , Chao Xing
IPC分类号: G06F16/30 , G06F16/732 , G06N3/04 , G06F16/783 , G06V20/40
CPC分类号: G06F16/7343 , G06F16/783 , G06N3/04 , G06V20/40
摘要: Methods and systems are described for performing video retrieval together with video grounding. A word-based query for a video is and encoded into a query representation using a trained query encoder. One or more similar video representations are identified, from a plurality of video representations that are similar to the query representation. Each similar video representation represents a respective relevant video. A grounding is generated for each relevant video by forward propagating each respective similar video representation together with the query representation through a trained grounding module. The relevant videos or identifiers of the relevant videos are outputted together with the grounding generated for each relevant video.
-
公开(公告)号:US11902548B2
公开(公告)日:2024-02-13
申请号:US17203613
申请日:2021-03-16
申请人: Deepak Sridhar , Niamul Quader , Srikanth Muralidharan , Yaoxin Li , Juwei Lu , Peng Dai
发明人: Deepak Sridhar , Niamul Quader , Srikanth Muralidharan , Yaoxin Li , Juwei Lu , Peng Dai
摘要: Systems, methods, and computer media of processing a video are disclosed. An example method may include: receiving a plurality of video frames of a video; generating a plurality of first input features based on the plurality of video frames; generating a plurality of second input features based on reversing a temporal order of the plurality of first input features; generating a first set of joint attention features based on the plurality of first input features; generating a second set of joint attention features based on the plurality of second input features; and concatenating the first set of joint attention features and the second set of joint attention features to generate a final set of joint attention features.
-
公开(公告)号:US11810329B2
公开(公告)日:2023-11-07
申请号:US16953029
申请日:2020-11-19
申请人: Yuanhao Yu , Shuhao Li , Juwei Lu , Jin Tang
发明人: Yuanhao Yu , Shuhao Li , Juwei Lu , Jin Tang
摘要: Methods and systems for determining a surface color of a target surface under an environment with an environmental light source. A plurality of images of the target surface are captured as the target surface is illuminated with a variable intensity, constant color light source and a constant intensity, constant color environmental light source, wherein the intensity of the light source on the target surface is varied by a known amount between the capturing of the images. A color feature tensor, independent of the environmental light source, is extracted from the image data, and used to infer a surface color of the target surface.
-
公开(公告)号:US11669743B2
公开(公告)日:2023-06-06
申请号:US16874478
申请日:2020-05-14
申请人: Niamul Quader , Juwei Lu , Peng Dai , Wei Li
发明人: Niamul Quader , Juwei Lu , Peng Dai , Wei Li
IPC分类号: H04N21/462 , G06K9/62 , H04N21/466 , G06N3/08 , G06V20/40 , G06N3/04 , H04N21/4402 , G06F9/50
CPC分类号: H04N21/4621 , G06F9/5055 , G06K9/6277 , G06N3/0454 , G06N3/08 , G06V20/41 , H04N21/440227 , H04N21/4666 , G06V20/44
摘要: An adaptive action recognizer for video that performs multiscale spatiotemporal decomposition of video to generate lower complexity video. The adaptive action recognizer has a number of processing pathways, one for each level of video complexity with each processing pathway having a different computational cost. The adaptive action recognizer applies a decision making scheme that encourages using low average computational costs while retaining high accuracy.
-
公开(公告)号:US11023710B2
公开(公告)日:2021-06-01
申请号:US16280760
申请日:2019-02-20
申请人: Peng Dai , Juwei Lu , Bharath Sekar , Wei Li , Jianpeng Xu , Ruiwen Li
发明人: Peng Dai , Juwei Lu , Bharath Sekar , Wei Li , Jianpeng Xu , Ruiwen Li
摘要: System and method for classifying data objects occurring in an unstructured dataset, comprising: extracting feature vectors from the unstructured dataset, each feature vector representing an occurrence of a data object in the unstructured dataset; classifying the feature vectors into feature vector sets that each correspond to a respective object class from a plurality of object classes; for each feature vector set: performing multiple iterations of a clustering operation, each iteration including clustering feature vectors from the feature vector set into clusters of similar feature vectors and identifying outlier feature vectors, wherein for at least one iteration after a first iteration of the clustering operation, outlier feature vectors identified in a previous iteration are excluded from the clustering operation; and outputting a key cluster for the feature vector set from a final iteration of the multiple iterations, the key cluster including a greater number of similar feature vectors than any of the other clusters of the final iteration; and assembling a dataset that includes at least the feature vectors from the key clusters of the feature vector sets.
-
公开(公告)号:US20200265218A1
公开(公告)日:2020-08-20
申请号:US16280760
申请日:2019-02-20
申请人: Peng Dai , Juwei Lu , Bharath Sekar , Wei Li , Jianpeng Xu , Ruiwen Li
发明人: Peng Dai , Juwei Lu , Bharath Sekar , Wei Li , Jianpeng Xu , Ruiwen Li
摘要: System and method for classifying data objects occurring in an unstructured dataset, comprising: extracting feature vectors from the unstructured dataset, each feature vector representing an occurrence of a data object in the unstructured dataset; classifying the feature vectors into feature vector sets that each correspond to a respective object class from a plurality of object classes; for each feature vector set: performing multiple iterations of a clustering operation, each iteration including clustering feature vectors from the feature vector set into clusters of similar feature vectors and identifying outlier feature vectors, wherein for at least one iteration after a first iteration of the clustering operation, outlier feature vectors identified in a previous iteration are excluded from the clustering operation; and outputting a key cluster for the feature vector set from a final iteration of the multiple iterations, the key cluster including a greater number of similar feature vectors than any of the other clusters of the final iteration; and assembling a dataset that includes at least the feature vectors from the key clusters of the feature vector sets.
-
公开(公告)号:US07844085B2
公开(公告)日:2010-11-30
申请号:US11759460
申请日:2007-06-07
IPC分类号: G06K9/00
CPC分类号: G06K9/00228 , G06K9/6257
摘要: Systems and methods for training an AdaBoost based classifier for detecting symmetric objects, such as human faces, in a digital image. In one example embodiment, such a method includes first selecting a sub-window of a digital image. Next, the AdaBoost based classifier extracts multiple sets of two symmetric scalar features from the sub-window, one being in the right half side and one being in the left half side of the sub-window. Then, the AdaBoost based classifier minimizes the joint error of the two symmetric features for each set of two symmetric scalar features. Next, the AdaBoost based classifier selects one of the features from the set of two symmetric scalar features for each set of two symmetric scalar features. Finally, the AdaBoost based classifier linearly combines multiple weak classifiers, each of which corresponds to one of the selected features, into a stronger classifier.
摘要翻译: 用于训练基于AdaBoost的分类器的系统和方法,用于在数字图像中检测对象对象(例如人脸)。 在一个示例实施例中,这种方法包括首先选择数字图像的子窗口。 接下来,基于AdaBoost的分类器从子窗口中提取多组两个对称标量特征,一组位于子窗口的右半边,一个位于子窗口的左半边。 然后,基于AdaBoost的分类器最小化两组对称标量特征的两个对称特征的联合误差。 接下来,基于AdaBoost的分类器从两组对称标量特征中选择两个对称标量特征集合中的一个特征。 最后,基于AdaBoost的分类器将多个弱分类器线性组合,每个弱分类器对应于所选特征之一,成为更强的分类器。
-
9.
公开(公告)号:US07840037B2
公开(公告)日:2010-11-23
申请号:US11684478
申请日:2007-03-09
申请人: Juwei Lu , Hui Zhou , Mohanaraj Thiyagarajah
发明人: Juwei Lu , Hui Zhou , Mohanaraj Thiyagarajah
CPC分类号: G06K9/00234 , G06K9/00248
摘要: A method and system for efficiently detecting faces within a digital image. One example method includes identifying a digital image comprised of a plurality of sub-windows and performing a first scan of the digital image using a coarse detection level to eliminate the sub-windows that have a low likelihood of representing a face. The subset of the sub-windows that were not eliminated during the first scan are then scanned a second time using a fine detection level having a higher accuracy level than the coarse detection level used during the first scan to identify sub-windows having a high likelihood of representing a face.
摘要翻译: 一种用于有效检测数字图像内的面部的方法和系统。 一个示例性方法包括识别由多个子窗口组成的数字图像,并使用粗略检测水平执行数字图像的第一次扫描,以消除具有低的表示脸部可能性的子窗口。 然后使用具有比在第一次扫描期间使用的粗略检测级别更高的精度水平的精细检测级别来第二次扫描在第一次扫描期间未被消除的子窗口的子集,以识别具有高似然性的子窗口 代表一个脸。
-
10.
公开(公告)号:US11474614B2
公开(公告)日:2022-10-18
申请号:US17085866
申请日:2020-10-30
申请人: Wei Li , Wei Zhou , Sachi Mizobuchi , Ghazaleh Saniee-Monfared , Juwei Lu , Taslim Arefin Khan , Rafael Veras Guimaraes
发明人: Wei Li , Wei Zhou , Sachi Mizobuchi , Ghazaleh Saniee-Monfared , Juwei Lu , Taslim Arefin Khan , Rafael Veras Guimaraes
摘要: Methods, devices, and processor-readable media for adjusting the control-display gain of a gesture-controlled device are described. Adjusting the control-display gain may facilitate user interaction with content or UI elements rendered on a display screen of the gesture-controlled device. The control-display gain may be adjusted based on a property of how a mid-air dragging gesture is being performed by a user's hand. The property may be the location of the gesture, the orientation of the hand performing the gesture, or the velocity of the gesture. A hand that becomes stationary for a threshold time period while performing the dragging gesture may adjust the control-display gain to a different level. Control-display gain may be set to a different value based on the current velocity of the hand performing the gesture. The control-display gain levels may be selected from a continuous range of values or a set of discrete values. Devices for performing the methods are described.
-
-
-
-
-
-
-
-
-