-
Publication No.: US20230419733A1
Publication Date: 2023-12-28
Application No.: US17846770
Filing Date: 2022-06-22
Applicant: Yannick VERDIE, Zi Hao YANG, Deepak SRIDHAR, Juwei LU
Inventor: Yannick VERDIE, Zi Hao YANG, Deepak SRIDHAR, Juwei LU
CPC Classification: G06V40/28, G06V20/64, G06V30/1456, G06T7/246, G06T7/73, G06T2207/30196
Abstract: Methods and devices are described for computer vision-based gesture detection. From a frame of image data, extracted locations of keypoints of a detected hand are obtained. The extracted locations are normalized to obtain normalized features. The normalized features are processed using a trained decision tree ensemble to generate a probability of a valid gesture for the detected hand. The generated probability is compared with a defined decision threshold to generate a binary classification that classifies the detected hand as a valid gesture or an invalid gesture.
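The pipeline in this abstract (normalize keypoints, score them with a tree ensemble, threshold the probability) can be sketched as follows. This is a minimal illustration, not the method claimed in the application: the normalization scheme (translate to the first keypoint, scale by the bounding-box size), the depth-1 "stump" trees, the averaging rule, and every name such as `normalize_keypoints` and `classify_hand` are assumptions made for the example.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
# Hypothetical depth-1 tree: (feature_index, split_value, p_if_below, p_if_above)
Stump = Tuple[int, float, float, float]


def normalize_keypoints(keypoints: Sequence[Point]) -> List[float]:
    """Assumed normalization: shift so the first keypoint is the origin,
    then divide by the larger side of the bounding box."""
    ox, oy = keypoints[0]
    xs = [x for x, _ in keypoints]
    ys = [y for _, y in keypoints]
    # Fall back to 1.0 when all keypoints coincide, to avoid dividing by zero.
    scale = max(max(xs) - min(xs), max(ys) - min(ys)) or 1.0
    features: List[float] = []
    for x, y in keypoints:
        features.extend([(x - ox) / scale, (y - oy) / scale])
    return features


def ensemble_probability(features: Sequence[float], stumps: Sequence[Stump]) -> float:
    """Average the per-tree probabilities of a valid gesture (assumed rule)."""
    votes = [
        p_below if features[index] <= split else p_above
        for index, split, p_below, p_above in stumps
    ]
    return sum(votes) / len(votes)


def classify_hand(
    keypoints: Sequence[Point], stumps: Sequence[Stump], threshold: float = 0.5
) -> bool:
    """Binary decision: True for a valid gesture, False for an invalid one."""
    probability = ensemble_probability(normalize_keypoints(keypoints), stumps)
    return probability >= threshold


# Toy input: three keypoints and two hand-written stumps (not trained values).
toy_keypoints = [(10.0, 10.0), (20.0, 10.0), (20.0, 30.0)]
toy_stumps = [(2, 0.25, 0.1, 0.9), (5, 0.5, 0.2, 0.8)]
is_valid = classify_hand(toy_keypoints, toy_stumps, threshold=0.5)
```

Raising `threshold` trades missed gestures for fewer false detections. In practice the ensemble would be trained on labeled hand poses rather than written by hand.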
-
Publication No.: US20240193866A1
Publication Date: 2024-06-13
Application No.: US18078832
Filing Date: 2022-12-09
Applicant: Yannick VERDIE, Zihao YANG, Deepak SRIDHAR, Steven George MCDONAGH, Juwei LU
Inventor: Yannick VERDIE, Zihao YANG, Deepak SRIDHAR, Steven George MCDONAGH, Juwei LU
Abstract: Methods and systems for estimation of a 3D hand pose are disclosed. A 2D image containing a detected hand is processed using a U-net network to obtain a global feature vector and a heatmap for the keypoints of the hand. Information from the global feature vector and the heatmap is concatenated to obtain a set of input tokens, which are processed using a transformer encoder to obtain a first set of 2D keypoints representing estimated 2D locations of the keypoints in a first view. The first set of 2D keypoints is input as a query to a transformer decoder to obtain a second set of 2D keypoints representing estimated 2D locations of the keypoints in a second view. The first and second sets of 2D keypoints are aggregated to output a set of estimated 3D keypoints.
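The token-construction step in this abstract can be illustrated with a small sketch. The abstract says only that "information from" the global feature vector and the heatmap is concatenated, so the layout used here (one token per keypoint, formed by prepending the global vector to that keypoint's flattened heatmap) is an assumption, as is the helper name `build_tokens`.

```python
from typing import List, Sequence

Heatmap = Sequence[Sequence[float]]  # one 2D grid of scores per keypoint


def build_tokens(
    global_feature: Sequence[float], heatmaps: Sequence[Heatmap]
) -> List[List[float]]:
    """Assumed layout: token_k = global_feature ++ flatten(heatmap_k)."""
    tokens: List[List[float]] = []
    for heatmap in heatmaps:
        # Flatten the grid row by row into a single list of scores.
        flat = [value for row in heatmap for value in row]
        tokens.append(list(global_feature) + flat)
    return tokens


# Toy input: a 2-element global vector and two 2x2 keypoint heatmaps.
toy_global = [1.0, 2.0]
toy_heatmaps = [
    [[0.0, 1.0], [0.0, 0.0]],
    [[0.0, 0.0], [1.0, 0.0]],
]
toy_tokens = build_tokens(toy_global, toy_heatmaps)
```

Each resulting token has length `len(global_feature) + H * W`. In the described system, a transformer encoder would consume these tokens, and that stage is not modeled here.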
-